×

Transliterating semitic languages including diacritics

  • US 8,612,206 B2
  • Filed: 12/08/2009
  • Issued: 12/17/2013
  • Est. Priority Date: 12/08/2009
  • Status: Active Grant
First Claim
Patent Images

1. A method of transliterating text, the method comprising:

  • receiving a Romanized input text of a natural language that comprises a native character set that includes diacritics;

    setting a threshold probability;

    selecting at least one candidate transliteration rule in response to determining that a probability that the at least one candidate transliteration rule should apply is at least equal to the threshold probability;

    applying each selected candidate transliteration rule to the Romanized input text to transliterate the Romanized input text into at least one corresponding candidate diacritized text in the native character set of the natural language;

    computing a confidence score for each candidate diacritized text;

    ranking each candidate diacritized text based at least on the computed confidence scores; and

    outputting at least one candidate diacritized text based at least on the ranking.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×