×

Text language detection

  • US 7,035,801 B2
  • Filed: 09/05/2001
  • Issued: 04/25/2006
  • Est. Priority Date: 09/06/2000
  • Status: Active Grant
First Claim
Patent Images

1. A method of determining the language of a text message that comprises a plurality of words, the method comprising:

  • receiving an input text message consisting of a plurality of characters;

    dividing the plurality of characters into a series of character groups, each consisting of n characters;

    for each character group, except the last in the message, determining respective probability values for selected languages, each probability value being indicative of the probability that the character group concerned is followed by the next character group in the message in the language concerned;

    for each language, accumulating the probability values determined for each character group in the message; and

    determining the language of the text message on the basis of the accumulated probability values.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×