Text language detection
First Claim
Patent Images
1. A method of determining the language of a text message that comprises a plurality of words, the method comprising:
- receiving an input text message consisting of a plurality of characters;
dividing the plurality of characters into a series of character groups, each consisting of n characters;
for each character group, except the last in the message, determining respective probability values for selected languages, each probability value being indicative of the probability that the character group concerned is followed by the next character group in the message in the language concerned;
for each language, accumulating the probability values determined for each character group in the message; and
determining the language of the text message on the basis of the accumulated probability values.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of determining the language of a text message received by a mobile telecommunications device indicates receiving an input text message at a mobile telecommunications device; analyzing the input text message using language information stored in the mobile telecommunications device; selecting, from a group of languages defined by the language information, a most likely language for the input text message; and outputting, from the mobile telecommunications device, speech signals corresponding to the input text message, in the selected language.
-
Citations
9 Claims
-
1. A method of determining the language of a text message that comprises a plurality of words, the method comprising:
-
receiving an input text message consisting of a plurality of characters; dividing the plurality of characters into a series of character groups, each consisting of n characters; for each character group, except the last in the message, determining respective probability values for selected languages, each probability value being indicative of the probability that the character group concerned is followed by the next character group in the message in the language concerned; for each language, accumulating the probability values determined for each character group in the message; and determining the language of the text message on the basis of the accumulated probability values. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
Specification