METHODS AND SYSTEMS FOR SELECTING A LANGUAGE FOR TEXT SEGMENTATION
First Claim
1. A method, comprising:
identifying at least a first candidate language and a second candidate language associated with a string of characters;
determining at least a first segmented result associated with the first candidate language from the string of characters and a second segmented result associated with the second candidate language from the string of characters;
determining a first frequency of occurrence for the first segmented result and a second frequency of occurrence for the second segmented result; and
identifying an operable language from the first candidate language and the second candidate language based at least in part on the first frequency of occurrence and the second frequency of occurrence.
Abstract
Methods and systems for selecting a language for text segmentation are disclosed. In one embodiment, at least a first candidate language and a second candidate language associated with a string of characters are identified, at least a first segmented result associated with the first candidate language and a second segmented result associated with the second candidate language are determined, a first frequency of occurrence for the first segmented result and a second frequency of occurrence for the second segmented result are determined, and an operable language is identified from the first candidate language and the second candidate language based at least in part on the first frequency of occurrence and the second frequency of occurrence.
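The steps summarized above can be illustrated with a minimal Python sketch. It is not the disclosed implementation: the toy lexicons, the phrase-count table, the fewest-words dictionary segmenter, and the names `LEXICONS`, `PHRASE_COUNTS`, `segment`, and `select_language` are all assumptions introduced here for illustration. The sketch segments the same string once per candidate language, looks up how often each segmented result occurs, and picks the language whose result occurs most often.

```python
from typing import Dict, List, Optional, Set

# Hypothetical toy lexicons: the words each language's segmenter knows.
LEXICONS: Dict[str, Set[str]] = {
    "en": {"leg", "rand", "grand"},
    "fr": {"le", "grand"},
}

# Hypothetical frequency table: how often each segmented phrase occurs
# in some reference corpus. The numbers are invented for this example.
PHRASE_COUNTS: Dict[str, int] = {
    "le grand": 9500,
    "leg rand": 3,
}


def segment(text: str, lexicon: Set[str]) -> Optional[List[str]]:
    """Split text into the fewest lexicon words; None if no full split exists."""
    # best[i] holds the shortest known segmentation of text[:i], if any.
    best: List[Optional[List[str]]] = [None] * (len(text) + 1)
    best[0] = []
    for end in range(1, len(text) + 1):
        for start in range(end):
            prefix = best[start]
            word = text[start:end]
            if prefix is not None and word in lexicon:
                candidate = prefix + [word]
                if best[end] is None or len(candidate) < len(best[end]):
                    best[end] = candidate
    return best[len(text)]


def select_language(
    text: str,
    candidates: List[str],
    lexicons: Dict[str, Set[str]],
    phrase_counts: Dict[str, int],
) -> Optional[str]:
    """Return the candidate language whose segmented result is most frequent."""
    best_lang: Optional[str] = None
    best_freq = -1
    for lang in candidates:
        words = segment(text, lexicons[lang])
        if words is None:
            continue  # this language cannot segment the string at all
        freq = phrase_counts.get(" ".join(words), 0)
        if freq > best_freq:
            best_lang, best_freq = lang, freq
    return best_lang


if __name__ == "__main__":
    print(segment("legrand", LEXICONS["en"]))  # ['leg', 'rand']
    print(segment("legrand", LEXICONS["fr"]))  # ['le', 'grand']
    print(select_language("legrand", ["en", "fr"], LEXICONS, PHRASE_COUNTS))  # fr
```

In this sketch the candidate languages are simply passed in, and ties go to the first candidate listed. How candidates are identified and how frequencies are obtained are left open here, as they are in the summary above.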
21 Claims
1. A method, comprising:
identifying at least a first candidate language and a second candidate language associated with a string of characters;
determining at least a first segmented result associated with the first candidate language from the string of characters and a second segmented result associated with the second candidate language from the string of characters;
determining a first frequency of occurrence for the first segmented result and a second frequency of occurrence for the second segmented result; and
identifying an operable language from the first candidate language and the second candidate language based at least in part on the first frequency of occurrence and the second frequency of occurrence.
Dependent claims: 2-17.
18. A computer-readable medium containing program code, comprising:
program code for identifying at least a first candidate language and a second candidate language associated with a string of characters;
program code for determining at least a first segmented result associated with the first candidate language from the string of characters and a second segmented result associated with the second candidate language from the string of characters;
program code for determining a first frequency of occurrence for the first segmented result and a second frequency of occurrence for the second segmented result; and
program code for identifying an operable language from the first candidate language and the second candidate language based at least in part on the first frequency of occurrence and the second frequency of occurrence.
Dependent claims: 19, 20.
21-35. (canceled)
Specification