Systems and methods for spell correction of non-roman characters and words
First Claim
1. A method, comprising:
- receiving an input entry in a first language;
converting the input entry to at least one intermediate entry in an intermediate representation different from the first language;
converting the intermediate entry to at least one possible alternative form of the input entry in the first language;
comparing the input entry to at least one possible alternative form of the input entry to locate a match; and
determining that the input entry is a questionable input entry based on the comparing.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods to process and correct spelling errors for non-Roman based words such as in Chinese, Japanese, and Korean languages using a rule-based classifier and a hidden Markov model are disclosed. The method generally includes converting an input entry in a first language such as Chinese to at least one intermediate entry in an intermediate representation, such as pinyin, different from the first language, converting the intermediate entry to at least one possible alternative spelling or form of the input in the first language, and determining that the input entry is either a correct or questionable input entry when a match between the input entry and all possible alternative spellings to the input entry is or is not located, respectively. The questionable input entry may be classified using, for example, a transformation rule based classifier based on transformation rules generated by a transformation rules generator.
234 Citations
39 Claims
-
1. A method, comprising:
-
receiving an input entry in a first language;
converting the input entry to at least one intermediate entry in an intermediate representation different from the first language;
converting the intermediate entry to at least one possible alternative form of the input entry in the first language;
comparing the input entry to at least one possible alternative form of the input entry to locate a match; and
determining that the input entry is a questionable input entry based on the comparing. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system, comprising:
-
a first converter configured to convert the input in a first language to at least one intermediate entry in an intermediate representation different from the first language;
a second converter configured to convert the intermediate entry to at least one possible alternative spelling of the input in the first language; and
a comparator configured to compare the input entry to at least one possible alternative spelling to locate a match, the comparator further being configured to determine whether the input entry is a questionable input entry based on the comparing. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A computer program product for use in conjunction with a computer system, the computer program product comprising a computer readable storage medium on which are stored instructions executable on a computer processor, the instructions including:
-
receiving an input entry in a first language;
converting the input entry to at least one intermediate entry in an intermediate representation different from the first language;
converting the intermediate entry to at least one possible alternative form of the input entry in the first language;
comparing the input entry to at least one possible alternative form of the input entry to locate a match; and
determining that the input entry is a questionable input entry based on the comparing. - View Dependent Claims (27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39)
-
Specification