Fault-tolerant romanized input method for non-roman characters
First Claim
1. A computer-implemented method for identifying most likely conversions of original inputs that include potentially incorrect original inputs, the method comprising:
- receiving an original sequence of original Pinyin inputs, each original Pinyin input having associated proximity measurements, each proximity measurement being indicative of a probability of a respective intended Pinyin input being entered as the original Pinyin input;
identifying each original Pinyin input that has proximity measurements indicating one or more non-zero probabilities for respective intended Pinyin inputs that are different from the original Pinyin input as a potentially incorrect original Pinyin input;
generating alternative sequences of Pinyin inputs from the original sequence of original Pinyin inputs, each alternative sequence of Pinyin inputs including at least one intended Pinyin input that is different from a potentially incorrect original Pinyin input in place of the potentially incorrect original Pinyin input;
determining a corresponding conversion to Hanzi characters for the original sequence of original Pinyin inputs and for each alternative sequence of Pinyin inputs;
associating with each conversion a likelihood, each likelihood being based on the proximity measurements associated with the intended Pinyin inputs included in the corresponding sequences of Pinyin inputs;
identifying most likely conversions for presentation, the most likely conversions including;
a first conversion to first Hanzi characters converted from the original sequence of original Pinyin inputs, the first conversion having a first likelihood; and
a second conversion to second Hanzi characters converted from one of the alternative sequences of Pinyin inputs and having a second likelihood that exceeds the first likelihood by a predetermined threshold.
2 Assignments
0 Petitions
Accused Products
Abstract
Fault-tolerant systems and methods to process and correct input errors are disclosed. Original inputs can be received, each original input being defined as mapping to multiple characters in a language, each original input having one or more associated proximity measurements. Original inputs that have proximity measurements with one or more non-zero probabilities for respective intended inputs are identified as potentially incorrect original inputs. Each of the potentially incorrect original inputs is expanded to the respective intended inputs, and each of the original inputs and each of the respective intended inputs are converted to characters in the language, each of the conversions being a combination of the converted original inputs and/or the respective intended inputs. Likelihoods of conversions of the original inputs and the respective intended inputs to the characters based in part on the proximity measurements are determined, and most likely conversions are identified for presentation.
18 Citations
10 Claims
-
1. A computer-implemented method for identifying most likely conversions of original inputs that include potentially incorrect original inputs, the method comprising:
-
receiving an original sequence of original Pinyin inputs, each original Pinyin input having associated proximity measurements, each proximity measurement being indicative of a probability of a respective intended Pinyin input being entered as the original Pinyin input; identifying each original Pinyin input that has proximity measurements indicating one or more non-zero probabilities for respective intended Pinyin inputs that are different from the original Pinyin input as a potentially incorrect original Pinyin input; generating alternative sequences of Pinyin inputs from the original sequence of original Pinyin inputs, each alternative sequence of Pinyin inputs including at least one intended Pinyin input that is different from a potentially incorrect original Pinyin input in place of the potentially incorrect original Pinyin input; determining a corresponding conversion to Hanzi characters for the original sequence of original Pinyin inputs and for each alternative sequence of Pinyin inputs; associating with each conversion a likelihood, each likelihood being based on the proximity measurements associated with the intended Pinyin inputs included in the corresponding sequences of Pinyin inputs; identifying most likely conversions for presentation, the most likely conversions including; a first conversion to first Hanzi characters converted from the original sequence of original Pinyin inputs, the first conversion having a first likelihood; and a second conversion to second Hanzi characters converted from one of the alternative sequences of Pinyin inputs and having a second likelihood that exceeds the first likelihood by a predetermined threshold. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer program product for use in conjunction with a computer system, the computer program product comprising a computer readable storage medium on which are stored instructions executable on a computer processor, and which upon such execution cause the computer processor to perform the operations of:
-
receiving an original sequence of original Pinyin inputs, each original Pinyin input having associated proximity measurements, each proximity measurement being indicative of a probability of a respective intended Pinyin input being entered as the original Pinyin input; identifying each original Pinyin input that has proximity measurements indicating one or more non-zero probabilities for respective intended Pinyin inputs that are different from the original Pinyin input as a potentially incorrect original Pinyin input; generating alternative sequences of Pinyin inputs from the original sequence of original Pinyin inputs, each alternative sequence of Pinyin inputs including at least one intended Pinyin input that is different from a potentially incorrect original Pinyin input in place of the potentially incorrect original Pinyin input; determining a corresponding conversion to Hanzi characters for the original sequence of original Pinyin inputs and for each alternative sequence of Pinyin inputs; associating with each conversion a likelihood, each likelihood being based on the proximity measurements associated with the intended Pinyin inputs included in the corresponding sequences of Pinyin inputs; identifying most likely conversions for presentation; identifying each most likely conversion that is a conversion of an alternative sequence of Pinyin inputs for presentation in a first presentation style; and identifying each most likely conversion that is a conversion of the original sequence of original Pinyin inputs for presentation in a second presentation style, the second presentation style being different form the first presentation style. - View Dependent Claims (7, 8, 9, 10)
-
Specification