Determining proximity measurements indicating respective intended inputs
First Claim
1. A computer-implemented method for determining proximity measurements indicating respective intended inputs for corresponding original inputs, the method comprising:
- receiving Pinyin inputs;
converting the Pinyin inputs to corresponding candidates, each of the corresponding candidates comprising one or more Hanzi characters;
receiving rates of user selections for each of the corresponding candidates, the rates indicating how often users select each of the corresponding candidates;
identifying, from the received rates, Pinyin inputs converted to candidates having low rates of user selection as non-selected Pinyin inputs;
identifying, from the received rates, Pinyin inputs converted to candidates having higher rates of user selection than the non-selected Pinyin inputs, as intended Pinyin inputs;
comparing the intended Pinyin inputs to the non-selected Pinyin inputs to identify one or more-non-selected Pinyin input and intended Pinyin input pairs;
for each non-selected Pinyin input, determining a number of times that users did not select from the corresponding candidates and a number of times the non-selected Pinyin input was entered as input by users; and
generating a proximity measurement for each-non-selected Pinyin input and intended Pinyin input pair based on a ratio of the number of times the corresponding candidates were not selected by users to the number of times the non-selected Pinyin input was entered as input by users.
2 Assignments
0 Petitions
Accused Products
Abstract
Determination of proximity measurements indicative of respective intended inputs are disclosed. User inputs are received, where each user input is one of a predefined plurality of inputs that each map to multiple characters in a language. Rates of user selections of candidates decoded from the user inputs into the language are received, where each of the candidates includes one or more characters in the language. User inputs for the candidates having low rates of selection as non-selected user inputs are identified. User inputs for the candidates having high rates of selection as intended inputs are identified. The intended user inputs to the non-selected user inputs are compared to identify one or more misspelled input and intended input pairs. A proximity measurement for each misspelled input and intended input pair is determined based on a ratio of the number of times corresponding candidates for the misspelled input were not selected to the number of times the misspelled input was entered.
19 Citations
10 Claims
-
1. A computer-implemented method for determining proximity measurements indicating respective intended inputs for corresponding original inputs, the method comprising:
-
receiving Pinyin inputs; converting the Pinyin inputs to corresponding candidates, each of the corresponding candidates comprising one or more Hanzi characters; receiving rates of user selections for each of the corresponding candidates, the rates indicating how often users select each of the corresponding candidates; identifying, from the received rates, Pinyin inputs converted to candidates having low rates of user selection as non-selected Pinyin inputs; identifying, from the received rates, Pinyin inputs converted to candidates having higher rates of user selection than the non-selected Pinyin inputs, as intended Pinyin inputs; comparing the intended Pinyin inputs to the non-selected Pinyin inputs to identify one or more-non-selected Pinyin input and intended Pinyin input pairs; for each non-selected Pinyin input, determining a number of times that users did not select from the corresponding candidates and a number of times the non-selected Pinyin input was entered as input by users; and generating a proximity measurement for each-non-selected Pinyin input and intended Pinyin input pair based on a ratio of the number of times the corresponding candidates were not selected by users to the number of times the non-selected Pinyin input was entered as input by users. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A non-transitory computer storage medium encoded with a computer program, the program comprising instructions that when executed by data processing apparatus cause the data processing apparatus to perform operations comprising:
-
receiving Pinyin inputs; converting the Pinyin inputs to corresponding candidates, each of the corresponding candidates comprising one or more Hanzi characters; receiving rates of user selections for each of the corresponding candidates, the rates indicating how often users select each of the corresponding candidates; identifying, from the received rates, Pinyin inputs converted to candidates having low rates of user selection as non-selected Pinyin inputs; identifying, from the received rates, Pinyin inputs converted to candidates having higher rates of user selection than the non-selected Pinyin inputs, as intended Pinyin inputs; comparing the intended Pinyin inputs to the non-selected Pinyin inputs to identify one or more-non-selected Pinyin input and intended Pinyin input pairs; for each non-selected Pinyin input, determining a number of times that users did not select from the corresponding candidates and a number of times the non-selected Pinyin input was entered as input by users; and generating a proximity measurement for each-non-selected Pinyin input and intended Pinyin input pair based on a ratio of the number of times the corresponding candidates were not selected by users to the number of times the non-selected Pinyin input was entered as input by users. - View Dependent Claims (7, 8, 9, 10)
-
Specification