USER INPUT CLASSIFICATION
First Claim
Patent Images
1. A method of classifying a user input, comprising:
- receiving a user input of Roman characters;
determining that the user input includes an ambiguous word, the ambiguous word being a non-pinyin word written in the Roman characters and also being a valid pinyin input of one or more pinyin syllables;
in response to determining that the user input includes an ambiguous word;
classifying one or more words adjacent to the ambiguous word as belonging to a pinyin context or belonging to a non-pinyin context;
classifying the ambiguous word as a pinyin input if the one or more words adjacent to the ambiguous word are classified as belonging to the pinyin context; and
classifying the ambiguous word as a non-pinyin word if the one or more words adjacent to the ambiguous word are classified as belonging to the non-pinyin context.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods of classifying user input are disclosed. The user input can be, for example, in the form of Roman characters. An ambiguous word (e.g., a word that is a non-pinyin word written in Roman characters and a valid pinyin word) can be identified in the user input. Contextual words (e.g., words adjacent to the ambiguous word) are classified as a pinyin context or a non-pinyin context. The ambiguous word is classified based on the context of the contextual words.
288 Citations
23 Claims
-
1. A method of classifying a user input, comprising:
-
receiving a user input of Roman characters; determining that the user input includes an ambiguous word, the ambiguous word being a non-pinyin word written in the Roman characters and also being a valid pinyin input of one or more pinyin syllables; in response to determining that the user input includes an ambiguous word; classifying one or more words adjacent to the ambiguous word as belonging to a pinyin context or belonging to a non-pinyin context; classifying the ambiguous word as a pinyin input if the one or more words adjacent to the ambiguous word are classified as belonging to the pinyin context; and classifying the ambiguous word as a non-pinyin word if the one or more words adjacent to the ambiguous word are classified as belonging to the non-pinyin context. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for classifying a user input, comprising:
-
a user interface to receive a user input of Roman characters and to provide possible Chinese queries that correspond to the user input; and a pinyin classifier configured to receive the user input from the user interface and determine whether the user input includes an ambiguous word, the ambiguous word being a non-pinyin word written in the Roman characters and also being a valid pinyin input of one or more pinyin syllables, the pinyin classifier further configured to classify one or more words adjacent to the ambiguous word as belonging to a pinyin context or belonging to a non-pinyin context and classify the ambiguous word based on the classification of the one or more adjacent words. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A computer-implemented method, comprising:
-
receiving a user input of Roman characters; identifying pinyin candidates based on the user input, each pinyin candidate comprising a portion of the user input; determining that the user input includes an ambiguous word, the ambiguous word being a non-pinyin word written in the Roman characters and also being a valid pinyin input of one or more pinyin syllables; classifying the ambiguous word as a pinyin candidate or a non-pinyin word based on a classification of one or more contextual words that are adjacent to the ambiguous word; providing the pinyin candidates to a pinyin decoder, the pinyin decoder including an updated Chinese dictionary that includes new valid words identified based on an information gain value; receiving possible Chinese word strings that correspond to the pinyin candidates from the pinyin decoder, each possible Chinese word string having a weight that indicates a probability that the possible Chinese word string corresponds to the user input; and providing a predefined number of possible Chinese word strings to a user device based on the weights. - View Dependent Claims (19, 20, 21, 22, 23)
-
Specification