Suggesting and refining user input based on original user input
First Claim
1. A computer-implemented method comprising:
- obtaining a plurality of queries received from a user in a current query session, wherein a most recent query received from the user in the current query session includes a first sequence of terms;
identifying a plurality of second sequences of terms having highest-ranked measures of similarity with the first sequence of terms, the respective measures of similarity being determined between (1) a first feature vector for the first sequence of terms and (2) respective second feature vectors for each of the second sequences of terms, each of the one or more second sequences of terms being a subsequence of the first sequence of terms or being a sequence of which the first sequence of terms is a subsequence, wherein each value of the first feature vector and the respective second feature vectors is based on a count of a corresponding co-occurring term occurring in a corpus adjacent to each respective sequence of terms;
generating a plurality of query suggestions for a particular query received in the current query session, including replacing the first sequence of terms in the most recent query with each of the plurality of highest-ranked second sequences of terms, wherein the first sequence of terms in the most recent query is a subsequence of the second sequence of terms or the second sequence of terms is a subsequence of the first sequence of terms in the most recent query;
determining a respective score for each of the plurality of query suggestions, wherein the score is based on a relevance between each query suggestion and the plurality of queries received from the user in the current query session; and
ranking the query suggestions by the determined scores.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods to generate modified/refined user inputs based on the original user input, such as a search query, are disclosed. The method may be implemented for Roman-based and/or non-Roman based language such as Chinese. The method may generally include receiving an original user input and identifying core terms therein, determining potential alternative inputs by replacing core term(s) in the original input with another term according to a similarity matrix and/or substituting a word sequence in the original input with another word sequence according to an expansion/contraction table where one word sequence is a substring of the other, computing likelihood of each potential alternative input, and selecting most likely alternative inputs according to a predetermined criteria, e.g., likelihood of the alternative input being at least that of the original input. A cache containing pre-computed original user inputs and corresponding alternative inputs may be provided.
56 Citations
21 Claims
-
1. A computer-implemented method comprising:
-
obtaining a plurality of queries received from a user in a current query session, wherein a most recent query received from the user in the current query session includes a first sequence of terms; identifying a plurality of second sequences of terms having highest-ranked measures of similarity with the first sequence of terms, the respective measures of similarity being determined between (1) a first feature vector for the first sequence of terms and (2) respective second feature vectors for each of the second sequences of terms, each of the one or more second sequences of terms being a subsequence of the first sequence of terms or being a sequence of which the first sequence of terms is a subsequence, wherein each value of the first feature vector and the respective second feature vectors is based on a count of a corresponding co-occurring term occurring in a corpus adjacent to each respective sequence of terms; generating a plurality of query suggestions for a particular query received in the current query session, including replacing the first sequence of terms in the most recent query with each of the plurality of highest-ranked second sequences of terms, wherein the first sequence of terms in the most recent query is a subsequence of the second sequence of terms or the second sequence of terms is a subsequence of the first sequence of terms in the most recent query; determining a respective score for each of the plurality of query suggestions, wherein the score is based on a relevance between each query suggestion and the plurality of queries received from the user in the current query session; and ranking the query suggestions by the determined scores. - View Dependent Claims (2, 3, 4, 5, 6, 7, 18)
-
-
8. A system comprising:
-
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; obtaining a plurality of queries received from a user in a current query session, wherein a most recent query received from the user in the current query session includes a first sequence of terms; identifying a plurality of second sequences of terms having highest-ranked measures of similarity with the first sequence of terms, the respective measures of similarity being determined between (1) a first feature vector for the first sequence of terms and (2) respective second feature vectors for each of the second sequences of terms, each of the one or more second sequences of terms being a subsequence of the first sequence of terms or being a sequence of which the first sequence of terms is a subsequence, wherein each value of the first feature vector and the respective second feature vectors is based on a count of a corresponding co-occurring term occurring in a corpus adjacent to each respective sequence of terms; generating a plurality of query suggestions for a particular query received in the current query session, including replacing the first sequence of terms in the most recent query with each of the plurality of highest-ranked second sequences of terms, wherein the first sequence of terms in the most recent query is a subsequence of the second sequence of terms or the second sequence of terms is a subsequence of the first sequence of terms in the most recent query; determining a respective score for each of the plurality of query suggestions, where the score is based on a relevance between each query suggestion and the plurality of queries received from the user in the current query session; and ranking the query suggestions by the determined scores. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer program product, encoded on one or more non-transitory computer storage media, comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
-
obtaining a plurality of queries received from a user in a current query session, wherein a most recent query received from the user in the current query session includes a first sequence of terms; identifying a plurality of second sequences of terms having highest-ranked measures of similarity with the first sequence of terms, the respective measures of similarity being determined between (1) a first feature vector for the first sequence of terms and (2) respective second feature vectors for each of the second sequences of terms, each of the one or more second sequences of terms being a subsequence of the first sequence of terms or being a sequence of which the first sequence of terms is a subsequence, wherein each value of the first feature vector and the respective second feature vectors is based on a count of a corresponding co-occurring term occurring in a corpus adjacent to each respective sequence of terms; generating a plurality of query suggestions for a particular query received in the current query session, including replacing the first sequence of terms in the most recent query with each of the plurality of highest-ranked second sequences of terms, wherein the first sequence of terms in the most recent query is a subsequence of the second sequence of terms or the second sequence of terms is a subsequence of the first sequence of terms in the most recent query; determining a respective score for each of the plurality of query suggestions, where the score is based on a relevance between each query suggestion and the plurality of queries received from the user in the current query session; and ranking the query suggestions by the determined scores. - View Dependent Claims (16, 17, 19, 20, 21)
-
Specification