Suggesting and refining user input based on original user input
Abstract
Systems and methods for generating modified or refined user inputs based on an original user input, such as a search query, are disclosed. The method may be implemented for Roman-based and/or non-Roman-based languages such as Chinese. The method may generally include: receiving an original user input and identifying core terms therein; determining potential alternative inputs by replacing one or more core terms in the original input with another term according to a similarity matrix and/or by substituting a word sequence in the original input with another word sequence according to an expansion/contraction table, where one word sequence is a substring of the other; computing the likelihood of each potential alternative input; and selecting the most likely alternative inputs according to predetermined criteria, e.g., the likelihood of the alternative input being at least that of the original input. A cache containing pre-computed original user inputs and corresponding alternative inputs may be provided.
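The selection step and the cache described in the abstract can be illustrated with a minimal sketch. It is not taken from the specification: the function name `select_alternatives`, the `likelihood` callable, the dictionary cache, and the toy likelihood values are all assumptions made for illustration. The criterion shown is the one the abstract gives as an example, namely keeping alternatives whose likelihood is at least that of the original input.

```python
from typing import Callable, Dict, Iterable, List


def select_alternatives(
    original: str,
    candidates: Iterable[str],
    likelihood: Callable[[str], float],  # hypothetical scoring function
    cache: Dict[str, List[str]],         # hypothetical cache: original -> alternatives
) -> List[str]:
    """Keep candidates at least as likely as the original, most likely first."""
    # Serve pre-computed results when the original input has been seen before.
    if original in cache:
        return cache[original]
    baseline = likelihood(original)
    kept = [c for c in candidates if likelihood(c) >= baseline]
    kept.sort(key=likelihood, reverse=True)
    cache[original] = kept
    return kept


# Toy likelihood values, invented purely for this example.
EXAMPLE_LIKELIHOODS = {
    "cheap flights": 0.2,
    "inexpensive flights": 0.5,
    "cheap flight": 0.1,
    "budget flights": 0.2,
}


def example_likelihood(query: str) -> float:
    return EXAMPLE_LIKELIHOODS.get(query, 0.0)
```

With these toy values, calling `select_alternatives("cheap flights", ["inexpensive flights", "cheap flight", "budget flights"], example_likelihood, {})` keeps the two candidates whose likelihood is at least 0.2 and drops the third. A real system would presumably derive the likelihood from query logs or a language model rather than a fixed table.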
28 Claims
1. A computer-implemented method, comprising:
receiving an original query;
generating a first feature vector for a first term in the original query;
generating a respective feature vector for each of one or more different terms in a collection of terms;
associating a respective similarity value with each of the one or more different terms, wherein the similarity value is based at least in part on a similarity measure between the first feature vector for the first term and a respective feature vector for each of the one or more different terms;
identifying one or more similar terms from the one or more different terms based on the respective similarity values associated with each of the one or more different terms;
generating an alternative query for each of the one or more identified similar terms by substituting the first term in the original query with a respective identified similar term;
computing a score for each alternative query based on the similarity value associated with an identified similar term in the respective alternative query; and
identifying one or more of the alternative queries as a query suggestion for the original query based at least in part on the computed score for each alternative query.
Dependent claims: 2, 3, 4, 5, 6, 7
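The steps recited in claim 1 can be sketched in a few lines of code. This is an illustration under stated assumptions, not the patented implementation: the claim does not name a similarity measure, so cosine similarity over sparse vectors is used as one plausible choice, and the names `cosine`, `suggest`, `min_similarity`, `top_k`, and the example vectors are all hypothetical.

```python
import math
from typing import Dict, List, Tuple

Vector = Dict[str, float]  # sparse feature vector: feature name -> weight


def cosine(u: Vector, v: Vector) -> float:
    """Cosine similarity (assumed measure; the claim does not specify one)."""
    dot = sum(w * v.get(k, 0.0) for k, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def suggest(
    query: List[str],
    position: int,               # index of the "first term" to substitute
    vectors: Dict[str, Vector],  # feature vectors for the collection of terms
    min_similarity: float = 0.5,  # hypothetical threshold for "similar"
    top_k: int = 3,               # hypothetical cap on suggestions
) -> List[Tuple[str, float]]:
    first_term = query[position]
    first_vec = vectors[first_term]
    # Associate a similarity value with each different term in the collection.
    similarities = {
        term: cosine(first_vec, vec)
        for term, vec in vectors.items()
        if term != first_term
    }
    # Identify similar terms based on their similarity values.
    similar = {t: s for t, s in similarities.items() if s >= min_similarity}
    # Generate one alternative query per similar term; here the score is
    # simply the similarity value of the substituted term.
    scored = [
        (" ".join(query[:position] + [term] + query[position + 1:]), sim)
        for term, sim in similar.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Invented feature vectors, e.g. counts of co-occurring context words.
EXAMPLE_VECTORS: Dict[str, Vector] = {
    "cheap": {"price": 3.0, "buy": 2.0},
    "inexpensive": {"price": 3.0, "buy": 2.0},
    "affordable": {"price": 2.0, "buy": 1.0, "quality": 1.0},
    "fast": {"speed": 4.0},
}
```

For the query `["cheap", "flights"]` with `position=0`, the sketch ranks "inexpensive flights" ahead of "affordable flights" and drops "fast flights", whose vector shares no features with that of "cheap".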
8. A system, comprising:
a server device configured to receive an original query and to perform operations including:
generating a first feature vector for a first term in the original query;
generating a respective feature vector for each of one or more different terms in a collection of terms;
associating a respective similarity value with each of the one or more different terms, wherein the similarity value is based at least in part on a similarity measure between the first feature vector for the first term and a respective feature vector for each of the one or more different terms;
identifying one or more similar terms from the one or more different terms based on the respective similarity values associated with each of the one or more different terms;
generating an alternative query for each of the one or more identified similar terms by substituting the first term in the original query with a respective identified similar term;
computing a score for each alternative query based on the similarity value associated with an identified similar term in the respective alternative query; and
identifying one or more of the alternative queries as a query suggestion for the original query based at least in part on the computed score for each alternative query.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A computer program product for use in conjunction with a computer system, the computer program product comprising a computer readable storage device on which are stored instructions executable on a computer processor, the instructions comprising:
receiving an original query;
generating a first feature vector for a first term in the original query;
generating a respective feature vector for each of one or more different terms in a collection of terms;
associating a respective similarity value with each of the one or more different terms, wherein the similarity value is based at least in part on a similarity measure between the first feature vector for the first term and a respective feature vector for each of the one or more different terms;
identifying one or more similar terms from the one or more different terms based on the respective similarity values associated with each of the one or more different terms;
generating an alternative query for each of the one or more identified similar terms by substituting the first term in the original query with a respective identified similar term;
computing a score for each alternative query based on the similarity value associated with an identified similar term in the respective alternative query; and
identifying one or more of the alternative queries as a query suggestion for the original query based at least in part on the computed score for each alternative query.
Dependent claims: 16, 17, 18, 19
20. A computer-implemented method, comprising:
receiving an original query;
identifying a first compound comprising a first sequence of one or more terms in the original query;
identifying a second compound comprising a second different sequence of one or more terms, wherein the second compound is an expansion or a contraction of the first compound;
generating an alternative query by substituting the first compound in the original query with the second compound identified as an expansion or a contraction of the first compound;
computing a score for the alternative query based at least in part on a relevance between the alternative query and a history of one or more previously received queries; and
identifying the alternative query as a query suggestion for the original query based at least in part on the computed score for the alternative query.
Dependent claims: 21, 22
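The compound-substitution steps of claim 20 can be sketched as follows. The sketch rests on several assumptions that the claim itself does not fix: the expansion/contraction table is modeled as a plain dictionary, "relevance" to the query history is approximated by the alternative query's relative frequency in a query log, and the names `find_compounds`, `suggest_by_compound`, and the example data are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterator, List, Tuple


def find_compounds(
    terms: List[str], table: Dict[str, List[str]]
) -> Iterator[Tuple[int, int, str]]:
    """Yield (start, end, replacement) for every table compound in the query."""
    for start in range(len(terms)):
        for end in range(start + 1, len(terms) + 1):
            compound = " ".join(terms[start:end])
            for replacement in table.get(compound, []):
                yield start, end, replacement


def suggest_by_compound(
    query: str,
    table: Dict[str, List[str]],  # hypothetical expansion/contraction table
    history: Counter,             # hypothetical log: past query -> count
) -> List[Tuple[str, float]]:
    terms = query.split()
    total = sum(history.values())
    scored = []
    for start, end, replacement in find_compounds(terms, table):
        # Substitute the first compound with its expansion or contraction.
        alternative = " ".join(terms[:start] + replacement.split() + terms[end:])
        # Assumed relevance measure: share of past queries equal to the alternative.
        score = history.get(alternative, 0) / total if total else 0.0
        scored.append((alternative, score))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Invented table: each pair has one word sequence contained in the other.
EXAMPLE_TABLE = {
    "notebook": ["notebook computer"],  # expansion
    "laptop computer": ["laptop"],      # contraction
}
EXAMPLE_HISTORY = Counter(
    {"notebook computer prices": 30, "notebook prices": 10, "laptop deals": 60}
)
```

With this data, "notebook prices" yields the expanded alternative "notebook computer prices", and "laptop computer deals" yields the contracted alternative "laptop deals". A threshold on the score, or a comparison against the score of the original query, could then decide which alternatives are offered as suggestions.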
23. A computer program product for use in conjunction with a computer system, the computer program product comprising a computer readable storage medium on which are stored instructions executable on a computer processor, the instructions including:
receiving an original query;
identifying a first compound comprising a first sequence of one or more terms in the original query;
identifying a second compound comprising a second different sequence of one or more terms, wherein the second compound is an expansion or a contraction of the first compound;
generating an alternative query by substituting the first compound in the original query with the second compound identified as an expansion or a contraction of the first compound;
computing a score for the alternative query based at least in part on a relevance between the alternative query and a history of one or more previously received queries; and
identifying the alternative query as a query suggestion for the original query based at least in part on the computed score for the alternative query.
Dependent claims: 27, 28
24. A system, comprising:
a server configured to receive an original query and to perform operations including:
identifying a first compound comprising a first sequence of one or more terms in the original query;
identifying a second compound comprising a second different sequence of one or more terms, wherein the second compound is an expansion or a contraction of the first compound;
generating an alternative query by substituting the first compound in the original query with the second compound identified as an expansion or a contraction of the first compound;
computing a score for the alternative query based at least in part on a relevance between the alternative query and a history of one or more previously received queries; and
identifying the alternative query as a query suggestion for the original query based at least in part on the computed score for the alternative query.
Dependent claims: 25, 26
Specification