Estimating confidence for query revision models
First Claim
1. A computer-implemented method comprising:
- obtaining training data including;
for each of one or more original queries;
(i) one or more features of the original query,(ii) one or more features of a revised query which was generated by a query revisor for the original query, and(iii) a label indicating whether the revised query is a successful revision of the original query;
training, using the training data obtained for each of the one or more original queries that includes, for each respective original query, (i) the one or more features of the original query, (ii) the one or more features of the revised query which was generated by the query revisor for the original query, and (iii) the label indicating whether the revised query is a successful revision of the original query, a predictive model for predicting a likelihood that a given revised query is a successful revision of a given original query; and
after receiving a particular original query, (a) using the predictive model to predict a likelihood that a particular revised query is a successful revision of the particular original query, and (b) determining whether to submit the particular revised search query to a search engine based on the likelihood.
2 Assignments
0 Petitions
Accused Products
Abstract
An information retrieval system includes a query revision architecture that integrates multiple different query revisers, each implementing one or more query revision strategies. A revision server receives a user'"'"'s query, and interfaces with the various query revisers, each of which generates one or more potential revised queries. The revision server evaluates the potential revised queries, and selects one or more of them to provide to the user. A session-based reviser suggests one or more revised queries, given a first query, by calculating an expected utility for the revised query. The expected utility is calculated as the product of a frequency of occurrence of the query pair and an increase in quality of the revised query over the first query.
100 Citations
20 Claims
-
1. A computer-implemented method comprising:
- obtaining training data including;
for each of one or more original queries;(i) one or more features of the original query, (ii) one or more features of a revised query which was generated by a query revisor for the original query, and (iii) a label indicating whether the revised query is a successful revision of the original query; training, using the training data obtained for each of the one or more original queries that includes, for each respective original query, (i) the one or more features of the original query, (ii) the one or more features of the revised query which was generated by the query revisor for the original query, and (iii) the label indicating whether the revised query is a successful revision of the original query, a predictive model for predicting a likelihood that a given revised query is a successful revision of a given original query; and after receiving a particular original query, (a) using the predictive model to predict a likelihood that a particular revised query is a successful revision of the particular original query, and (b) determining whether to submit the particular revised search query to a search engine based on the likelihood. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
- obtaining training data including;
-
16. A system comprising:
-
one or more computers; and a non-transitory computer-readable medium coupled to the one or more computers having instructions stored thereon, which, when executed by the one or more computers, cause the one or more computers to perform operations comprising; obtaining training data including, for each of one or more original queries; (i) one or more features of the original query, (ii) one or more features of a revised query which was generated by a query revisor for the original query, and (iii) a label indicating whether the revised query is a successful revision of the original query; training, using the training data obtained for each of the one or more original queries that includes, for each respective original query, (i) the one or more features of the original query, (ii) the one or more features of the revised query which was generated by the query revisor for the original query, and (iii) the label indicating whether the revised query is a successful revision of the original query, a predictive model for predicting a likelihood that a given revised query is a successful revision of a given original query; and after receiving a particular original query, (a) using the predictive model to predict a likelihood that a particular revised query is a successful revision of the particular original query, and (b) determining whether to submit the particular revised search query to a search engine based on the likelihood. - View Dependent Claims (17, 18)
-
-
19. A non-transitory computer-readable storage medium encoded with a computer program, the program comprising instructions that when executed by a data processing apparatus cause the data processing apparatus to perform operations comprising:
-
obtaining training data including, for each of one or more original queries; (i) one or more features of the original query, (ii) one or more features of a revised query which was generated by a query revisor for the original query, and (iii) a label indicating whether the revised query is a successful revision of the original query; training, using the training data obtained for each of the one or more original queries that includes, for each respective original query, (i) the one or more features of the original query, (ii) the one or more features of the revised query which was generated by the query revisor for the original query, and (iii) the label indicating whether the revised query is a successful revision of the original query, a predictive model for predicting a likelihood that a given revised query is a successful revision of a given original query; and after receiving a particular original query, (a) using the predictive model to predict a likelihood that a particular revised query is a successful revision of the particular original query, and (b) determining whether to submit the particular revised search query to a search engine based on the likelihood. - View Dependent Claims (20)
-
Specification