Estimating confidence for query revision models
First Claim
1. A method for scoring likelihood of success of revised queries suggested for an original query, the method comprising:
- maintaining log files of user clicks on results provided in response to the revised queries suggested for the original query, the log files including data representing features associated with the original query and the revised queries, and a respective feature value for each feature;
selecting a condition, wherein the condition specifies one or more feature values for a corresponding one or more features;
selecting the revised queries that, for a particular feature, are associated with a feature value that matches one or more of the feature values specified by the condition for the particular feature;
collecting click data for the selected revised queries from the log files, the click data including click length data, wherein a long click on a result for a particular revised query in the log files is taken as indicating a success of the particular revised query;
analyzing the click data for the selected revised queries to determine a weight for the condition;
formulating a rule by one or more computers, wherein the rule includes the condition and the weight; and
adding the rule to a predictive model, wherein the model includes a set of rules that contribute to a prediction of a respective likelihood of success for the revised queries.
2 Assignments
0 Petitions
Accused Products
Abstract
An information retrieval system includes a query revision architecture that integrates multiple different query revisers, each implementing one or more query revision strategies. A revision server receives a user'"'"'s query, and interfaces with the various query revisers, each of which generates one or more potential revised queries. The revision server evaluates the potential revised queries, and selects one or more of them to provide to the user. A session-based reviser suggests one or more revised queries, given a first query, by calculating an expected utility for the revised query. The expected utility is calculated as the product of a frequency of occurrence of the query pair and an increase in quality of the revised query over the first query.
-
Citations
24 Claims
-
1. A method for scoring likelihood of success of revised queries suggested for an original query, the method comprising:
-
maintaining log files of user clicks on results provided in response to the revised queries suggested for the original query, the log files including data representing features associated with the original query and the revised queries, and a respective feature value for each feature; selecting a condition, wherein the condition specifies one or more feature values for a corresponding one or more features; selecting the revised queries that, for a particular feature, are associated with a feature value that matches one or more of the feature values specified by the condition for the particular feature; collecting click data for the selected revised queries from the log files, the click data including click length data, wherein a long click on a result for a particular revised query in the log files is taken as indicating a success of the particular revised query; analyzing the click data for the selected revised queries to determine a weight for the condition; formulating a rule by one or more computers, wherein the rule includes the condition and the weight; and adding the rule to a predictive model, wherein the model includes a set of rules that contribute to a prediction of a respective likelihood of success for the revised queries. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system for scoring likelihood of success of revised queries suggested for an original query, comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; maintaining log files of user clicks on results provided in response to the revised queries suggested for the original query, the log files including data representing features associated with the original query and the revised queries, and a respective feature value for each feature; selecting a condition, wherein the condition specifies one or more feature values for a corresponding one or more features; selecting the revised queries that, for a particular feature, are associated with a feature value that matches one or more of the feature values specified by the condition for the particular feature; collecting click data for the selected revised queries from the log files, the click data including click length data, wherein a long click on a result for a particular revised query in the log files is taken as indicating a success of the particular revised query; analyzing the click data for the selected revised queries to determine a weight for the condition; formulating a rule, wherein the rule includes the condition and the weight; and adding the rule to a predictive model, wherein the model includes a set of rules that contribute to a prediction of a respective likelihood of success for the revised queries. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
17. A non-transitory computer-readable medium storing software for scoring likelihood of success of revised queries suggested for an original query, the software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
maintaining log files of user clicks on results provided in response to the revised queries suggested for the original query, the log files including data representing features associated with the original query and the revised queries, and a respective feature value for each feature; selecting a condition, wherein the condition specifies one or more feature values for a corresponding one or more features; selecting the revised queries that, for a particular feature, are associated with a feature value that matches one or more of the feature values specified by the condition for the particular feature; collecting click data for the selected revised queries from the log files, the click data including click length data, wherein a long click on a result for a particular revised query in the log files is taken as indicating a success of the particular revised query; analyzing the click data for the selected revised queries to determine a weight for the condition; formulating a rule, wherein the rule includes the condition and the weight; and adding the rule to a predictive model, wherein the model includes a set of rules that contribute to a prediction of a respective likelihood of success for the revised queries. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification