Ranking search results using click-based data
First Claim
1. One or more computer-readable media storing computer-useable instructions that, when used by one or more computing devices, causes the one or more computing devices to perform a method of generating a machine-learned model for ranking search results using click-based data comprising:
- referencing data from one or more user queries, wherein the data is referenced from one or more of a general search engine and a vertical search engine;
generating a training set of data, wherein the training set comprises one or more search results extracted from the data;
associating one or more click-based judgments with each of the one or more search results in the training set, wherein associating one or more click-based judgments with each of the one or more search results in the training set comprises;
(1) determining that a plurality of queries are tail queries, wherein a tail query satisfies a threshold for the data referenced from one or more queries;
(2) aggregating the search results of the plurality of tail queries into one or more classes of tail queries, such that the training set comprises at least one class of tail queries having a plurality of tail queries; and
(3) associating one or more click-based judgments with each of the one or more classes of tail queries in the training set;
based on associating one or more click-based judgments with each of the one or more search results in the training set, determining one or more identifiable features from the training set; and
based on determining one or more identifiable features, generating a rule set for ranking subsequent search results of one or more user queries.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and computer-storage media having computer-executable instructions embodied thereon that facilitate generating a machine-learned model for ranking search results using click-based data are provided. Data is referenced from user queries, which may include search results generated by general search engines and vertical search engines. A training set is generated from the search results and click-based judgments are associated with the search results in the training set. Based on click-based judgments, identifiable features are determined from the search results in a training set. Based on determining identifiable features in a training set, a rule set is generated for ranking subsequent search results.
18 Citations
18 Claims
-
1. One or more computer-readable media storing computer-useable instructions that, when used by one or more computing devices, causes the one or more computing devices to perform a method of generating a machine-learned model for ranking search results using click-based data comprising:
-
referencing data from one or more user queries, wherein the data is referenced from one or more of a general search engine and a vertical search engine; generating a training set of data, wherein the training set comprises one or more search results extracted from the data; associating one or more click-based judgments with each of the one or more search results in the training set, wherein associating one or more click-based judgments with each of the one or more search results in the training set comprises; (1) determining that a plurality of queries are tail queries, wherein a tail query satisfies a threshold for the data referenced from one or more queries; (2) aggregating the search results of the plurality of tail queries into one or more classes of tail queries, such that the training set comprises at least one class of tail queries having a plurality of tail queries; and (3) associating one or more click-based judgments with each of the one or more classes of tail queries in the training set; based on associating one or more click-based judgments with each of the one or more search results in the training set, determining one or more identifiable features from the training set; and based on determining one or more identifiable features, generating a rule set for ranking subsequent search results of one or more user queries. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method performed by one or more server devices for ranking search results, the method comprising:
-
referencing data from one or more user queries, wherein the data comprises one or more of general search engine results and vertical search engine results; generating a training set, wherein the training set comprises one or more search results extracted from the data; associating one or more click-based judgments with each of the one or more search results in the training set, wherein associating one or more click-based judgments with each of the one or more search results in the training set comprises; (1) determining that a plurality user queries are tail queries, wherein a tail query satisfies a threshold for the data referenced from one or more user queries; (2) aggregating the search results of plurality of tail queries into one or more classes of tail queries, such that the training set comprises at least one class of tail queries having a plurality of tail queries; and (3) associating one or more click-based judgments with each of the one or more classes of tail queries in the training set; determining one or more identifiable features from the training set based on the associated one or more click-based judgments; and based on determining one or more identifiable features from the training set, generating a rule set for ranking search results. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. One or more computer-readable media storing computer-useable instructions that, when used by one or more computing devices, causes the one or more computing devices to perform a method for ranking search results using click-based data comprising:
-
referencing data from one or more user queries, wherein the data is referenced from one or more of general search engines and vertical search engines; generating a training set of data, wherein the training set comprises a plurality of search results from the data, further wherein the plurality of search results comprises one or more of a general search engine result and a vertical search engine result; associating one or more click-based judgments with the plurality of search results in the training set based on; (1) determining whether one or more user queries is a head query, wherein a head query satisfies a threshold for the data referenced from one or more user queries; (2) associating one or more click-based judgments directly with each of the plurality of search results for the one or more head queries in the training set; (3) determining whether a plurality of user queries are tail queries, wherein a tail query satisfies a threshold for the data referenced from one or more user queries; (4) aggregating the plurality of search results of one or more tail queries into one or more classes of tail queries, such that the training set comprises at least one class of tail queries having a plurality of tail queries; and (5) associating one or more click-based judgments with each of the classes of tail queries in the training set; based on associating the one or more click-based judgments with each of the plurality of search results in the training set, determining one or more identifiable features from the training set, wherein the one or more identifiable features comprise at least one of; clicks on only textual search results; clicks on vertical search results; clicks on captions and URLs of query-URL pairs; query classification confidence; vertical query confidence; overall clicks on the entire webpage; clicks on advertisements; and clicks on links to see the next page of results appending an absolute numerical value to each of the one or more identifiable features; generating a rule set based on the one or more identifiable features of the one or more user queries; and utilizing the rule set to rank subsequent search results.
-
Specification