Modifying search result ranking based on corpus search statistics
First Claim
1. A method implemented by data processing apparatus, the method comprising:
determining, for a plurality of search results responsive to a query, a respective count of times search results in the plurality of search results that refer to documents in a base corpus have been presented, and a respective count of times search results in the plurality of search results that refer to documents in the base corpus have been selected, wherein the respective counts for presentations and selections of search results that refer to documents in the base corpus are for searches initiated by users in a plurality of different countries who employ a specific language;
determining, for the plurality of search results responsive to the query, a respective count of times search results in the plurality of search results that refer to documents in a second corpus have been presented, and a respective count of times search results in the plurality of search results that refer to documents in the second corpus have been selected, wherein the respective counts for presentations and selections of search results that refer to documents in the second corpus are for searches initiated by users in the plurality of different countries who employ the specific language;
calculating a click through rate of the base corpus for the query based at least in part on the respective counts for the base corpus;
calculating a click through rate of the second corpus for the query based at least in part on the respective counts for the second corpus;
calculating a measure of relative relevance based at least in part on a ratio of the second corpus click through rate to the base corpus click through rate; and
providing the measure of relative relevance to a ranking engine for ranking of search results for a search corresponding to the query; and
wherein fewer search results of the plurality refer to documents in the second corpus than to documents in the base corpus.
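The calculating steps of the claim can be sketched in a few lines. This is only an illustration under stated assumptions: the names CorpusCounts, click_through_rate, and relative_relevance are hypothetical, the click through rate is taken as selections divided by presentations, and the handling of a zero denominator is a choice made here, not something the claim specifies.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CorpusCounts:
    """Presentation and selection counts for one corpus and one query."""
    presented: int  # times results referring to this corpus were shown
    selected: int   # times results referring to this corpus were selected


def click_through_rate(counts: CorpusCounts) -> float:
    """Selections divided by presentations; 0.0 when nothing was presented."""
    if counts.presented == 0:
        return 0.0
    return counts.selected / counts.presented


def relative_relevance(base: CorpusCounts, second: CorpusCounts) -> Optional[float]:
    """Ratio of the second-corpus CTR to the base-corpus CTR.

    Assumption: a zero base CTR is treated as "no signal" and yields None.
    """
    base_ctr = click_through_rate(base)
    if base_ctr == 0.0:
        return None
    return click_through_rate(second) / base_ctr


# Made-up counts for illustration only.
base = CorpusCounts(presented=1000, selected=100)
second = CorpusCounts(presented=200, selected=50)
print(relative_relevance(base, second))
```

A ratio above 1 would suggest that, for this query, results from the second corpus are selected more often per presentation than results from the base corpus.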
Abstract
Methods, systems, and apparatus, including computer program products, for ranking search results of a search query using corpus search statistics. In one aspect, a method includes determining a first relevance of a first corpus to a search query, determining a second relevance of a second corpus to the search query, determining a measure of relative relevance of the first corpus and the second corpus to the search query, and providing the measure of relative relevance to a ranking engine for ranking of search results for a new search corresponding to the search query.
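The abstract says the measure of relative relevance is provided to a ranking engine but does not say how the engine uses it. The sketch below shows one possible use, purely as an assumption: scores of second-corpus results are scaled by a clamped copy of the measure before sorting. Result, rerank, max_boost, the corpus labels, and the example URLs are all hypothetical.

```python
from typing import NamedTuple


class Result(NamedTuple):
    url: str
    corpus: str   # "base" or "second" (hypothetical labels)
    score: float  # baseline relevance score assigned before adjustment


def rerank(results, relative_relevance, max_boost=2.0):
    """Scale second-corpus scores by a clamped relative relevance, then sort."""
    # Assumption: clamp the factor so one noisy ratio cannot dominate the order.
    factor = max(1.0 / max_boost, min(max_boost, relative_relevance))
    adjusted = [
        r._replace(score=r.score * factor) if r.corpus == "second" else r
        for r in results
    ]
    return sorted(adjusted, key=lambda r: r.score, reverse=True)


results = [
    Result("https://example.com/page", "base", 0.80),
    Result("https://example.com/video", "second", 0.50),
]
for r in rerank(results, relative_relevance=2.5):
    print(r.url, round(r.score, 2))
```

With a measure of 1.0 the order is unchanged, so the adjustment only takes effect when the two corpora show different click through rates for the query.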
24 Claims
1. A method implemented by data processing apparatus, as recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A system comprising:
a storage device encoded with instructions; and
data processing apparatus operable to execute the instructions to perform operations comprising:
determining, for a plurality of search results responsive to a query, a respective count of times search results in the plurality of search results that refer to documents in a base corpus have been presented, and a respective count of times search results in the plurality of search results that refer to documents in the base corpus have been selected, wherein the respective counts for presentations and selections of search results that refer to documents in the base corpus are for searches initiated by users in a plurality of different countries who employ a specific language;
determining, for the plurality of search results responsive to the query, a respective count of times search results in the plurality of search results that refer to documents in a second corpus have been presented, and a respective count of times search results in the plurality of search results that refer to documents in the second corpus have been selected, wherein the respective counts for presentations and selections of search results that refer to documents in the second corpus are for searches initiated by users in the plurality of different countries who employ the specific language;
calculating a click through rate of the base corpus for the query based at least in part on the respective counts for the base corpus;
calculating a click through rate of the second corpus for the query based at least in part on the respective counts for the second corpus;
calculating a measure of relative relevance based at least in part on a ratio of the second corpus click through rate to the base corpus click through rate; and
providing the measure of relative relevance to a ranking engine for ranking of search results for a search corresponding to the query; and
wherein fewer search results of the plurality refer to documents in the second corpus than to documents in the base corpus.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A storage device encoded with a program product, the program product which, when executed by data processing apparatus, cause the data processing apparatus to perform operations comprising:
determining, for a plurality of search results responsive to a query, a respective count of times search results in the plurality of search results that refer to documents in a base corpus have been presented, and a respective count of times search results in the plurality of search results that refer to documents in the base corpus have been selected, wherein the respective counts for presentations and selections of search results that refer to documents in the base corpus are for searches initiated by users in a plurality of different countries who employ a specific language;
determining, for the plurality of search results responsive to the query, a respective count of times search results in the plurality of search results that refer to documents in a second corpus have been presented, and a respective count of times search results in the plurality of search results that refer to documents in the second corpus have been selected, wherein the respective counts for presentations and selections of search results that refer to documents in the second corpus are for searches initiated by users in the plurality of different countries who employ the specific language;
calculating a click through rate of the base corpus for the query based at least in part on the respective counts for the base corpus;
calculating a click through rate of the second corpus for the query based at least in part on the respective counts for the second corpus;
calculating a measure of relative relevance based at least in part on a ratio of the second corpus click through rate to the base corpus click through rate; and
providing the measure of relative relevance to a ranking engine for ranking of search results for a search corresponding to the query; and
wherein fewer search results of the plurality refer to documents in the second corpus than to documents in the base corpus.
Dependent claims: 18, 19, 20, 21, 22, 23, 24
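Each independent claim limits the counts to searches by users in several countries who employ one specific language. A minimal sketch of that counting step follows, assuming a hypothetical log in which every record stands for one presented result; the record layout, the sample data, and the name count_by_corpus are illustrative and do not come from the patent.

```python
from collections import defaultdict

# Hypothetical records: (query, user_language, user_country, corpus, was_selected)
LOG = [
    ("jaguar", "es", "MX", "base", False),
    ("jaguar", "es", "ES", "base", True),
    ("jaguar", "es", "AR", "second", True),
    ("jaguar", "es", "MX", "second", True),
    ("jaguar", "en", "US", "second", False),  # different language: excluded
]


def count_by_corpus(log, query, language):
    """Return {corpus: [presented, selected]} for one query and one language.

    Records from every country are pooled; only the language is filtered.
    """
    counts = defaultdict(lambda: [0, 0])
    for q, lang, _country, corpus, was_selected in log:
        if q != query or lang != language:
            continue
        counts[corpus][0] += 1                  # each record is one presentation
        counts[corpus][1] += int(was_selected)  # count it again if it was selected
    return dict(counts)


print(count_by_corpus(LOG, "jaguar", "es"))
```

The per-corpus counts produced this way are the inputs from which the two click through rates and their ratio would then be calculated.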
Specification