Generating a related set of documents for an initial set of documents
First Claim
1. A computer-implemented method for identifying one or more second documents related to one or more documents of a set of first documents, the method comprising:
- determining a respective strength of relationship score between each candidate document in a group of candidate documents and each of the first documents by aggregating user selection data for multiple users, the first documents and the candidate documents being in a corpus of web documents, the user selection data indicating, for each of the multiple users, whether the user viewed the candidate document during a window of time after the first document is presented to the user on a search results web page in response to a query, wherein the strength of relationship score is a probability that the candidate document will be viewed given that the first document has been presented to a user on a search results web page in response to a query;
calculating an aggregate strength of relationship score for each candidate document from the respective strength of relationship scores for the candidate document; and
selecting the one or more second documents from the candidate documents according to the aggregate strength of relationship scores for the candidate documents.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying one or more second documents related to one or more first documents. Strength of relationship scores between candidate documents in a group of candidate documents and each first document are determined by aggregating user selection data for users, the user selection data indicating, for each user, whether the user viewed the candidate document during a window of time after the first document is presented to the user on a search results web page in response to a query. An aggregate strength of relationship score is calculated for each candidate document from the strength of relationship scores for the candidate document. Second documents are selected from the candidate documents according to the aggregate strength of relationship scores for the candidate documents.
-
Citations
60 Claims
-
1. A computer-implemented method for identifying one or more second documents related to one or more documents of a set of first documents, the method comprising:
-
determining a respective strength of relationship score between each candidate document in a group of candidate documents and each of the first documents by aggregating user selection data for multiple users, the first documents and the candidate documents being in a corpus of web documents, the user selection data indicating, for each of the multiple users, whether the user viewed the candidate document during a window of time after the first document is presented to the user on a search results web page in response to a query, wherein the strength of relationship score is a probability that the candidate document will be viewed given that the first document has been presented to a user on a search results web page in response to a query; calculating an aggregate strength of relationship score for each candidate document from the respective strength of relationship scores for the candidate document; and selecting the one or more second documents from the candidate documents according to the aggregate strength of relationship scores for the candidate documents. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 47, 48, 49, 50, 51, 52, 53)
-
-
14. A system for identifying one or more second documents related to one or more documents of a set of first documents, the system comprising:
one or more computers configured to perform operations comprising; determining a respective strength of relationship score between each candidate document in a group of candidate documents and each of the first documents by aggregating user selection data for multiple users, the first documents and the candidate documents being in a corpus of web documents, the user selection data indicating, for each of the multiple users, whether the user viewed the candidate document during a window of time after the first document is presented to the user on a search results web page in response to a query, wherein the strength of relationship score is a probability that the candidate document will be viewed given that the first document has been presented to a user on a search results web page in response to a query; calculating an aggregate strength of relationship score for each candidate document from the respective strength of relationship scores for the candidate document; and selecting the one or more second documents from the candidate documents according to the aggregate strength of relationship scores for the candidate documents. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 54, 55, 56, 57, 58, 59, 60)
-
27. A non-transitory computer storage medium having instructions stored thereon that, when executed by data processing apparatus, cause the data processing apparatus to perform operations comprising:
-
determining a respective strength of relationship score between each candidate document in a group of candidate documents and each of the first documents by aggregating user selection data for multiple users, the first documents and the candidate documents being in a corpus of web documents, the user selection data indicating, for each of the multiple users, whether the user viewed the candidate document during a window of time after the first document is presented to the user on a search results web page in response to a query, wherein the strength of relationship score is a probability that the candidate document will be viewed given that the first document has been presented to a user on a search results web page in response to a query; calculating an aggregate strength of relationship score for each candidate document from the respective strength of relationship scores for the candidate document; and selecting the one or more second documents from the candidate documents according to the aggregate strength of relationship scores for the candidate documents. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46)
-
Specification