Generating a related set of documents for an initial set of documents
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying one or more second documents related to one or more first documents. Strength of relationship scores between candidate documents in a group of candidate documents and each first document are determined by aggregating user selection data for users, the user selection data indicating, for each user, whether the user viewed the candidate document during a window of time after the first document is presented to the user on a search results web page in response to a query. An aggregate strength of relationship score is calculated for each candidate document from the strength of relationship scores for the candidate document. Second documents are selected from the candidate documents according to the aggregate strength of relationship scores for the candidate documents.
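The four steps summarized above (aggregate selection data, score each candidate against each first document, combine the scores, select) can be sketched in a few lines. This is only an illustration under stated assumptions, not the patented implementation: the source does not give a scoring formula, so the sketch assumes the strength of relationship is the fraction of users shown a result for the first document who then viewed the candidate within the window, and that the aggregate score is a plain sum. The function name `related_documents`, its parameters, the log tuple layouts, and the `top_k` cutoff are all hypothetical.

```python
from collections import defaultdict


def related_documents(presentations, views, first_docs, candidates, window, top_k=2):
    """Illustrative sketch only; names and formulas are assumptions.

    presentations: iterable of (user, first_doc, time) records, one per search
        result for first_doc shown to the user on a results page.
    views: iterable of (user, doc, time) records, one per document view.
    window: length of the time window, in the same units as the timestamps.
    """
    first_docs = set(first_docs)
    candidates = set(candidates)

    # Step 1: aggregate user selection data across users.
    shown = defaultdict(list)        # user -> [(first_doc, time shown)]
    users_shown = defaultdict(set)   # first_doc -> users shown a result for it
    for user, doc, t in presentations:
        if doc in first_docs:
            shown[user].append((doc, t))
            users_shown[doc].add(user)

    # (candidate, first_doc) -> users who viewed the candidate within the window
    users_viewed = defaultdict(set)
    for user, doc, t_view in views:
        if doc not in candidates:
            continue
        for first, t_shown in shown.get(user, ()):
            if 0 <= t_view - t_shown <= window:
                users_viewed[(doc, first)].add(user)

    # Step 2: strength of relationship per (candidate, first_doc) pair.
    # Assumed formula: share of users shown first_doc who then viewed the candidate.
    strength = {}
    for cand in candidates:
        for first in first_docs:
            n = len(users_shown[first])
            strength[(cand, first)] = len(users_viewed[(cand, first)]) / n if n else 0.0

    # Step 3: aggregate score per candidate (assumed: sum over first documents).
    aggregate = {c: sum(strength[(c, f)] for f in first_docs) for c in candidates}

    # Step 4: select the highest-scoring candidates; ties broken by document id.
    ranked = sorted(candidates, key=lambda c: (-aggregate[c], c))
    return ranked[:top_k], aggregate


# Toy logs: timestamps in seconds, 30-second window.
presentations = [("u1", "A", 0), ("u2", "A", 0), ("u2", "B", 5), ("u3", "B", 0)]
views = [("u1", "X", 10), ("u2", "X", 20), ("u2", "Y", 8), ("u3", "Y", 100), ("u3", "Z", 3)]
selected, scores = related_documents(presentations, views, ["A", "B"], ["X", "Y", "Z"], window=30)
print(selected, scores)
```

Other choices would fit the same outline equally well, for example weighting each first document differently when combining scores, or selecting by a score threshold instead of a fixed count.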
29 Claims
1. A computer-implemented method for identifying one or more second documents related to one or more documents of a set of first documents, the method comprising:
aggregating user selection data for multiple users, the first documents and a group of candidate documents being in a corpus of web documents, the user selection data indicating, for each of the multiple users, whether the user viewed one of the candidate documents during a window of time after a search result corresponding to one of the first documents was presented to the user on a search results web page in response to a query;
determining, using the aggregated user selection data, a respective strength of relationship score between each candidate document in the group of candidate documents and each first document in the set of first documents, each respective strength of relationship score being determined based on whether each user of the multiple users viewed the candidate document during the window of time after a search result corresponding to the first document was presented to the user on a search results web page in response to a query;
calculating an aggregate strength of relationship score for each candidate document from the respective strength of relationship scores for the candidate document; and
selecting the one or more second documents from the candidate documents according to the aggregate strength of relationship scores for the candidate documents.
Dependent Claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
15. A system for identifying one or more second documents related to one or more documents of a set of first documents, the system comprising:
one or more computers configured to perform operations comprising:
aggregating user selection data for multiple users, the first documents and a group of candidate documents being in a corpus of web documents, the user selection data indicating, for each of the multiple users, whether the user viewed one of the candidate documents during a window of time after a search result corresponding to one of the first documents was presented to the user on a search results web page in response to a query;
determining, using the aggregated user selection data, a respective strength of relationship score between each candidate document in the group of candidate documents and each first document in the set of first documents, each respective strength of relationship score being determined based on whether each user of the multiple users viewed the candidate document during the window of time after a search result corresponding to the first document was presented to the user on a search results web page in response to a query;
calculating an aggregate strength of relationship score for each candidate document from the respective strength of relationship scores for the candidate document; and
selecting the one or more second documents from the candidate documents according to the aggregate strength of relationship scores for the candidate documents.
Dependent Claims: 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28
29. A non-transitory computer storage medium having instructions stored thereon that, when executed by data processing apparatus, cause the data processing apparatus to perform operations comprising:
aggregating user selection data for multiple users, the first documents and a group of candidate documents being in a corpus of web documents, the user selection data indicating, for each of the multiple users, whether the user viewed one of the candidate documents during a window of time after a search result corresponding to one of the first documents was presented to the user on a search results web page in response to a query;
determining, using the aggregated user selection data, a respective strength of relationship score between each candidate document in the group of candidate documents and each first document in the set of first documents, each respective strength of relationship score being determined based on whether each user of the multiple users viewed the candidate document during the window of time after a search result corresponding to the first document was presented to the user on a search results web page in response to a query;
calculating an aggregate strength of relationship score for each candidate document from the respective strength of relationship scores for the candidate document; and
selecting the one or more second documents from the candidate documents according to the aggregate strength of relationship scores for the candidate documents.
Specification