Generating a related set of documents for an initial set of documents
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying one or more second documents related to one or more first documents. Strength of relationship scores between candidate documents in a group of candidate documents and each first document are determined by aggregating user selection data for users, the user selection data indicating, for each user, whether the user viewed the candidate document during a window of time after the first document is presented to the user on a search results web page in response to a query. An aggregate strength of relationship score is calculated for each candidate document from the strength of relationship scores for the candidate document. Second documents are selected from the candidate documents according to the aggregate strength of relationship scores for the candidate documents.
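The scoring steps summarized in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the `Impression` record layout, the function names, and the use of a plain sum as the aggregation are all hypothetical, since the abstract only says that an aggregate score is calculated from the per-document scores. The per-pair score is estimated here as the fraction of presentations of a first document that were followed by a view of the candidate within the window.

```python
from collections import defaultdict
from typing import NamedTuple


class Impression(NamedTuple):
    """One presentation of a first document on a results page (hypothetical layout)."""
    first_doc: str
    viewed_in_window: frozenset  # candidate documents the user viewed during the window


def relationship_scores(impressions, candidates):
    """Estimate P(candidate viewed | first document presented) from aggregated logs."""
    shown = defaultdict(int)   # first_doc -> number of presentations
    viewed = defaultdict(int)  # (candidate, first_doc) -> presentations followed by a view
    for imp in impressions:
        shown[imp.first_doc] += 1
        for cand in imp.viewed_in_window & candidates:
            viewed[(cand, imp.first_doc)] += 1
    return {pair: n / shown[pair[1]] for pair, n in viewed.items()}


def select_related(impressions, first_docs, candidates, k=2):
    """Return the top-k candidates and every candidate's aggregate score."""
    scores = relationship_scores(impressions, frozenset(candidates))
    # Assumption: the aggregate is the sum of per-first-document scores.
    aggregate = {
        c: sum(scores.get((c, f), 0.0) for f in first_docs) for c in candidates
    }
    # Highest aggregate first; ties broken by document name for determinism.
    ranked = sorted(candidates, key=lambda c: (-aggregate[c], c))
    return ranked[:k], aggregate
```

With two presentations each of first documents "A" and "B", where "X" is viewed after both presentations of "A" and "Y" is viewed after one presentation of "A" and both presentations of "B", the aggregate scores are 1.0 for "X" and 1.5 for "Y", so "Y" ranks first. Other aggregations (a maximum, or a weighted sum) would fit the same outline.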
36 Claims
1. A computer-implemented method for identifying one or more second documents related to one or more documents of a set of first documents, the method comprising:
for each candidate document in a plurality of candidate documents and each of the first documents, aggregating user selection data for multiple users, the first documents and the candidate documents being in a corpus of web documents, the user selection data indicating, for each of the multiple users, an amount of time the user viewed the candidate document during a window of time after the first document was presented to the user on a search results web page in response to a query;
determining a respective strength of relationship score between each candidate document in the plurality of candidate documents and each of the first documents based on the aggregated user selection data, wherein the strength of relationship score is a probability that the candidate document will be viewed given that the first document was presented to a user as a search result in response to a query;
calculating an aggregate strength of relationship score for each candidate document from the respective strength of relationship scores for the candidate document; and
selecting the one or more second documents from the candidate documents according to the aggregate strength of relationship scores for the candidate documents.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A system comprising:
one or more computers programmed to perform operations comprising:
for each candidate document in a plurality of candidate documents and each of the first documents, aggregating user selection data for multiple users, the first documents and the candidate documents being in a corpus of web documents, the user selection data indicating, for each of the multiple users, an amount of time the user viewed the candidate document during a window of time after the first document was presented to the user on a search results web page in response to a query;
determining a respective strength of relationship score between each candidate document in the plurality of candidate documents and each of the first documents based on the aggregated user selection data, wherein the strength of relationship score is a probability that the candidate document will be viewed given that the first document was presented to a user as a search result in response to a query;
calculating an aggregate strength of relationship score for each candidate document from the respective strength of relationship scores for the candidate document; and
selecting the one or more second documents from the candidate documents according to the aggregate strength of relationship scores for the candidate documents.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24
25. A non-transitory computer-readable storage medium having instructions stored thereon that, when executed by one or more computers, cause the one or more computers to perform operations comprising:
for each candidate document in a plurality of candidate documents and each of the first documents, aggregating user selection data for multiple users, the first documents and the candidate documents being in a corpus of web documents, the user selection data indicating, for each of the multiple users, an amount of time the user viewed the candidate document during a window of time after the first document was presented to the user on a search results web page in response to a query;
determining a respective strength of relationship score between each candidate document in the plurality of candidate documents and each of the first documents based on the aggregated user selection data, wherein the strength of relationship score is a probability that the candidate document will be viewed given that the first document was presented to a user as a search result in response to a query;
calculating an aggregate strength of relationship score for each candidate document from the respective strength of relationship scores for the candidate document; and
selecting the one or more second documents from the candidate documents according to the aggregate strength of relationship scores for the candidate documents.
Dependent claims: 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36
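The independent claims recite selection data that records how long each user viewed a candidate document, while the score itself is a probability of a view. One possible way to connect the two, offered purely as an assumption and not taken from the claims, is to count a view only when the viewing time reaches a minimum threshold. The 30-second default, the function name, and the input layout below are hypothetical.

```python
def view_probability(view_seconds, presentations, min_seconds=30.0):
    """Estimate P(candidate viewed | first document presented) for one document pair.

    view_seconds:  viewing times, one per user who opened the candidate
                   during the window after the first document was presented
    presentations: number of times the first document was presented
    min_seconds:   assumed minimum viewing time for a view to count
    """
    if presentations <= 0:
        raise ValueError("presentations must be positive")
    qualifying = sum(1 for s in view_seconds if s >= min_seconds)
    # Cap at 1.0 in case the logs contain more qualifying views than presentations.
    return min(1.0, qualifying / presentations)
```

For example, viewing times of 45, 5, 120 and 30 seconds across 8 presentations give three qualifying views and an estimate of 0.375.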
Specification