Clustering documents based on common document selections
First Claim
1. A method, performed by one or more server devices, the method comprising:
receiving, by at least one of one or more server devices, first navigation information identifying a first set of documents that are selected after a first document is provided, the first navigation information identifying a first plurality of documents, of the first set of documents, that are selected, each of the first plurality of documents being selected after the first document is provided, and each of the first plurality of documents being selected based on information associated with the first document, and the first navigation information including information identifying a quantity of selections of the first plurality of documents after the first document is provided;
receiving, by at least one of the one or more server devices, second navigation information identifying a second set of documents that are selected after a second document is provided, the second navigation information identifying a second plurality of documents, of the second set of documents, that are selected, each of the second plurality of documents being selected after the second document is provided, and each of the second plurality of documents being selected based on information associated with the second document;
generating, by at least one of the one or more server devices, a first data structure that includes information associating the first document with the first navigation information;
generating, by at least one of the one or more server devices, a second data structure that includes information associating the second document with the second navigation information;
comparing, by at least one of the one or more server devices and using the first data structure and the second data structure, the first set of documents to the second set of documents;
generating, by at least one of the one or more server devices, a similarity score based on the comparing and based on the information identifying the quantity of selections of the first plurality of documents after the first document is provided;
determining, by at least one of the one or more server devices, based on the similarity score, that the first document is similar to the second document; and
generating, by at least one of the one or more server devices and based on determining that the first document is similar to the second document, a cluster that includes identification information identifying the first document and the second document.
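The steps of the method claim can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the log format, the names `build_structure` and `similarity`, the scoring rule (the share of the first document's selections that went to documents also selected after the second document), and the threshold. The claim itself does not fix any of these.

```python
from collections import Counter

# Hypothetical navigation log: (document provided, document selected afterwards).
LOG = [
    ("A", "x"), ("A", "x"), ("A", "y"), ("A", "z"),
    ("B", "x"), ("B", "y"), ("B", "w"),
]


def build_structure(log, source):
    """Associate a document with its navigation information.

    Returns a Counter mapping each subsequently selected document to the
    quantity of selections observed after `source` was provided.
    """
    return Counter(selected for provided, selected in log if provided == source)


def similarity(first, second):
    """Assumed scoring rule: fraction of the first document's selections
    that landed on documents also selected after the second document."""
    total = sum(first.values())
    if total == 0:
        return 0.0
    shared = sum(count for doc, count in first.items() if doc in second)
    return shared / total


THRESHOLD = 0.5  # assumed cutoff; not specified in the claim

first = build_structure(LOG, "A")   # Counter({'x': 2, 'y': 1, 'z': 1})
second = build_structure(LOG, "B")  # Counter({'x': 1, 'y': 1, 'w': 1})
score = similarity(first, second)   # 3 of 4 selections are shared: 0.75

# The cluster holds identification information for both documents.
cluster = {"A", "B"} if score >= THRESHOLD else None
```

Because the weighting uses only the first document's selection counts, this score is asymmetric, which mirrors the claim's reference to the quantity of selections after the first document is provided.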
Abstract
One or more server devices may receive first navigation information identifying a first set of documents that are selected after a first document is presented and second navigation information identifying a second set of documents that are selected after a second document is presented; compare the first set of documents to the second set of documents; generate a similarity score based on the comparing; determine, based on the similarity score, that the first document is similar to the second document; and generate, based on determining that the first document is similar to the second document, a cluster that includes identification information identifying the first document and the second document.
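The set comparison summarized in the abstract can be sketched with a Jaccard overlap. This is one plausible choice of similarity score rather than the one the patent prescribes; the function names and the 0.5 threshold are hypothetical.

```python
def jaccard(first_set, second_set):
    """Overlap of two sets of selected documents: |intersection| / |union|."""
    union = first_set | second_set
    if not union:
        return 0.0
    return len(first_set & second_set) / len(union)


def cluster_if_similar(first_doc, first_set, second_doc, second_set, threshold=0.5):
    """Return a cluster of both document identifiers when the score meets
    the (assumed) threshold, otherwise None."""
    score = jaccard(first_set, second_set)
    return {first_doc, second_doc} if score >= threshold else None


# Two documents whose follow-up selections share two of four distinct documents.
result = cluster_if_similar("doc1", {"a", "b", "c"}, "doc2", {"b", "c", "d"})
```

Here the score is 2 / 4 = 0.5, so `result` is the set containing `doc1` and `doc2`.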
18 Claims
1. A method, performed by one or more server devices, the method comprising:
receiving, by at least one of one or more server devices, first navigation information identifying a first set of documents that are selected after a first document is provided, the first navigation information identifying a first plurality of documents, of the first set of documents, that are selected, each of the first plurality of documents being selected after the first document is provided, and each of the first plurality of documents being selected based on information associated with the first document, and the first navigation information including information identifying a quantity of selections of the first plurality of documents after the first document is provided;
receiving, by at least one of the one or more server devices, second navigation information identifying a second set of documents that are selected after a second document is provided, the second navigation information identifying a second plurality of documents, of the second set of documents, that are selected, each of the second plurality of documents being selected after the second document is provided, and each of the second plurality of documents being selected based on information associated with the second document;
generating, by at least one of the one or more server devices, a first data structure that includes information associating the first document with the first navigation information;
generating, by at least one of the one or more server devices, a second data structure that includes information associating the second document with the second navigation information;
comparing, by at least one of the one or more server devices and using the first data structure and the second data structure, the first set of documents to the second set of documents;
generating, by at least one of the one or more server devices, a similarity score based on the comparing and based on the information identifying the quantity of selections of the first plurality of documents after the first document is provided;
determining, by at least one of the one or more server devices, based on the similarity score, that the first document is similar to the second document; and
generating, by at least one of the one or more server devices and based on determining that the first document is similar to the second document, a cluster that includes identification information identifying the first document and the second document.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A non-transitory computer-readable medium storing instructions, the instructions comprising:
a set of instructions, which, when executed by one or more processors, cause the one or more processors to:
receive first navigation information identifying a first set of documents that are selected after a first document is provided, the first navigation information identifying a first plurality of documents, of the first set of documents, that are selected, each of the first plurality of documents being selected after the first document is provided, and each of the first plurality of documents being selected based on information associated with the first document, and the first navigation information including information identifying a quantity of selections of the first plurality of documents after the first document is provided;
receive second navigation information identifying a second set of documents that are selected after a second document is provided, the second navigation information identifying a second plurality of documents, of the second set of documents, that are selected, each of the second plurality of documents being selected after the second document is provided, and each of the second plurality of documents being selected based on information associated with the second document;
generate a first data structure that includes information associating the first document with the first navigation information;
generate a second data structure that includes information associating the second document with the second navigation information;
compare, using the first data structure and the second data structure, the first set of documents to the second set of documents;
generate a similarity score based on the comparing and based on the information identifying the quantity of selections of the first plurality of documents after the first document is provided;
determine, based on the similarity score, that the first document is similar to the second document; and
assign, based on determining that the first document is similar to the second document, the first document and the second document to a cluster, the cluster including identification information identifying the first document and the second document.
Dependent claims: 11, 12, 13
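The claim covers assigning one similar pair to a cluster. When many pairs have been judged similar, one way to extend the assignment step is to merge pairs that share a document, sketched below with a union-find structure. This transitive merging is an assumption that goes beyond the claim text, and `assign_clusters` is a hypothetical name.

```python
def assign_clusters(similar_pairs):
    """Group document identifiers so that any two documents linked by a
    chain of similar pairs end up in the same cluster."""
    parent = {}

    def find(doc):
        # Register unseen documents as their own root, then walk to the root.
        parent.setdefault(doc, doc)
        while parent[doc] != doc:
            parent[doc] = parent[parent[doc]]  # path halving
            doc = parent[doc]
        return doc

    for first_doc, second_doc in similar_pairs:
        root_a, root_b = find(first_doc), find(second_doc)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two clusters

    clusters = {}
    for doc in parent:
        clusters.setdefault(find(doc), set()).add(doc)
    # Sorted output keeps the result deterministic.
    return sorted(sorted(members) for members in clusters.values())


clusters = assign_clusters([("A", "B"), ("B", "C"), ("D", "E")])
```

With these pairs, `A`, `B`, and `C` fall into one cluster and `D` and `E` into another.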
14. A system comprising:
one or more memory devices storing instructions; and
one or more processors to execute the instructions to:
receive first navigation information identifying a first set of documents that are selected after a first document is provided, the first navigation information further identifying a quantity of times that each document, in the first set of documents, was selected after the first document was provided, the first navigation information further identifying a first plurality of documents, of the first set of documents, that are selected, each of the first plurality of documents being selected after the first document is provided, and each of the first plurality of documents being selected based on information associated with the first document;
receive second navigation information identifying a second set of documents that are selected after a second document is provided, the second navigation information further identifying a quantity of times that each document, in the second set of documents, was selected after the second document was provided, the second navigation information further identifying a second plurality of documents, of the second set of documents, that are selected, each of the second plurality of documents being selected after the second document is provided, and each of the second plurality of documents being selected based on information associated with the second document;
generate a first data structure that includes information associating the first document with the first navigation information;
generate a second data structure that includes information associating the second document with the second navigation information;
compare, using the first data structure and the second data structure, the first set of documents to the second set of documents, when comparing the first set of documents to the second set of documents, the one or more processors are to: generate a similarity score based on the comparing, the similarity score being based on at least one of: the quantity of times each document, in the first set of documents, was selected after the first document was provided, or the quantity of times each document, in the second set of documents, was selected after the second document was provided;
determine, based on the similarity score, that the first document is similar to the second document; and
generate, based on determining that the first document is similar to the second document, a cluster that includes identification information identifying the first document and the second document.
Dependent claims: 15, 16, 17, 18
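Where the score draws on per-document selection counts from both sets, as in the system claim, one possible formulation is cosine similarity over the two count vectors. The sketch below assumes each data structure is a `Counter` of selected document to selection count; the choice of cosine similarity and the name `cosine` are illustrative, not taken from the patent.

```python
import math
from collections import Counter


def cosine(first_counts, second_counts):
    """Cosine similarity between two selection-count vectors.

    Documents missing from a Counter contribute a count of zero.
    """
    dot = sum(n * second_counts[doc] for doc, n in first_counts.items())
    norm = (
        math.sqrt(sum(n * n for n in first_counts.values()))
        * math.sqrt(sum(n * n for n in second_counts.values()))
    )
    return dot / norm if norm else 0.0


# Hypothetical counts: how often each document was selected after each source.
first_counts = Counter({"x": 3, "y": 4})
second_counts = Counter({"x": 4, "y": 3})
score = cosine(first_counts, second_counts)  # 24 / 25, about 0.96
```

Unlike a plain set overlap, this score drops when two documents lead to the same follow-up documents but in very different proportions.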
Specification