Detecting novel document content
First Claim
Patent Images
1. A method comprising:
- identifying, by one or more processors, a group of documents related to a particular topic;
identifying, by the one or more processors, in a first document, of the group of documents, first content that is different from second content in other documents of the group of documents;
determining, by the one or more processors, a degree of difference between the first content and the second content;
determining, by the one or more processors, a score for the first document based on the degree of difference; and
modifying, by the one or more processors, a ranking of the first document relative to the other documents based on the score.
1 Assignment
0 Petitions
Accused Products
Abstract
A system determines an ordered sequence of documents and determines an amount of novel content contained in each document of the ordered sequence of documents. The system assigns a novelty score to each document based on the determined amount of novel content.
46 Citations
20 Claims
-
1. A method comprising:
-
identifying, by one or more processors, a group of documents related to a particular topic; identifying, by the one or more processors, in a first document, of the group of documents, first content that is different from second content in other documents of the group of documents; determining, by the one or more processors, a degree of difference between the first content and the second content; determining, by the one or more processors, a score for the first document based on the degree of difference; and modifying, by the one or more processors, a ranking of the first document relative to the other documents based on the score. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A non-transitory computer-readable medium storing instructions, the instructions comprising:
-
one or more instructions which, when executed by one or more processors, cause the one or more processors to identify a first document and a second document, the first document and the second document being related to a topic, and the topic being received from a client; one or more instructions which, when executed by the one or more processors, cause the one or more processors to identify, in the first document, first content that is different from second content in the second document; one or more instructions which, when executed by the one or more processors, cause the one or more processors to determine a degree of difference between the first content and the second content; one or more instructions which, when executed by the one or more processors, cause the one or more processors to determine a score for the first document based on the degree of difference; and one or more instructions which, when executed by the one or more processors, cause the one or more processors to rank the first document relative to the second document based on the score. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
one or more processors to; identify a plurality of documents related to a search query, identify, in a first document of the plurality of documents, first content that is different from second content in a second document of the plurality of documents, determine a degree of difference between the first content and the second content, determine a score for the first document based on the degree of difference, and rank, based on the score, the first document relative to the second document and among the plurality of documents. - View Dependent Claims (16, 17, 18, 19, 20)
Specification