Detecting novel document content
First Claim
Patent Images
1. A method implemented by one or more computing devices, the method comprising:
- identifying, by a processor of one of the one or more computing devices, a temporally-ordered sequence of documents based on a search query received from a client device;
identifying, by a processor of one of the one or more computing devices, in a particular document of the temporally-ordered sequence of documents, novel content including content not present in other documents of the temporally-ordered sequence of documents;
assigning, by a processor of one of the one or more computing devices, a score to the particular document based on an amount of the novel content in the particular document; and
ranking, by a processor of one of the one or more computing devices, the particular document among the temporally-ordered sequence of documents based on the assigned score.
1 Assignment
0 Petitions
Accused Products
Abstract
A system determines an ordered sequence of documents and determines an amount of novel content contained in each document of the ordered sequence of documents. The system assigns a novelty score to each document based on the determined amount of novel content.
45 Citations
20 Claims
-
1. A method implemented by one or more computing devices, the method comprising:
-
identifying, by a processor of one of the one or more computing devices, a temporally-ordered sequence of documents based on a search query received from a client device; identifying, by a processor of one of the one or more computing devices, in a particular document of the temporally-ordered sequence of documents, novel content including content not present in other documents of the temporally-ordered sequence of documents; assigning, by a processor of one of the one or more computing devices, a score to the particular document based on an amount of the novel content in the particular document; and ranking, by a processor of one of the one or more computing devices, the particular document among the temporally-ordered sequence of documents based on the assigned score. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A memory device that stores computer-executable instructions, the memory device comprising:
-
instructions for obtaining one or more textual sequences from a document of a sequence of documents; instructions for identifying one or more pairs of the one or more textual sequences that occur within a paragraph of one another in the document; instructions for identifying, based on the one or more textual sequences and the one or more pairs, a presence of novel content in the document where the novel content includes content that does not occur in other documents in the sequence of documents; and instructions for assigning a score to the document based on the identified novel content of the document. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A computer-implemented method, comprising:
-
identifying, by a processor, in a document of a plurality of documents, one or more textual sequences; identifying, by the processor, based on the one or more textual sequences, a presence of novel content in the document where the novel content includes content that does not occur in other documents of the plurality of documents; assigning, by the processor, a score to the document based on the identified novel content including each of the one or more textual sequences; and ranking, by the processor, the document among the plurality of documents based on the assigned score. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification