Ranking content using content and content authors
First Claim
Patent Images
1. A computer-implemented method, comprising:
- accessing, by at least one processor, a corpus of documents;
determining, by the at least one processor, that a particular document by a particular author and in the corpus of documents includes two or more different content pieces that each occur in at least one of one or more other documents in the corpus of documents;
determining, by the at least one processor, a quantity of (i) other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents, or (ii) authors associated with the other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents;
adjusting, by the at least one processor, a rank of the particular author in relation to other authors based at least in part on the quantity of (i) other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents, or (ii) authors associated with the other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents; and
indexing, by the at least one processor, a quantity of the particular document and other documents by the particular author at a greater frequency than a quantity of documents by another author who is ranked lower than the particular author, wherein the quantity of the particular document and other documents by the particular author is greater than the quantity of documents by the other author who is ranked lower than the particular author.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer program products for identifying original content. In one aspect a method is described that includes identifying a first document in a collection of documents. The first document contains a content piece and the content piece does not occur in any earlier document in the collection. The first document is associated with a first author and the first author associated with a first rank. The first rank of the first author is determined using a score of the content piece. The score is a figure of merit of the content piece.
-
Citations
17 Claims
-
1. A computer-implemented method, comprising:
-
accessing, by at least one processor, a corpus of documents; determining, by the at least one processor, that a particular document by a particular author and in the corpus of documents includes two or more different content pieces that each occur in at least one of one or more other documents in the corpus of documents; determining, by the at least one processor, a quantity of (i) other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents, or (ii) authors associated with the other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents; adjusting, by the at least one processor, a rank of the particular author in relation to other authors based at least in part on the quantity of (i) other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents, or (ii) authors associated with the other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents; and indexing, by the at least one processor, a quantity of the particular document and other documents by the particular author at a greater frequency than a quantity of documents by another author who is ranked lower than the particular author, wherein the quantity of the particular document and other documents by the particular author is greater than the quantity of documents by the other author who is ranked lower than the particular author. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer readable medium having stored thereon instructions, which, when executed by one or more processors, causes the one or more processors to perform the operations comprising:
-
accessing, by at least one processor, a corpus of documents; determining, by the at least one processor, that a particular document by a particular author and in the corpus of documents includes two or more different content pieces that each occur in at least one of one or more other documents in the corpus of documents; determining, by the at least one processor, a quantity of (i) other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents, or (ii) authors associated with the other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents; adjusting, by the at least one processor, a rank of the particular author in relation to other authors based at least in part on the quantity of (i) other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents, or (ii) authors associated with the other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents; and indexing, by the at least one processor, a quantity of the particular document and other documents by the particular author at a greater frequency than a quantity of documents by another author who is ranked lower than the particular author, wherein the quantity of the particular document and other documents by the particular author is greater than the quantity of documents by the other author who is ranked lower than the particular author. - View Dependent Claims (11, 12, 13)
-
-
14. A system comprising:
-
one or more processors; and a computer readable medium coupled to the data processing apparatus, having instructions stored thereon which, when executed by the data processing apparatus, cause the data processing apparatus to perform operations comprising; accessing, by at least one processor, a corpus of documents; determining, by the at least one processor, that a particular document by a particular author and in the corpus of documents includes two or more different content pieces that each occur in at least one of one or more other documents in the corpus of documents; determining, by the at least one processor, a quantity of (i) other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents, or (ii) authors associated with the other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents; adjusting, by the at least one processor, a rank of the particular author in relation to other authors based at least in part on the quantity of (i) other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents, or (ii) authors associated with the other documents in the corpus of documents whose content pieces are included in the particular document by the particular author and in the corpus of documents; and indexing, by the at least one processor, a quantity of the particular document and other documents by the particular author at a greater frequency than a quantity of documents by another author who is ranked lower than the particular author, wherein the quantity of the particular document and other documents by the particular author is greater than the quantity of documents by the other author who is ranked lower than the particular author. - View Dependent Claims (15, 16, 17)
-
Specification