Ranking content using content and content authors
Abstract
Methods, systems, and apparatus, including computer program products, for identifying original content. In one aspect, a method is described that includes identifying a first document in a collection of documents. The first document contains a content piece, and the content piece does not occur in any earlier document in the collection. The first document is associated with a first author, and the first author is associated with a first rank. The first rank of the first author is determined using a score of the content piece. The score is a figure of merit of the content piece.
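The abstract can be illustrated with a minimal Python sketch. It assumes each document carries an author, an occurrence date, and a set of content-piece identifiers, and that an author's rank is the sum of the scores of the pieces first seen under that author's name. The abstract only says the rank is determined "using" the score, so the summation, the `Doc` type, and the function names are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date
from typing import Dict, FrozenSet, Iterable, Mapping


@dataclass(frozen=True)
class Doc:
    author: str
    occurred: date              # date the document first occurred
    pieces: FrozenSet[str]      # identifiers of the content pieces it contains


def original_authors(collection: Iterable[Doc]) -> Dict[str, str]:
    """Map each content piece to the author of the earliest document containing it."""
    first: Dict[str, str] = {}
    # sorted() is stable, so documents sharing a date keep their input order.
    for doc in sorted(collection, key=lambda d: d.occurred):
        for piece in doc.pieces:
            first.setdefault(piece, doc.author)  # only the earliest document wins
    return first


def author_ranks(collection: Iterable[Doc],
                 piece_scores: Mapping[str, float]) -> Dict[str, float]:
    """Assumed aggregation: rank = sum of scores of the author's original pieces."""
    totals: Dict[str, float] = defaultdict(float)
    for piece, author in original_authors(collection).items():
        totals[author] += piece_scores.get(piece, 0.0)
    return dict(totals)
```

Here a piece that appears in a later document contributes nothing to that later author's rank; only the author of the earliest document is credited.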
28 Claims
1. A computer-implemented method comprising:
determining a content baseline that specifies a threshold date before which authorship of content pieces is attributed to no particular author;
identifying two or more documents, wherein each of the two or more documents contains a same particular content piece;
determining whether an earliest occurring of the two or more documents occurred before the content baseline that specifies the threshold date before which content pieces are deemed neither original nor copied; and
determining whether to attribute an authorship of the particular content piece, in a later occurring of the two or more documents, to (i) an author associated with the earliest occurring of the two or more documents or to (ii) no particular author, based on determining whether the earliest occurring of the two or more documents occurred before the content baseline that specifies the threshold date before which authorship of content pieces is attributed to no particular author.
Dependent claims: 2-10.
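The decision recited in claim 1 can be sketched in a few lines of Python. This is one reading of the claim, not the patentee's implementation: it assumes that an earliest document dated before the baseline yields "no particular author" (represented as `None`), and that otherwise the earliest document's author is credited. The `Document` type and `attribute_authorship` name are hypothetical.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional, Sequence


@dataclass(frozen=True)
class Document:
    author: str
    occurred: date  # date the document first occurred


def attribute_authorship(documents: Sequence[Document],
                         baseline: date) -> Optional[str]:
    """Return the author credited for a content piece shared by `documents`.

    All documents are assumed to contain the same content piece.
    Returns None to mean "no particular author".
    """
    if len(documents) < 2:
        raise ValueError("the claim considers two or more documents")
    earliest = min(documents, key=lambda d: d.occurred)
    if earliest.occurred < baseline:
        # The piece predates the baseline: treated as neither original nor copied.
        return None
    # Otherwise the piece in any later document is attributed to the earliest author.
    return earliest.author
```

For example, with documents by "alice" (2005-03-01) and "bob" (2006-01-01), a baseline of 2004-01-01 credits "alice", while a baseline of 2005-06-01 credits no particular author.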
11. A non-transitory computer-readable medium storing software having thereon instructions, which, when executed by one or more computers, cause the one or more computers to perform operations of:
determining a content baseline that specifies a threshold date before which authorship of content pieces is attributed to no particular author;
identifying two or more documents, wherein each of the two or more documents contains a same particular content piece;
determining an earliest occurring of the two or more documents;
determining whether the earliest occurring of the two or more documents occurred before the content baseline that specifies the threshold date before which content pieces are deemed neither original nor copied; and
determining whether to attribute an authorship of the particular content piece, in a later occurring of the two or more documents, to (i) an author associated with the earliest occurring of the two or more documents or to (ii) no particular author, based on determining whether the earliest occurring of the two or more documents occurred before the content baseline that specifies the threshold date before which authorship of content pieces is attributed to no particular author.
Dependent claims: 12-19.
20. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising:
determining a content baseline that specifies a threshold date before which authorship of content pieces is attributed to no particular author;
identifying two or more documents, wherein each of the two or more documents contains a same particular content piece;
determining an earliest occurring of the two or more documents;
determining whether the earliest occurring of the two or more documents occurred before the content baseline that specifies the threshold date before which content pieces are deemed neither original nor copied; and
determining whether to attribute an authorship of the particular content piece, in a later occurring of the two or more documents, to (i) an author associated with the earliest occurring of the two or more documents or to (ii) no particular author, based on determining whether the earliest occurring of the two or more documents occurred before the content baseline that specifies the threshold date before which authorship of content pieces is attributed to no particular author.
Dependent claims: 21-28.
Specification