Method and apparatus for clustering news online content based on content freshness and quality of content source
First Claim
1. A method comprising:
- identifying, by one or more processors, online content,the online content being associated with a document;
calculating, by one or more processors, a first score for the online content, based on a measure of freshness of the online content,the online content including content relating to an event, andthe measure of freshness being based on a first time when the event occurred and a second time when the online content was published;
determining, by the one or more processors, a second score for the online content based on a quality of a source that published the online content,the source being an entity that publishes documents;
ranking, by the one or more processors, the online content based on the first score and the second score; and
providing, by the one or more processors, the ranked online content for display.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus are described for scoring documents in response, in part, to parameters related to the document, source, and/or cluster score. Methods and apparatus are also described for scoring a cluster in response, in part, to parameters related to documents within the cluster and/or sources corresponding to the documents within the cluster. In one embodiment, the invention may identify the source; detect a plurality of documents published by the source; analyze the plurality of documents with respect to at least one parameter; and determine a source score for the source in response, in part, to the parameter. In another embodiment, the invention may identify a topic; identify a plurality of clusters in response to the topic; analyze at least one parameter corresponding to each of the plurality of clusters; and calculate a cluster score for each of the plurality of clusters in response, in part, to the parameter.
76 Citations
20 Claims
-
1. A method comprising:
-
identifying, by one or more processors, online content, the online content being associated with a document; calculating, by one or more processors, a first score for the online content, based on a measure of freshness of the online content, the online content including content relating to an event, and the measure of freshness being based on a first time when the event occurred and a second time when the online content was published; determining, by the one or more processors, a second score for the online content based on a quality of a source that published the online content, the source being an entity that publishes documents; ranking, by the one or more processors, the online content based on the first score and the second score; and providing, by the one or more processors, the ranked online content for display. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
one or more processors to; identify online content, the online content being associated with a document; calculate a first score for the online content based on a measure of freshness of the online content, the online content including content relating to an event, and the measure of freshness being based on a first time when the event occurred and a second time when the online content was published; determine a second score for the online content based on a quality of a source that published the online content, the source including an entity that publishes documents; rank the online content based on the first score and the second score; and provide the ranked online content for display. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
15. A non-transitory computer-readable medium storing instructions, the instructions comprising:
one or more instructions that, when executed by one or more processors, cause the one or more processors to; identify online content, the online content being associated with a document; calculate a first score for the online content based on a measure of freshness of the online content, the online content including content relating to an event, and the measure of freshness being based on a first time when the event occurred and a second time when the online content was published; determine a second score for the online content based on a quality of a source that published the online content, the source including an entity that publishes documents; rank the online content based on the first score and the second score; and provide the ranked online content for display. - View Dependent Claims (16, 17, 18, 19, 20)
Specification