Methods and apparatus for clustering news online content based on content freshness and quality of content source
First Claim
1. A computer-implemented method comprising:
- identifying, by a processor, online documents published online by one or more sources;
calculating, by the processor, a first score based on a measure of freshness of a first online document of the online documents, the measure of freshness being based on an amount of time between a first time when the first online document of the online documents was published and a second time when an event described by the first online document occurred;
calculating, by the processor, a second score based on a quantity of the online documents that have a relationship to the first online document;
ranking, by the processor, the first online document based on the first score and the second score; and
providing, by the processor, the first online document for display based on the ranking of the first online document.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus are described for scoring documents in response, in part, to parameters related to the document, source, and/or cluster score. Methods and apparatus are also described for scoring a cluster in response, in part, to parameters related to documents within the cluster and/or sources corresponding to the documents within the cluster. In one embodiment, the invention may identify the source; detect a plurality of documents published by the source; analyze the plurality of documents with respect to at least one parameter, and determine a source score for the source in response, in part, to the parameter. In another embodiment, the invention may identify a topic; identify a plurality of clusters in response to the topic; analyze at least one parameter corresponding to each of the plurality of clusters; and calculate a cluster score for each of the plurality of clusters in response, in part, to the parameter.
84 Citations
21 Claims
-
1. A computer-implemented method comprising:
-
identifying, by a processor, online documents published online by one or more sources; calculating, by the processor, a first score based on a measure of freshness of a first online document of the online documents, the measure of freshness being based on an amount of time between a first time when the first online document of the online documents was published and a second time when an event described by the first online document occurred; calculating, by the processor, a second score based on a quantity of the online documents that have a relationship to the first online document; ranking, by the processor, the first online document based on the first score and the second score; and providing, by the processor, the first online document for display based on the ranking of the first online document. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
a processor; and a non-transitory computer readable medium storing instructions that, when executed by the processor, cause the processor to perform operations comprising; identifying online documents published online by one or more sources; calculating a first score based on a measure of freshness of a first online document of the online documents, the measure of freshness being based on an amount of time between a first time when the first online document of the online documents was published and a second time when an event described by the first online document occurred; calculating a second score based on a quantity of the online documents that have a relationship to the first online document; ranking the first online document based on the first score and the second score; and providing the first online document for display based on the ranking of the first online document. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable medium having computer executable instructions for performing a method comprising:
-
identifying, by a processor, online documents published online by one or more sources; calculating, by the processor, a first score based on a measure of freshness of a first online document of the online documents, the measure of freshness being based on an amount of time between a first time when the first online document of the online documents was published and a second time when an event described by the first online document occurred; calculating, by the processor, a second score based on a quantity of the online documents that have a relationship to the first online document; ranking, by the processor, the first online document based on the first score and the second score; and providing, by the processor, the first online document for display based on the ranking of the first online document. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification