Methods and apparatus for clustering news content
First Claim
1. A computer-implemented method comprising:
- identifying, by one or more processors, a plurality of documents published online by a source;
calculating, by the one or more processors, a measure of freshness, for the plurality of documents published by the source, where the measure of freshness is derived from a difference between a time that the source published the plurality of documents and a time that a news event, described by the plurality of documents, occurred; and
deriving, by the one or more processors, a source score for the source based, at least in part, on the measure of freshness.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus are described for scoring documents in response, in part, to parameters related to the document, source, and/or cluster score. Methods and apparatus are also described for scoring a cluster in response, in part, to parameters related to documents within the cluster and/or sources corresponding to the documents within the cluster. In one embodiment, the invention may identify the source; detect a plurality of documents published by the source; analyze the plurality of documents with respect to at least one parameter; and determine a source score for the source in response, in part, to the parameter. In another embodiment, the invention may identify a topic; identify a plurality of clusters in response to the topic; analyze at least one parameter corresponding to each of the plurality of clusters; and calculate a cluster score for each of the plurality of clusters in response, in part, to the parameter.
56 Citations
33 Claims
-
1. A computer-implemented method comprising:
-
identifying, by one or more processors, a plurality of documents published online by a source; calculating, by the one or more processors, a measure of freshness, for the plurality of documents published by the source, where the measure of freshness is derived from a difference between a time that the source published the plurality of documents and a time that a news event, described by the plurality of documents, occurred; and deriving, by the one or more processors, a source score for the source based, at least in part, on the measure of freshness. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system, comprising:
-
a processor; and a memory to store one or more instructions, which when executed by the processor, cause the processor to; identify a plurality of documents published online by a source; calculate a measure of freshness, for the plurality of documents published by the source, where the measure of freshness is derived from a difference between a time that the source published the plurality of documents and a time that a news event, described by the plurality of documents, occurred; and derive a source score for the source based, at least in part, on the measure of freshness. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A non-transitory memory device that stores one or more computer-executable instructions for execution by one or more processors, the instructions, comprising:
one or more instructions, which, when executed by the one or more processors, cause the one or more processors to; identify a plurality of documents published online by a source; calculate a measure of freshness, for the plurality of documents published by the source, where the measure of freshness is derived from a difference between a time that the source published the plurality of documents and a time that a news event, described by the plurality of documents, occurred; and derive a source score for the source based, at least in part, on the measure of freshness. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33)
Specification