System and method for news events detection and visualization
First Claim
1. An electronic device comprising:
one or more computer-readable storage media configured to store instructions; and
one or more processors configured to execute the instructions to cause the electronic device to:
obtain a document;
obtain from the document a plurality of tokens;
obtain a document vector based on a plurality of frequencies associated with the plurality of tokens;
obtain one or more clusters of documents, each cluster being associated with a plurality of documents, a cluster vector, a cluster weight, and a score;
mark a cluster of the one or more clusters as inactive if the cluster weight of the cluster is below a predetermined weight;
determine a matching cluster from the one or more clusters based at least on the similarity between the document vector and the cluster vector of the matching cluster;
update a database to associate the document with the matching cluster; and
update the score of the matching cluster based at least on a plurality of documents associated with the matching cluster.
Abstract
Systems and methods are disclosed for news events detection and visualization. In accordance with one implementation, a method is provided for news events detection and visualization. The method includes, for example, obtaining a document, obtaining from the document a plurality of tokens, obtaining a document vector based on a plurality of frequencies associated with the plurality of tokens, obtaining one or more clusters of documents, each cluster associated with a plurality of documents and a cluster vector, determining a matching cluster from the one or more clusters based at least on the similarity between the document vector and the cluster vector of the matching cluster, and updating a database to associate the document with the matching cluster.
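The first steps of the abstract (tokens, then a frequency-based document vector) can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names `tokenize` and `document_vector`, the lowercase alphanumeric tokenization rule, and the use of relative frequencies are all assumptions made for the example.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (assumed rule)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def document_vector(tokens):
    """Map each token to its relative frequency in the document (assumed weighting)."""
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {token: count / total for token, count in counts.items()}


tokens = tokenize("Markets rally as markets reopen")
vector = document_vector(tokens)
```

A sparse dictionary is used here because most tokens in a vocabulary do not occur in any single document; other weightings (for example, inverse document frequency) would fit the same interface.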
23 Claims
1. An electronic device comprising:
one or more computer-readable storage media configured to store instructions; and
one or more processors configured to execute the instructions to cause the electronic device to:
obtain a document;
obtain from the document a plurality of tokens;
obtain a document vector based on a plurality of frequencies associated with the plurality of tokens;
obtain one or more clusters of documents, each cluster being associated with a plurality of documents, a cluster vector, a cluster weight, and a score;
mark a cluster of the one or more clusters as inactive if the cluster weight of the cluster is below a predetermined weight;
determine a matching cluster from the one or more clusters based at least on the similarity between the document vector and the cluster vector of the matching cluster;
update a database to associate the document with the matching cluster; and
update the score of the matching cluster based at least on a plurality of documents associated with the matching cluster.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A method performed by one or more processors, the method comprising:
obtaining a document;
obtaining from the document a plurality of tokens;
obtaining a document vector based on a plurality of frequencies associated with the plurality of tokens;
obtaining one or more clusters of documents, each cluster associated with a plurality of documents, a cluster vector, a cluster weight, and a score;
marking a cluster of the one or more clusters as inactive if the cluster weight of the cluster is below a predetermined weight;
determining a matching cluster from the one or more clusters based at least on the similarity between the document vector and the cluster vector of the matching cluster;
updating a database to associate the document with the matching cluster; and
updating the score of the matching cluster based at least on a plurality of documents associated with the matching cluster.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A non-transitory computer-readable medium storing a set of instructions that are executable by one or more processors of one or more electronic devices to cause the one or more electronic devices to perform a method, the method comprising:
obtaining a document;
obtaining from the document a plurality of tokens;
obtaining a document vector based on a plurality of frequencies associated with the plurality of tokens;
obtaining one or more clusters of documents, each cluster associated with a plurality of documents, a cluster vector, a cluster weight, and a score;
marking a cluster of the one or more clusters as inactive if the cluster weight of the cluster is below a predetermined weight;
determining a matching cluster from the one or more clusters based at least on the similarity between the document vector and the cluster vector of the matching cluster;
providing an update to a database to associate the document with the matching cluster; and
updating the score of the matching cluster based at least on a plurality of documents associated with the matching cluster.
Dependent claims: 18, 19, 20, 21, 22, 23
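The cluster-handling steps recited in the independent claims (inactive marking, matching by similarity, association, score update) can be illustrated with a short sketch. Everything below is hypothetical and chosen only for illustration: the `Cluster` record, the `assign` function, the `min_weight` threshold value, cosine similarity as the similarity measure, exclusion of inactive clusters from matching, an in-memory list standing in for the database, and a score equal to the number of associated documents. The claims themselves do not fix any of these choices.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Cluster:
    vector: dict                  # cluster vector: token -> weight
    weight: float                 # cluster weight (how it is derived is not specified)
    documents: list = field(default_factory=list)
    score: float = 0.0
    active: bool = True


def cosine(a, b):
    """Cosine similarity between two sparse vectors (assumed similarity measure)."""
    dot = sum(value * b.get(key, 0.0) for key, value in a.items())
    norm_a = math.sqrt(sum(value * value for value in a.values()))
    norm_b = math.sqrt(sum(value * value for value in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def assign(doc_id, doc_vector, clusters, min_weight=1.0):
    """Mark low-weight clusters inactive, then attach the document to the best match."""
    for cluster in clusters:
        if cluster.weight < min_weight:
            cluster.active = False
    # Assumption: inactive clusters are skipped when looking for a match.
    candidates = [cluster for cluster in clusters if cluster.active]
    if not candidates:
        return None
    best = max(candidates, key=lambda cluster: cosine(doc_vector, cluster.vector))
    best.documents.append(doc_id)             # stands in for the database update
    best.score = float(len(best.documents))   # assumed score: document count
    return best


clusters = [
    Cluster(vector={"markets": 1.0}, weight=2.0),
    Cluster(vector={"election": 1.0}, weight=0.5),
]
match = assign("doc-1", {"markets": 0.4, "rally": 0.2}, clusters)
```

In this example the second cluster falls below the assumed threshold and is marked inactive, so the document is associated with the first cluster and that cluster's score is recomputed from its document list.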
Specification