System and method for news events detection and visualization
First Claim
Patent Images
1. An electronic device comprising:
- one or more computer-readable storage media configured to store instructions; and
one or more processors configured to execute the instructions to cause the electronic device to;
obtain a document vector based from a document;
obtain one or more clusters of documents, each cluster being associated with a plurality of documents, a cluster vector and a cluster weight;
determine a matching cluster from the one or more clusters based at least on the similarity between the document vector and the cluster vector of the matching cluster;
associate the document with the matching cluster;
periodically decrease the cluster weights of each of the one or more clusters; and
mark a cluster of the one or more clusters as inactive if the cluster weight of the cluster is below a predetermined weight.
8 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are disclosed for news events detection and visualization. In accordance with one implementation, a method is provided for news events detection and visualization. The method includes, for example, obtaining a document vector based from a document, obtaining one or more clusters of documents, each cluster associated with a plurality of documents, a cluster vector, and a cluster weight, determining a matching cluster from the one or more clusters based at least on the similarity between the document vector and the cluster vector of the matching cluster, and associating the document with the matching cluster.
-
Citations
17 Claims
-
1. An electronic device comprising:
-
one or more computer-readable storage media configured to store instructions; and one or more processors configured to execute the instructions to cause the electronic device to; obtain a document vector based from a document; obtain one or more clusters of documents, each cluster being associated with a plurality of documents, a cluster vector and a cluster weight; determine a matching cluster from the one or more clusters based at least on the similarity between the document vector and the cluster vector of the matching cluster; associate the document with the matching cluster; periodically decrease the cluster weights of each of the one or more clusters; and mark a cluster of the one or more clusters as inactive if the cluster weight of the cluster is below a predetermined weight. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method performed by one or more processors, the method comprising:
-
obtaining a document vector based from a document; obtaining one or more clusters of documents, each cluster being associated with a plurality of documents, a cluster vector and a cluster weight; determining a matching cluster from the one or more clusters based at least on the similarity between the document vector and the cluster vector of the matching cluster; associating the document with the matching cluster; and periodically decreasing the cluster weights of each of the one or more clusters; wherein the determination of the matching cluster is further based on the cluster weight of the matching cluster, and wherein the method further comprises; marking a cluster of the one or more clusters as inactive if the cluster weight of the cluster is below a predetermined value. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A non-transitory computer-readable medium storing a set of instructions that are executable by one or more processors of one or more electronic devices to cause the one or more electronic devices to perform a method, the method comprising:
-
obtaining a document vector based from a document; obtaining one or more clusters of documents, each cluster being associated with a plurality of documents, a cluster vector and a cluster weight; determining a matching cluster from the one or more clusters based at least on the similarity between the document vector and the cluster vector of the matching cluster; associating the document with the matching cluster; and periodically decreasing the cluster weights of each of the one or more clusters; wherein the determination of the matching cluster is further based on the cluster weight of the matching cluster, the method further comprising; marking a cluster of the one or more clusters as inactive if the cluster weight of the cluster is below a predetermined value. - View Dependent Claims (13, 14, 15, 16, 17)
-
Specification