System and method for generating cluster spines
First Claim
1. A system for generating cluster spines, comprising:
a set of clusters, each cluster comprising one or more documents and each document associated with a document concept formed from one or more terms extracted from that document;
a cluster concept determination module to determine at least one cluster concept for each cluster, comprising:
a concept ranking module to rank the document concepts;
a concept selection module to select at least one of the document concepts that is highly ranked as the cluster concept; and
a concept removal module to discard those cluster concepts that are referenced by more than 10% of the clusters;
a spine generation module to form one or more spines each comprising two or more clusters from the cluster set that share at least one of the cluster concepts, wherein the shared cluster concept is identified as a spine concept;
a cluster assignment module to assign one or more of the clusters remaining in the cluster set to the spines based on a similarity between the cluster concepts for the remaining clusters and the spine concepts for the formed spines; and
a processor to execute the modules.
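The cluster concept determination step recited above (rank, select, discard) can be sketched as follows. This is only an illustration under stated assumptions: the claim does not say how document concepts are ranked, so the sketch ranks by how many documents in the cluster carry a concept. The names `determine_cluster_concepts`, `top_k`, and `max_share` are hypothetical, as is the choice to model each document as a list of concept strings. Only the 10% cutoff comes from the claim.

```python
from collections import Counter


def determine_cluster_concepts(clusters, top_k=2, max_share=0.10):
    """Pick cluster concepts, then drop those referenced by too many clusters.

    clusters: dict mapping a cluster id to a list of documents, where each
    document is a list of concept strings (an assumed representation).
    """
    selected = {}
    for cid, docs in clusters.items():
        # Count, per concept, how many documents in this cluster carry it.
        counts = Counter(c for doc in docs for c in set(doc))
        # Assumed ranking: document frequency, ties broken alphabetically.
        ranked = sorted(counts, key=lambda c: (-counts[c], c))
        # Select the highly ranked concepts as the cluster concepts.
        selected[cid] = ranked[:top_k]

    # Count how many clusters reference each selected concept.
    refs = Counter(c for concepts in selected.values() for c in concepts)
    limit = max_share * len(clusters)

    # Discard cluster concepts referenced by more than 10% of the clusters.
    return {
        cid: [c for c in concepts if refs[c] <= limit]
        for cid, concepts in selected.items()
    }
```

With twenty clusters that all share one concept and each have one distinctive concept, the shared concept is referenced by 100% of the clusters and is discarded, leaving only the distinctive concept for each cluster.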
Abstract
A system and method for generating cluster spines is provided. Clusters of documents are maintained. Each document is associated with a document concept that is formed from one or more terms extracted from that document. At least one cluster concept is determined for each cluster. The document concepts are ranked and at least one of the document concepts that is highly ranked is selected as the cluster concept. One or more spines are formed. Each spine includes two or more clusters that share at least one of the cluster concepts. The shared cluster concept is identified as a spine concept. One or more of the remaining clusters is assigned to the spines based on a similarity between the cluster concepts for the remaining clusters and the spine concepts for the formed spines.
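The spine formation and assignment steps described in the abstract can be sketched roughly as below. The abstract does not define the similarity measure, so this sketch assumes Jaccard overlap between the terms that make up the concepts. It also assumes that each cluster joins at most one spine and that larger candidate groups are considered first. All function names and the `min_similarity` threshold are hypothetical.

```python
from collections import defaultdict


def form_spines(cluster_concepts):
    """Group clusters that share a cluster concept into spines.

    Returns a dict mapping each spine concept to its member cluster ids.
    """
    by_concept = defaultdict(list)
    for cid, concepts in cluster_concepts.items():
        for concept in concepts:
            by_concept[concept].append(cid)

    spines, placed = {}, set()
    # Assumption: larger groups first, ties broken alphabetically.
    ordered = sorted(by_concept.items(), key=lambda kv: (-len(kv[1]), kv[0]))
    for concept, cids in ordered:
        members = [cid for cid in cids if cid not in placed]
        if len(members) >= 2:  # a spine needs two or more clusters
            spines[concept] = members
            placed.update(members)
    return spines


def terms(concepts):
    """Split concept strings into a set of lowercase terms."""
    return {t for c in concepts for t in c.lower().split()}


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def assign_remaining(cluster_concepts, spines, min_similarity=0.2):
    """Attach leftover clusters to the most similar spine, if any."""
    placed = {cid for members in spines.values() for cid in members}
    assigned = {concept: list(members) for concept, members in spines.items()}
    for cid, concepts in cluster_concepts.items():
        if cid in placed or not spines:
            continue
        cluster_terms = terms(concepts)
        best = max(spines, key=lambda s: jaccard(cluster_terms, terms([s])))
        if jaccard(cluster_terms, terms([best])) >= min_similarity:
            assigned[best].append(cid)
    return assigned


# Illustrative data only; not taken from the patent.
example = {
    "c1": ["wire fraud"], "c2": ["wire fraud", "audit"],
    "c3": ["energy trading"], "c4": ["energy trading"],
    "c5": ["fraud"], "c6": ["holiday party"],
}
spines = form_spines(example)
print(assign_remaining(example, spines))
# {'energy trading': ['c3', 'c4'], 'wire fraud': ['c1', 'c2', 'c5']}
```

In this example, cluster `c5` shares the term "fraud" with the spine concept "wire fraud" and is attached to that spine, while `c6` has no overlapping terms with any spine concept and stays unassigned.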
219 Citations
18 Claims
1. A system for generating cluster spines, comprising:
a set of clusters, each cluster comprising one or more documents and each document associated with a document concept formed from one or more terms extracted from that document;
a cluster concept determination module to determine at least one cluster concept for each cluster, comprising:
a concept ranking module to rank the document concepts;
a concept selection module to select at least one of the document concepts that is highly ranked as the cluster concept; and
a concept removal module to discard those cluster concepts that are referenced by more than 10% of the clusters;
a spine generation module to form one or more spines each comprising two or more clusters from the cluster set that share at least one of the cluster concepts, wherein the shared cluster concept is identified as a spine concept;
a cluster assignment module to assign one or more of the clusters remaining in the cluster set to the spines based on a similarity between the cluster concepts for the remaining clusters and the spine concepts for the formed spines; and
a processor to execute the modules.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A method for generating cluster spines by executing on a computer the method comprising:
maintaining a set of clusters, each cluster comprising one or more documents and each document associated with a document concept formed from one or more terms extracted from that document;
determining at least one cluster concept for each cluster, comprising:
ranking the document concepts;
selecting at least one of the document concepts that is highly ranked as the cluster concept; and
discarding those cluster concepts that are referenced by more than 10% of the clusters;
forming one or more spines each comprising two or more clusters from the cluster set that share at least one of the cluster concepts, wherein the shared cluster concept is identified as a spine concept; and
assigning one or more of the clusters remaining in the cluster set to the spines based on a similarity between the cluster concepts for the remaining clusters and the spine concepts for the formed spines.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
Specification