RECOMMENDING TOPIC CLUSTERS FOR UNSTRUCTURED TEXT DOCUMENTS
First Claim
1. A method comprising:
accessing a plurality of electronic text documents comprising a plurality of terms;
analyzing, by at least one processor, the plurality of terms to determine a significance value for each term;
identifying, based on the significance value determined for each term, a key term from the plurality of terms;
determining, from the plurality of terms, one or more related terms associated with the key term;
generating a topic cluster comprising the key term and the one or more related terms associated with the key term; and
providing, to a client device associated with a user, at least one electronic text document from the plurality of electronic text documents that corresponds to the topic cluster.
Abstract
Embodiments of the present disclosure generally relate to a content management system that automatically determines and generates topic clusters from a collection of electronic text documents. For example, the content management system analyzes a collection of electronic text documents to identify key terms and terms related to the key terms. Based on the key terms and related terms, the content management system generates a topic cluster that includes the key term and related terms. The content management system then organizes the electronic text documents based on terms within a given text document matching terms within a given topic cluster. Further, the content management system presents the topic clusters and organized electronic text documents to a user.
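The organizing step described in the abstract can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the function names `tokenize` and `organize_documents`, the sample documents and clusters, and the rule that a document belongs to a topic cluster when it shares at least one term with it. The abstract does not specify a matching rule.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase a document and return its set of alphabetic terms."""
    return set(re.findall(r"[a-z]+", text.lower()))


def organize_documents(documents, topic_clusters):
    """Map each topic (named by its key term) to indices of matching documents.

    Assumed rule: a document matches a cluster when it shares at least
    one term with the cluster's key term or related terms.
    """
    organized = defaultdict(list)
    for index, text in enumerate(documents):
        terms = tokenize(text)
        for key_term, related_terms in topic_clusters.items():
            if terms & ({key_term} | set(related_terms)):
                organized[key_term].append(index)
    return dict(organized)


# Hypothetical sample data.
documents = [
    "The battery drains quickly and the charger is slow.",
    "Shipping was late and the delivery box was damaged.",
    "Great screen, but the battery life is short.",
]
topic_clusters = {
    "battery": ["charger", "life"],
    "shipping": ["delivery", "late"],
}
organized = organize_documents(documents, topic_clusters)
print(organized)
```

A stricter rule, such as requiring the key term itself or a minimum number of shared terms, would fit the same structure by changing only the condition inside the loop.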
20 Claims
1. A method comprising:
accessing a plurality of electronic text documents comprising a plurality of terms;
analyzing, by at least one processor, the plurality of terms to determine a significance value for each term;
identifying, based on the significance value determined for each term, a key term from the plurality of terms;
determining, from the plurality of terms, one or more related terms associated with the key term;
generating a topic cluster comprising the key term and the one or more related terms associated with the key term; and
providing, to a client device associated with a user, at least one electronic text document from the plurality of electronic text documents that corresponds to the topic cluster.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
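The steps of claim 1 can be sketched in Python as follows. This is not the claimed implementation. As stand-ins, the significance value is assumed to be the number of documents containing a term, and a related term is assumed to be one that co-occurs with the key term in at least `min_cooccurrence` documents. The stopword list, function names, and sample documents are likewise hypothetical.

```python
import re
from collections import Counter

# Small illustrative stopword list (an assumption, not from the claims).
STOPWORDS = {"a", "and", "but", "is", "the", "was"}


def extract_terms(text):
    """Return the set of non-stopword alphabetic terms in a document."""
    return {t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS}


def significance_values(doc_terms):
    """Assumed score: the number of documents that contain each term."""
    return Counter(term for terms in doc_terms for term in terms)


def related_terms(key_term, doc_terms, min_cooccurrence=2):
    """Terms sharing at least min_cooccurrence documents with the key term."""
    cooccurrence = Counter()
    for terms in doc_terms:
        if key_term in terms:
            cooccurrence.update(terms - {key_term})
    return sorted(t for t, n in cooccurrence.items() if n >= min_cooccurrence)


def build_topic_cluster(documents):
    """Return (key term, related terms, documents matching the cluster)."""
    doc_terms = [extract_terms(d) for d in documents]
    scores = significance_values(doc_terms)
    # Highest score wins; ties are broken alphabetically for determinism.
    key_term = min(scores, key=lambda t: (-scores[t], t))
    related = related_terms(key_term, doc_terms)
    cluster = {key_term, *related}
    matching = [d for d, terms in zip(documents, doc_terms) if terms & cluster]
    return key_term, related, matching


documents = [
    "The battery drains fast and the charger is slow.",
    "Battery life is short but the charger works.",
    "The screen is bright.",
    "Battery drains overnight.",
]
key_term, related, matching = build_topic_cluster(documents)
print(key_term, related, len(matching))
```

Document frequency is used only because it is simple to verify by hand; any scoring function that returns one value per term could replace `significance_values` without changing the rest of the sketch.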
11. A system comprising:
at least one processor; and
at least one non-transitory computer-readable storage medium storing instructions that, when executed by the at least one processor, cause the system to:
access a plurality of electronic text documents comprising a plurality of terms;
analyze the plurality of terms to determine a significance value for each term;
identify a first key term from the plurality of terms based on the first key term having a highest significance value;
determine, from the plurality of terms, a first set of related terms associated with the first key term;
generate a first topic cluster comprising the first key term and the first set of related terms associated with the first key term; and
provide, for presentation to a user, a first topic corresponding to the first topic cluster.
Dependent claims: 12, 13, 14, 15, 16, 17
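Claim 11 selects the first key term as the term with the highest significance value. One way to picture this selection, and a possible extension to further clusters, is the greedy Python sketch below. The extension is an assumption: claim 11 itself recites only a first key term and a first topic cluster. The function name `rank_topic_clusters`, the precomputed `significance` and `related` inputs, and the rule that already clustered terms are skipped are all hypothetical.

```python
def rank_topic_clusters(significance, related, max_clusters=3):
    """Greedily build clusters in descending order of significance.

    significance: dict mapping term -> significance value
    related: dict mapping term -> list of related terms
    Terms already placed in a cluster are skipped (assumed rule).
    """
    clusters = []
    used = set()
    # Highest significance first; ties are broken alphabetically.
    for term in sorted(significance, key=lambda t: (-significance[t], t)):
        if term in used:
            continue
        members = [r for r in related.get(term, []) if r not in used]
        clusters.append({"key_term": term, "related_terms": members})
        used.add(term)
        used.update(members)
        if len(clusters) == max_clusters:
            break
    return clusters


# Hypothetical precomputed inputs.
significance = {"battery": 0.9, "charger": 0.7, "shipping": 0.6, "delivery": 0.4}
related = {
    "battery": ["charger"],
    "charger": ["battery"],
    "shipping": ["delivery"],
}
clusters = rank_topic_clusters(significance, related)
print(clusters)
```

The first element of `clusters` corresponds to the first topic cluster of claim 11, since its key term has the highest significance value.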
18. A method comprising:
receiving, from a plurality of users, a plurality of unstructured text responses received in response to an electronic survey;
analyzing, by at least one processor, the plurality of unstructured text responses to identify a plurality of terms;
determining a significance value for each term of the plurality of terms;
identifying, based on the significance value determined for each term, key terms from the plurality of terms;
determining, from the plurality of terms, one or more related terms corresponding to each key term;
generating topic clusters, wherein each topic cluster comprises a key term and at least one related term corresponding to the key term; and
providing, for presentation to an administrative user, a list of topics corresponding to the topic clusters.
Dependent claims: 19, 20
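The term-identification step of claim 18 can be sketched as follows. The claim does not define how terms are identified, so this Python example assumes that a term is a single word or a two-word phrase left after stopword removal. The stopword list, the names `identify_terms` and `count_terms`, and the sample survey responses are hypothetical.

```python
import re
from collections import Counter

# Small illustrative stopword list (an assumption).
STOPWORDS = frozenset({"a", "an", "and", "is", "the", "to", "was", "were"})


def identify_terms(response, max_n=2):
    """Return the unigrams and bigrams of a response as a list of terms.

    N-grams are built after stopword removal, so a bigram may join two
    words that were separated by a stopword in the original text.
    """
    tokens = [t for t in re.findall(r"[a-z]+", response.lower())
              if t not in STOPWORDS]
    terms = []
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            terms.append(" ".join(tokens[i:i + n]))
    return terms


def count_terms(responses):
    """Count the number of responses in which each term appears."""
    counts = Counter()
    for response in responses:
        counts.update(set(identify_terms(response)))
    return counts


# Hypothetical survey responses.
responses = [
    "The customer service was slow.",
    "Customer service is great!",
    "Slow shipping.",
]
counts = count_terms(responses)
print(counts.most_common(3))
```

Counts of this kind could serve as input to whatever significance measure is used in the next step of the claim.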
Specification