System and method for generating a taxonomy from a plurality of documents
First Claim
1. A method comprising:
inputting text;
extracting phrases from the text;
constructing clusters of the phrases illustrating associations between the phrases;
selecting leader phrases from the clusters that are connected to a pre-determined number of other ones of the phrases; and
defining a taxonomy that classifies categories of information within the text, based on the leader phrases.
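
The steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the stopword-based phrase extractor, the use of sentence co-occurrence as the association between phrases, the reading of "a pre-determined number" as a minimum link count, and every name below (`extract_phrases`, `build_clusters`, `select_leaders`, `define_taxonomy`, `min_links`) are assumptions made for illustration only.

```python
import re
from collections import defaultdict
from itertools import combinations

# Assumption: a tiny stopword list stands in for a real phrase extractor.
STOPWORDS = {"a", "an", "and", "the", "of", "in", "on", "for", "to", "with", "by"}


def extract_phrases(sentence):
    """Return maximal runs of consecutive non-stopword words in one sentence."""
    phrases, run = [], []
    for word in re.findall(r"[a-z]+", sentence.lower()):
        if word in STOPWORDS:
            if run:
                phrases.append(" ".join(run))
            run = []
        else:
            run.append(word)
    if run:
        phrases.append(" ".join(run))
    return phrases


def build_clusters(text):
    """Link every pair of phrases that co-occur in a sentence.

    Assumption: sentence co-occurrence is used as the association measure.
    """
    links = defaultdict(set)
    for sentence in re.split(r"[.!?]+", text):
        phrases = sorted(set(extract_phrases(sentence)))
        for a, b in combinations(phrases, 2):
            links[a].add(b)
            links[b].add(a)
    return links


def select_leaders(links, min_links):
    """Keep phrases connected to at least `min_links` other phrases."""
    return sorted(p for p, others in links.items() if len(others) >= min_links)


def define_taxonomy(text, min_links=2):
    """Map each leader phrase (a category) to the phrases linked to it."""
    links = build_clusters(text)
    return {leader: sorted(links[leader]) for leader in select_leaders(links, min_links)}
```

For example, calling `define_taxonomy` with `min_links=2` on the text "Neural networks for image recognition. Neural networks in speech recognition. Databases for image storage." yields a single category, "neural networks", containing "image recognition" and "speech recognition", because it is the only phrase linked to two others.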
Abstract
A system and method for generating a taxonomy are provided, in which the taxonomy is generated based on clusters of phrases and a topical library. The taxonomy permits a user of a text processing system to rapidly search through a database and find relevant documents, because each classification in the taxonomy is narrow enough to limit the number of documents classified under it.
Specification