GENERATING A TAXONOMY FROM UNSTRUCTURED INFORMATION
First Claim
Patent Images
1. A method [200A] for generating a taxonomy from unstructured information, said method comprising:
- extracting [202] at least one term from unstructured information [122];
validating [204] said at least one term [124];
determining [206] a sense of at least one extracted and validated term [108];
clustering [208] said at least one extracted and validated term [108] into at least one group [112] of terms according to said determined sense; and
generating [210] a taxonomy [120] based on said clustering and a minin of accessible taxonomies.
2 Assignments
0 Petitions
Accused Products
Abstract
At least one term is extracted [202] from unstructured information. The at least one term is validated [204]. Then, a sense of the at least one extracted and validated term is determined [206]. The at least one extracted and validated term is clustered [208] into at least one group of terms according to the determined sense. A taxonomy is generated [210] based on the clustering and a mining of accessible taxonomies.
-
Citations
15 Claims
-
1. A method [200A] for generating a taxonomy from unstructured information, said method comprising:
-
extracting [202] at least one term from unstructured information [122]; validating [204] said at least one term [124]; determining [206] a sense of at least one extracted and validated term [108]; clustering [208] said at least one extracted and validated term [108] into at least one group [112] of terms according to said determined sense; and generating [210] a taxonomy [120] based on said clustering and a minin of accessible taxonomies. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system [100] comprising:
-
a term extractor [104] configured for extracting at least one term [124] from unstructured information [122]; a term validater [106] configured for validating said at least one term [124]; a sense determiner [126] configured for determining a sense of at least one extracted and validated term [108]; a term clusterer [110] configured for clustering said at least one extracted and validated term [108] into at least one group [112] of terms according to a determined sense; and a taxonomy generator [118] configured for generating a taxonomy [120] based on said clustering and a mining of taxonomies [102]. - View Dependent Claims (13, 14)
-
-
15. A non-transitory computer-readable storage medium comprising instructions stored thereon which, when executed by a computer system, cause said computer system to perform a method [200B] for generating a taxonomy from unstructured information [122], said method comprising:
-
extracting [214] at least one term [124] from unstructured information [122]; validating [216] said at least one term [124]; determining [218] a sense of at least one extracted and validated term [108], said determining comprising; determining a shared sense of a first set of said at least one extracted and validated term [108] that is unambiguous; and based on a determined shared sense, disambiguating a second set of said at least one extracted and validated term [108] that is ambiguous; clustering [220] said at least one extracted and validated term [108] into at least one group of terms according to said determined sense; and generating [222] a taxonomy [120] based on said clustering and a mining of taxonomies.
-
Specification