Method and system for automatically disambiguating the synonymic links in a dictionary for a natural language processing system
First Claim
1. A method for automatically disambiguating the synonymic links in a dictionary for a natural language processing system, wherein said dictionary is stored in the memory of a data processing system and includes a list of "meaning-entries" with their respective synonym lists, characterized in that it comprises the following steps:
- reading from said dictionary a "meaning-entry", the words from which it is a synonym and its own synonyms, to build a synonymic environment table;
digitizing said environment table to build an environment matrix;
building a similarity matrix by computation of similarity indexes between the elements of said environment matrix;
incrementally clustering the words having the greatest similarity index, first with said "meaning-entry" and thereafter with the previous clusters; and
writing the "meaning-entry" and the obtained cluster containing the disambiguated synonyms back into said dictionary.
0 Assignments
0 Petitions
Accused Products
Abstract
A method and system for automatically disambiguating the synonymic links in a dictionary for a natural language processing system. Said dictionary is stored in the memory of a data processing system and includes a list of headwords with their respective synonym lists. The same headword can have different meanings, each of which has its own synonym list, each of which also can have different meanings. Disambiguation of the synonymic links is performed by reading from said dictionary a "meaning-entry", the words from which it is a synonym and its own list of synonyms, to build a synonymic environment table. A similarity index is computed for each pair of words of said environment and the words having the greatest similarity with the aggregate are incrementally clustered. The final cluster is then validated and the "meaning-entry" and its disambiguated synonyms are written back into the dictionary.
-
Citations
9 Claims
-
1. A method for automatically disambiguating the synonymic links in a dictionary for a natural language processing system, wherein said dictionary is stored in the memory of a data processing system and includes a list of "meaning-entries" with their respective synonym lists, characterized in that it comprises the following steps:
-
reading from said dictionary a "meaning-entry", the words from which it is a synonym and its own synonyms, to build a synonymic environment table; digitizing said environment table to build an environment matrix; building a similarity matrix by computation of similarity indexes between the elements of said environment matrix; incrementally clustering the words having the greatest similarity index, first with said "meaning-entry" and thereafter with the previous clusters; and writing the "meaning-entry" and the obtained cluster containing the disambiguated synonyms back into said dictionary. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. Data processing system comprising a first memory, a second memory, an electronic dictionary including a list of "meaning entries" with their respective synonym lists stored in said second memory:
-
means for reading from said dictionary stored in said second memory, a "meaning-entry", the words from which it is a synonym and its own synonyms, to build a synonymic environment table into said first memory; means for digitizing said environment table to build an environment matrix; means for computing similarity indexes between the elements of said environment matrix to build a similarity matrix; means for incrementally clustering the words having the greatest similarity index; and means for writing back into said dictionary stored in said second memory and "meaning-entry" and the words of the cluster, whereby the synonymic links in the dictionary are disambiguated. - View Dependent Claims (9)
-
Specification