System and method to match linguistic structures using thesaurus information
First Claim
Patent Images
1. A method for evaluating similarity among a plurality of data structures comprising:
- analyzing each structure of said plurality of data structures to generate at least one substructure comprising a linguist feature-value pair;
matching said at least one substructure to a database having a plurality of entries to obtain at least one matching entry based on a semantic concept associated with said linguist feature-value pair, said database encoding relationships between semantic concepts represented by said plurality of entries, each semantic concept representing an “
is-a”
relationship between the entities; and
generating a match value using a relative entropy value corresponding to said at least one matching entry, said relative entropy value being calculated relative to an entropy value of a root entry of said plurality of entries.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for evaluating similarity among multiple data structures are provided. In one embodiment, a number of entries are created in a database and a relative entropy value corresponding to each entry is calculated. Each data structure is analyzed and at least one substructure is generated for each data structure as a result of the analysis. This at least one substructure pertaining to each data structure is matched to the entries in the database to obtain matching entries. Finally, a match value is generated using the relative entropy values corresponding to the matching entries.
-
Citations
88 Claims
-
1. A method for evaluating similarity among a plurality of data structures comprising:
-
analyzing each structure of said plurality of data structures to generate at least one substructure comprising a linguist feature-value pair;
matching said at least one substructure to a database having a plurality of entries to obtain at least one matching entry based on a semantic concept associated with said linguist feature-value pair, said database encoding relationships between semantic concepts represented by said plurality of entries, each semantic concept representing an “
is-a”
relationship between the entities; and
generating a match value using a relative entropy value corresponding to said at least one matching entry, said relative entropy value being calculated relative to an entropy value of a root entry of said plurality of entries. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for evaluating similarity among a plurality of data structures comprising:
-
creating a plurality of entries in a database for subsequent comparison with linguist feature-value pairs based on semantic concepts associated with said linguist feature-value pairs, said database encoding relationships between semantic concepts represented by said plurality of entries, each semantic concept representing an “
is-a”
relationship between the entities; and
calculating a relative entropy value corresponding to each entry of said plurality of entries, said relative entropy value being calculated relative to an entropy value of a root entry of said plurality of entries. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer readable medium containing executable instructions which, when executed in a processing system, cause the system to perform a method for evaluating similarity among a plurality of data structures, the method comprising:
-
analyzing each structure of said plurality of data structures to generate at least one substructure comprising a linguist feature-value pair;
matching said at least one substructure to a database having a plurality of entries to obtain at least one matching entry based on a semantic concept associated with said linguist feature-value pair, said database encoding relationships between semantic concepts represented by said plurality of entries, each semantic concept representing an “
is-a”
relationship between the entities; and
generating a match value using a relative entropy value corresponding to said at least one matching entry, said relative entropy value being calculated relative to an entropy value of a root entry of said plurality of entries. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30, 31)
-
-
32. A computer readable medium containing executable instructions which, when executed in a processing system, cause the system to perform a method for evaluating similarity among a plurality of data structures, the method comprising:
-
creating a plurality of entries in a database for subsequent comparison with linguist feature-value pairs based on semantic concepts associated with said linguist feature-value pairs, said database encoding relationships between semantic concepts represented by said plurality of entries, each semantic concept representing an “
is-a”
relationship between the entities; and
calculating a relative entropy value corresponding to each entry of said plurality of entries, said relative entropy value being calculated relative to an entropy value of a root entry of said plurality of entries. - View Dependent Claims (33, 34, 35, 36, 37, 38, 39, 40)
-
-
41. An article of manufacture comprising a program storage medium readable by a computer and tangibly embodying at least one program of instructions executable by said computer to perform method steps for evaluating similarity among a plurality of data structures, said method comprising:
-
analyzing each structure of said plurality of data structures to generate at least one substructure comprising a linguist feature-value pair;
matching said at least one substructure to a database having a plurality of entries to obtain at least one matching entry based on a semantic concept associated with said linguist feature-value pair, said database encoding relationships between semantic concepts represented by said plurality of entries, each semantic concept representing an “
is-a”
relationship between the entities; and
generating a match value using a relative entropy value corresponding to said at least one matching entry, said relative entropy value being calculated relative to an entropy value of a root entry of said plurality of entries. - View Dependent Claims (42, 43, 44, 45, 46, 47, 48, 49, 50, 51)
-
-
52. An article of manufacture comprising a program storage medium readable by a computer and tangibly embodying at least one program of instructions executable by said computer to perform method steps for evaluating similarity among a plurality of data structures, said method comprising:
-
creating a plurality of entries in a database for subsequent comparison with linguist feature-value pairs based on semantic concepts associated with said linguist feature-value pairs, said database encoding relationships between semantic concepts represented by said plurality of entries, each semantic concept representing an “
is-a”
relationship between the entities; and
calculating a relative entropy value corresponding to each entry of said plurality of entries, said relative entropy value being calculated relative to an entropy value of a root entry of said plurality of entries. - View Dependent Claims (53, 54, 55, 56, 57, 58, 59, 60)
-
-
61. A system for evaluating similarity among a plurality of data structures, comprising:
-
means for analyzing each structure of said plurality of data structures to generate at least one substructure comprising a linguist feature-value pair;
means for matching said at least one substructure to a database having a plurality of entries to obtain at least one matching entry based on a semantic concept associated with said linguist feature-value pair, said database encoding relationships between semantic concepts represented by said plurality of entries, each semantic concept representing an “
is-a”
relationship between the entities; and
means for generating a match value using a relative entropy value corresponding to said at least one matching entry, said relative entropy value being calculated relative to an entropy value of a root entry of said plurality of entries. - View Dependent Claims (62, 63, 64, 65, 66, 67, 68, 69, 70, 71)
-
-
72. A system for evaluating similarity among a plurality of data structures, comprising:
-
means for creating a plurality of entries in a database for subsequent comparison with linguist feature-value pairs based on semantic concepts associated with said linguist feature-value pairs, said database encoding relationships between semantic concepts represented by said plurality of entries, each semantic concept representing an “
is-a”
relationship between the entities; and
means for calculating a relative entropy value corresponding to each entry of said plurality of entries, said relative entropy value being calculated relative to an entropy value of a root entry of said plurality of entries. - View Dependent Claims (73, 74, 75, 76, 77, 78, 79, 80)
-
-
81. A system for evaluating similarity among a plurality of data structures, comprising:
-
a database having a plurality of entries, said database encoding relationships between semantic concepts represented by said plurality of entries;
an analyzer, coupled to said database, said analyzer configured to analyze each structure of said plurality of data structure to generate at least one substructure comprising a linguist feature-value pair;
a matching unit, coupled to said analyzer and said database, said matching unit configured to match said at least one substructure to at least one entry of said plurality of entries to obtain at least one matching entry based on a semantic concept associated with said linguist feature-value pair, each semantic concept representing an “
is-a”
relationship between the entities; and
an entropy calculator, coupled to said matching unit and said database, configured to generate a match value using a relative entropy value corresponding to said at least one matching entry, said relative entropy value being calculated relative to an entropy value of a root entry of said plurality of entries. - View Dependent Claims (82, 83, 84, 85, 86, 87, 88)
-
Specification