Method and system for adapting synonym resources to specific domains
First Claim
1. A method of adapting a linguistic resource to a specific knowledge domain, wherein said linguistic resource comprises:
- a plurality of target terms each having one or more meanings anda plurality of synonymy relations where each synonymy relation forms a relation between two synonymous terms with respect to a meaning,said method comprising the steps of;
ranking said synonymy relations in relation to said domain;
identifying in said linguistic resource one or more of said synonymy relations from a group comprising;
(1) irrelevant or (2) redundant or (3) likely not to be used in said knowledge domain;
setting a threshold value wherein said setting of said threshold value occurs either prior or subsequent to said identifying step; and
removing said synonymy relations from said linguistic resource according to said threshold value.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system for processing synonyms that adapts a general-purpose synonym resource to a specific domain. The method selects out a domain-specific subset of synonyms from the set of general-purpose synonyms. The synonym processing method in turn comprises two methods that can be used either together or on their own. A method of synonym pruning eliminates those synonyms that are inappropriate in a specific domain. A method of synonym optimization eliminates those synonyms that are unlikely to be used in a specific domain. The method has many applications including, but not limited to, information retrieval and domain-specific thesauri as a writer'"'"'s aid.
29 Citations
60 Claims
-
1. A method of adapting a linguistic resource to a specific knowledge domain, wherein said linguistic resource comprises:
-
a plurality of target terms each having one or more meanings and a plurality of synonymy relations where each synonymy relation forms a relation between two synonymous terms with respect to a meaning, said method comprising the steps of; ranking said synonymy relations in relation to said domain; identifying in said linguistic resource one or more of said synonymy relations from a group comprising;
(1) irrelevant or (2) redundant or (3) likely not to be used in said knowledge domain;setting a threshold value wherein said setting of said threshold value occurs either prior or subsequent to said identifying step; and removing said synonymy relations from said linguistic resource according to said threshold value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 31, 32, 33)
-
-
29. The method of 1 wherein said identifying step comprises:
-
identifying as either (1) redundant or (2) likely not to be used said synonymy relations which are identical to each other in said linguistic resource; and wherein removing said synonymy relations comprises removing all but one of said synonymy relations from said linguistic resource.
-
-
30. The method of 1 wherein said identifying step comprises:
identifying said synonymy relations that are irrelevant in said linguistic resource by producing the frequency of occurrence of said synonymous terms in synonymy relations in said domain.
-
34. A method of adapting a linguistic resource to a specific knowledge domain, wherein said linguistic resource comprises:
-
a plurality of target terms each having one or more meanings and a plurality of synonymy relations where each synonymy relation forms a relation between two synonymous terms with respect to a meaning, said method comprising the steps of; identifying one or more of said synonymy relations from a group comprising;
(1) irrelevant or (2) redundant or (3) likely not to be used in said knowledge domain; andremoving said synonymy relations from said linguistic resource; wherein said identifying includes identifying as either (1) redundant or (2) likely not to be used said synonymy relations in said linguistic resources which contain a single term that is the same as the largest term. - View Dependent Claims (35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49)
-
-
50. A computer program product for adapting a linguistic resource to a specific knowledge domain, wherein said linguistic resource comprises:
-
a plurality of target terms each having one or more meanings, and a plurality of synonymy relations where each synonymy relation forms a relation between two synonymous terms with respect to a meaning, said computer program product comprising; a computer usable medium having computer readable program code means embodied in said medium for the steps of; ranking said synonymy relations in relation to said domain; identifying one or more of said synonymy relations from a group comprising;
(1) irrelevant or (2) redundant or (3) likely not to be used in said knowledge domain;setting a threshold value wherein said setting of said threshold value occurs either prior or subsequent to said identifying step; and removing said synonymy relations from said linguistic resource according to said threshold value. - View Dependent Claims (51, 52, 53, 54)
-
-
55. A computer program product for adapting a linguistic resource to a specific knowledge domain, wherein said linguistic resource comprises:
-
a plurality of target terms each having one or more, and a plurality of synonymy relations where each synonymy relation forms a relation between two synonymous terms with respect to a meaning, said computer program product comprising; a computer usable medium having computer readable program code means embodied in said medium for; identifying one or more of said synonymy relations from a group comprising;
(1) irrelevant or (2) redundant or (3) likely not to be used in said knowledge domain;setting a threshold value wherein said setting of said threshold value occurs either prior or subsequent to said identifying step; and removing said synonymy relations from said linguistic resource according to said threshold value. - View Dependent Claims (56, 57, 58, 59, 60)
-
Specification