Method and system for adapting synonym resources to specific domains
First Claim
1. A method of adapting a linguistic resource to a specific knowledge domain, wherein said linguistic resource comprises a plurality of target terms each having one or more meanings, each meaning of each target term having associated therewith a set of synonymous terms, said method comprising the steps of:
- a) ranking said synonymous terms according to the appropriateness of said synonymous term to said domain; and
b) removing synonymous terms from said linguistic resource according to said ranking.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system for processing synonyms that adapts a general-purpose synonym resource to a specific domain. The method selects out a domain-specific subset of synonyms from the set of general-purpose synonyms. The synonym processing method in turn comprises two methods that can be used either together or on their own. A method of synonym pruning eliminates those synonyms that are inappropriate in a specific domain. A method of synonym optimization eliminates those synonyms that are unlikely to be used in a specific domain. The method has many applications including, but not limited to, information retrieval and domain-specific thesauri as a writer'"'"'s aid.
-
Citations
32 Claims
-
1. A method of adapting a linguistic resource to a specific knowledge domain, wherein said linguistic resource comprises a plurality of target terms each having one or more meanings, each meaning of each target term having associated therewith a set of synonymous terms, said method comprising the steps of:
-
a) ranking said synonymous terms according to the appropriateness of said synonymous term to said domain; and
b) removing synonymous terms from said linguistic resource according to said ranking. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 21, 22, 25, 27, 29)
-
- 13. A method of adapting a linguistic resource to a specific knowledge domain, wherein said linguistic resource comprises a plurality of target terms each having one or more meanings, each meaning of each target term having associated therewith a set of synonymous terms, said method comprising minimizing the number of synonymous terms by removing those synonymous terms which are not useful in said domain.
-
26. A method of adapting a linguistic resource to a specific knowledge domain, wherein said linguistic resource comprises a plurality of target terms each having one or more meanings, each meaning of each target term having associated therewith a set of synonymous terms, said method comprising:
-
a) forming a synonymy relation between each meaning of said target term and each synonymous term;
b) automatically ranking said synonymy relation according to the frequency of occurrence of said terms of said synonymy relation in a corpus of data in said domain;
c) evaluating the appropriateness of said synonymy relation to said domain by human evaluators acting on the ranking produced by said automatic ranking;
wherein said automatic ranking step comprises i) producing a numerical value associated with each synonymy relation representing the appropriateness of said synonymy relation to said domain, where said numerical value is produced from the frequency of occurrence of said terms in each said synonymy relation in a corpus of data in said domain and the frequency of occurrence of words which are semantically related to said terms; and
d) removing synonymous terms from said linguistic resource according to said ranking. - View Dependent Claims (28)
-
-
30. A computer program product for adapting a linguistic resource to a specific knowledge domain, wherein said linguistic resource comprises a plurality of target terms each having one or more meanings, each meaning of each target term having associated therewith a set of synonymous terms, said computer program product comprising:
-
a computer usable medium having computer readable program code means embodied in said medium for a) forming a synonymy relation between each meaning of said target term and each synonymous term;
b) automatically ranking said synonymy relation according to the frequency of occurrence of said synonymy relation in a corpus of data in said domain. c) interfacing with human evaluators to evaluate the appropriateness of said synonymy relation to said domain;
wherein said automatic ranking comprises i) producing a numerical value associated with each synonymy relation representing the appropriateness of said synonymy relation to said domain, where said numerical value is produced from the frequency of occurrence of said terms in each said synonymy relation in a corpus of data in said domain and the frequency of occurrence of words which are semantically related to said terms; and
d) removing synonymous terms from said linguistic resource according to said ranking.
-
-
31. A method of adapting a linguistic resource to a specific knowledge domain, wherein said linguistic resource comprises a plurality of target terms each having one or more meanings, each meaning of each target term having associated therewith a set of synonymous terms, said method comprising the steps of:
-
a) ranking said synonymous terms according to the appropriateness of said synonymous term to said domain; and
b) forming a new linguistic resource which is reduced in size by removing synonymous terms from said linguistic resource according to said ranking.
-
-
32. A method of adapting a linguistic resource to a specific knowledge domain, wherein said linguistic resource comprises a plurality of target terms each having one or more meanings, each meaning of each target term having associated therewith a set of synonymous terms, said method comprising forming a new linguistic resource which has a minimum number of synonymous terms by removing those synonymous terms which are not useful in said domain.
Specification