Method and system for theme-based word sense ambiguity reduction
First Claim
1. A method for reducing word sense ambiguities in a sentence, based on thematic prediction, said method comprising the steps of:
- a. receiving an input sentence consisting of a sequence of part-of-speech tagged words;
b. creating a sequence of sense tagged words from said received sequence of part-of-speech tagged words, each of said senses further being theme tagged;
c. predicting a set of one or more probable themes associated with said created sequence of sense-tagged words;
d. weighting each of said one or more probable themes from said predicted set, and e. reducing sense ambiguities by eliminating remotely probable senses or selecting highly probably senses from said weighted set of one or more probable themes.
1 Assignment
0 Petitions
Accused Products
Abstract
Word sense ambiguity, for “thematic” words in a sentence, is achieved based on thematic prediction. The senses of “thematic” words are disambiguated in a sentence by determining and weighting possible themes for that sentence. Possible themes are determined for that sentence based on thematic information associated with the different senses of each word in the sentence. A highly deterministic thematic-based word sense disambiguation method is used to preprocess the sentence prior to further syntactic and semantic analysis, thereby enhancing accuracy and decreasing the demand for computational resources (memory and CPU) by reducing input ambiguities.
65 Citations
23 Claims
-
1. A method for reducing word sense ambiguities in a sentence, based on thematic prediction, said method comprising the steps of:
-
a. receiving an input sentence consisting of a sequence of part-of-speech tagged words;
b. creating a sequence of sense tagged words from said received sequence of part-of-speech tagged words, each of said senses further being theme tagged;
c. predicting a set of one or more probable themes associated with said created sequence of sense-tagged words;
d. weighting each of said one or more probable themes from said predicted set, and e. reducing sense ambiguities by eliminating remotely probable senses or selecting highly probably senses from said weighted set of one or more probable themes. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for reducing word sense ambiguities in a sentence, based on thematic prediction, said system comprising:
-
a thematic predictor receiving an input sentence comprising a sequence of part-of-speech tagged words and outputting a sequence of sense tagged words and a set of one or more predicted themes associated with said sequence of tagged words;
a thematic scorer weighting each of said set of one or more predicted themes, and a thematic word sense disambiguator reducing sense ambiguities by eliminating remotely probable senses or selecting highly probable senses from said weighted set of one or more probable themes. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. An article of manufacture comprising a computer user medium having computer readable code embodied therein which reduces word sense ambiguities in a sentence, based on thematic prediction, said medium comprising:
-
computer readable program code receiving an input sentence consisting of a sequence of part-of-speech tagged words;
computer readable program code creating a sequence of sense tagged words from said received sequence of part-of-speech words, each of said senses further being theme tagged;
computer readable program code predicting a set of one or more probable themes associated with said created sequence of sense-tagged words;
computer readable program code weighting each of said predicted set of one or more probable themes, and computer readable program code reducing sense ambiguities by eliminating remotely probable senses or selecting highly probably senses based on said weighted set of one or more probable themes. - View Dependent Claims (22, 23)
-
Specification