DEVICE, METHOD, AND PROGRAM FOR WORD SENSE ESTIMATION
First Claim
1. A word sense estimation device comprising:
- a word extraction part which extracts a plurality of words included in input data;
a context analysis part which extracts, for each word extracted by the word extraction part, a context feature of a context in which the word appears in the input data;
a word sense candidate extraction part which extracts each concept stored as a word sense of said each word, as a word sense candidate of said each word, from a concept dictionary storing at least one concept as a word sense of a word; and
a word sense estimation part which executes a plurality of number of times a probability calculation of calculating an evaluation value for said each word of a case where said each concept extracted as the word sense candidate by the word sense candidate extraction part is determined as a word sense, based on a proximity between the context feature of a selected word and the context feature of another word, a proximity between a selected concept and a concept of a word sense candidate of said another word, and a probability that the selected word takes a selected word sense, and of re-calculating the probability based on the evaluation value calculated, and which estimates a concept with a higher probability calculated of said each word, to be a word sense of the word.
1 Assignment
0 Petitions
Accused Products
Abstract
A device and method to estimate a word sense with high accuracy by unsupervised learning. A word sense estimation device executes a plurality of number of times a probability calculation of calculating an evaluation value for each word of a case where each concept extracted as a word sense candidate is determined as a word sense, based on a proximity between a context feature of a selected word and a context feature of another word, a proximity between a selected concept and a word sense of this another word, and a probability that the selected word takes a selected word sense, and of re-calculating the probability based on the evaluation value calculated, and estimates a concept with a higher probability calculated of said each word, to be a word sense of the word.
-
Citations
13 Claims
-
1. A word sense estimation device comprising:
-
a word extraction part which extracts a plurality of words included in input data; a context analysis part which extracts, for each word extracted by the word extraction part, a context feature of a context in which the word appears in the input data; a word sense candidate extraction part which extracts each concept stored as a word sense of said each word, as a word sense candidate of said each word, from a concept dictionary storing at least one concept as a word sense of a word; and a word sense estimation part which executes a plurality of number of times a probability calculation of calculating an evaluation value for said each word of a case where said each concept extracted as the word sense candidate by the word sense candidate extraction part is determined as a word sense, based on a proximity between the context feature of a selected word and the context feature of another word, a proximity between a selected concept and a concept of a word sense candidate of said another word, and a probability that the selected word takes a selected word sense, and of re-calculating the probability based on the evaluation value calculated, and which estimates a concept with a higher probability calculated of said each word, to be a word sense of the word. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A word sense estimation method comprising:
-
a word extraction step of, with a processing device, extracting a plurality of words included in input data; a context analysis step of, with the processing device, extracting, for each word extracted in the word extraction step, a context feature of a context in which the word appears in the input data; a word sense candidate extraction step of, with the processing device, extracting each concept stored as a word sense of said each word, as a word sense candidate of said each word, from a concept dictionary storing at least one concept as a word sense of a word; and a word sense estimation step of, with the processing device;
executing a plurality of number of times a probability calculation of calculating an evaluation value for said each word of a case where each concept extracted as the word sense candidate in the word sense candidate extraction step is determined as a word sense, based on a proximity between the context feature of a selected word and the context feature of another word, a proximity between a selected concept and a concept of a word sense candidate of said another word, and a probability that the selected word takes a selected word sense, and of re-calculating the probability based on the evaluation value calculated; and
estimating a concept with a higher probability calculated of said each word, to be a word sense of the word.
-
-
13. A word sense estimation program adapted to cause a computer to execute:
-
a word extraction process of extracting a plurality of words included in input data; a context analysis process of extracting, for each word extracted in the word extraction process, a context feature of a context in which the word appears in the input data; a word sense candidate extraction process of extracting each concept stored as a word sense of said each word, as a word sense candidate of said each word, from a concept dictionary storing at least one concept as a word sense of a word; and a word sense estimation process of;
executing a plurality of number of times a probability calculation of calculating an evaluation value for said each word of a case where each concept extracted as the word sense candidate in the word sense candidate extraction process is determined as a word sense, based on a proximity between the context feature of a selected word and the context feature of another word, a proximity between a selected concept and a concept of a word sense candidate of said another word, and a probability that the selected word takes a selected word sense, and of re-calculating the probability based on the evaluation value calculated; and
estimating a concept with a higher probability calculated of said each word, to be a word sense of the word.
-
Specification