Word Use Difference Information Acquisition Program and Device
Abstract
A device or computer implemented program for accurately and automatically obtaining general-purpose information regarding the usage difference between a plurality of synonyms and quasi-synonyms, such as the types of words with which the synonyms and quasi-synonyms are often used, is provided with: means for receiving the input of a plurality of words; means for extracting sentence data including an inputted word from a corpus; means for analyzing the sentence structure of the sentence data and extracting nouns that are in a grammatical relationship with the inputted word included in the sentence data; means for extracting the nodes representing the nouns and the nodes representing the semantic category of the noun from a thesaurus and forming a directional graph for each inputted word; means for comparing a plurality of directional graphs and extracting the difference nodes; and means for outputting the extracted difference nodes as information relating to the usage difference of the inputted words.
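The first two means in the abstract (sentence extraction and noun extraction) can be illustrated with a minimal sketch. It assumes a hypothetical, already-parsed toy corpus in which each sentence carries (head word, relation, noun) triples of the kind a syntactic parser might produce; the corpus contents, the field names `text` and `relations`, and the function names are illustrative assumptions, not taken from the patent.

```python
from collections import defaultdict

# Hypothetical pre-parsed corpus (an assumption for illustration): each entry
# holds the sentence text and (head word, relation, noun) triples.
CORPUS = [
    {"text": "She ate an apple.",
     "relations": [("eat", "object", "apple")]},
    {"text": "He devoured the novel.",
     "relations": [("devour", "object", "novel")]},
    {"text": "The dog ate the bone.",
     "relations": [("eat", "subject", "dog"), ("eat", "object", "bone")]},
    {"text": "They walked home.",
     "relations": [("walk", "object", "home")]},
]


def extract_sentences(corpus, target_words):
    """Return every sentence entry that contains any of the target words."""
    targets = set(target_words)
    return [
        entry for entry in corpus
        if any(head in targets for head, _rel, _noun in entry["relations"])
    ]


def extract_nouns(sentences, target_words):
    """Collect, per target word, the nouns grammatically related to it."""
    targets = set(target_words)
    nouns = defaultdict(set)
    for entry in sentences:
        for head, _rel, noun in entry["relations"]:
            if head in targets:
                nouns[head].add(noun)
    return dict(nouns)


sentences = extract_sentences(CORPUS, ["eat", "devour"])
nouns_by_target = extract_nouns(sentences, ["eat", "devour"])
```

In a real system the triples would come from a syntactic analyzer run over raw corpus text; they are hard-coded here only to keep the sketch self-contained.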
22 Claims
1. In a computer searchably provided with, or connected to, a corpus, which is a usage example database containing example sentences for a plurality of target vocabulary terms having the same or similar meaning, and a thesaurus, which is a database describing the word-to-word relationship between one word and another within a conceptual hierarchy, a word use difference information acquisition program for extracting and outputting information relating to the difference in usage for a plurality of target terms having the same or similar meaning, said program causing the computer to execute processing comprising:
- a target word inputting step of receiving the input of a plurality of target words,
- a sentence extracting step for accessing the corpus, searching the corpus for each target word for which input thereof has been received in the target word inputting step, and extracting from the corpus each sentence data containing any of said target words,
- a noun extracting step for analyzing the structure of each sentence data extracted in the sentence extracting step, and extracting from each sentence data nouns that exist in a grammatical relationship with the target word contained therein,
- a directional graph forming step for accessing the thesaurus, searching the thesaurus for the nouns extracted in the noun extracting step, extracting the node representing each of said nouns and the node representing the higher ranking conceptual category with respect to each said noun, and forming a directional graph constructed from the thus extracted nodes and links that connect respective higher and lower ranking nodes and show the relationship therebetween with respect to the conceptual hierarchy, for each corresponding target word,
- a difference extracting step for comparing each of the directional graphs formed in the directional graph forming step, and extracting the difference nodes between the directional graphs of different target words, and
- a difference outputting step for outputting the difference between the directional graphs extracted in the difference extracting step as information representing difference in usage between the target words.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
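The directional graph forming and difference extracting steps of claim 1 can be sketched as follows. This is only one possible reading, under stated assumptions: the thesaurus is modeled as a hypothetical child-to-parent dictionary that is assumed to be acyclic, a graph is a pair of node and link sets, and "difference nodes" are taken to be nodes present in one target word's graph but not in the other's. The thesaurus entries, the noun sets, and the function names are invented for illustration.

```python
# Hypothetical thesaurus (an assumption): each concept maps to its
# higher-ranking conceptual category. Assumed to contain no cycles.
THESAURUS = {
    "apple": "fruit",
    "fruit": "food",
    "bread": "food",
    "food": "concrete",
    "novel": "publication",
    "publication": "concrete",
}


def build_graph(nouns, thesaurus):
    """Form a directed graph from the nouns and all their ancestor categories.

    Returns (nodes, links), where each link is a (higher, lower) pair.
    A noun missing from the thesaurus simply becomes an isolated node.
    """
    nodes, links = set(), set()
    for noun in nouns:
        node = noun
        nodes.add(node)
        # Walk upward through the hierarchy until the top is reached.
        while node in thesaurus:
            parent = thesaurus[node]
            nodes.add(parent)
            links.add((parent, node))
            node = parent
    return nodes, links


def difference_nodes(graph_a, graph_b):
    """Return the nodes unique to each of the two graphs."""
    nodes_a, nodes_b = graph_a[0], graph_b[0]
    return nodes_a - nodes_b, nodes_b - nodes_a


# Illustrative noun sets for two near-synonymous target words.
eat_graph = build_graph({"apple", "bread"}, THESAURUS)
devour_graph = build_graph({"apple", "novel"}, THESAURUS)
only_eat, only_devour = difference_nodes(eat_graph, devour_graph)
```

With this toy data, the nodes unique to the second graph are `novel` and its category `publication`, which is the kind of usage difference the outputting step would report. Comparing node sets alone is a simplification; a fuller comparison could also take links or frequencies into account.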
12. A word use difference information acquisition device configured by a computer that is operated according to a program and which extracts and outputs information relating to the difference in usage for a plurality of target terms having the same or similar meaning, said computer being searchably provided with, or connected to, a corpus, which is a usage example database containing example sentences for a plurality of target vocabulary terms having the same or similar meaning, and a thesaurus, which is a database describing the word-to-word relationship between one word and another within a conceptual hierarchy, and comprising:
- a target word inputting means for receiving the input of a plurality of target words,
- a sentence extracting means for accessing the corpus, searching the corpus for each target word for which input thereof has been received by the target word inputting means, and extracting from the corpus each sentence data containing any of said target words,
- a noun extracting means for analyzing the structure of each sentence data extracted by the sentence extracting means, and extracting from each sentence data nouns that exist in a grammatical relationship with the target word contained therein,
- a directional graph forming means for accessing the thesaurus, searching the thesaurus for the nouns extracted by the noun extracting means, extracting the node representing each of said nouns and the node representing the higher ranking conceptual category with respect to each said noun, and forming a directional graph constructed from the thus extracted nodes and links that connect respective higher and lower ranking nodes and show the relationship therebetween with respect to the conceptual hierarchy, for each corresponding target word,
- a difference extracting means for comparing each of the directional graphs formed in the directional graph forming means, and extracting the difference nodes between the directional graphs of different target words, and
- a difference outputting means for outputting the difference between the directional graphs extracted in the difference extracting means as information representing difference in usage between the target words.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
Specification