Sentence translator using a thesaurus and a concept-organized co- occurrence dictionary to select from a plurality of equivalent target words
First Claim
1. A natural language generating system, which translates a source language text into a target language, said system comprises:
- a word dictionary memory storing a predicate concept represented by a source language word and at least one target language word corresponding thereto;
a co-occurrence relation dictionary memory storing co-occurrence relation between a target language word and the predicate concept;
a thesaurus memory representing implication among predicate concepts represented by source language words; and
,a processing device referring to the co-occurrence relation dictionary memory and the thesaurus memory when a plurality of target language words exist corresponding to a predicate concept represented by a source language word to be translated in the word dictionary memory, and selecting a target language word having co-occurrence relation with an argument concept corresponding to the predicate concept represented by the source language word to be translated or with the general concept thereof.
1 Assignment
0 Petitions
Accused Products
Abstract
In a system of performing automatic translation from natural language into another natural language based on dependence relation among concepts, a thesaurus representing implication among concepts and a co-occurrence dictionary storing co-occurrence relation between a predicate word and an argument concept are utilized to select a suitable translation word when a plurality of candidate words exist corresponding to a predicate concept. Furthermore, knowledge regarding the co-occurrence relation is obtained based on feedback information from a revise operation to translation results, thereby contents of co-occurrence relation dictionary can be more and more complete.
101 Citations
7 Claims
-
1. A natural language generating system, which translates a source language text into a target language, said system comprises:
-
a word dictionary memory storing a predicate concept represented by a source language word and at least one target language word corresponding thereto; a co-occurrence relation dictionary memory storing co-occurrence relation between a target language word and the predicate concept; a thesaurus memory representing implication among predicate concepts represented by source language words; and
,a processing device referring to the co-occurrence relation dictionary memory and the thesaurus memory when a plurality of target language words exist corresponding to a predicate concept represented by a source language word to be translated in the word dictionary memory, and selecting a target language word having co-occurrence relation with an argument concept corresponding to the predicate concept represented by the source language word to be translated or with the general concept thereof. - View Dependent Claims (2, 3)
-
-
4. A natural language generating method which translates a source language text into a target language, said method comprises:
-
a first step of reading a source language sentence inputted from an input device, referring to a source language analysis dictionary stored in a memory, analyzing the syntactic and semantic structure of the source language sentence and case relations, and obtaining intermediate representation comprising dependence relations among concepts; a second step of referring to a target language generation dictionary, a co-occurrence relation dictionary and a thesaurus memory, and determining at least one target language word corresponding to each concept in the intermediate representation; and a third step of referring to the target language generation dictionary, determining an arrangement of the target words determined at said second step based on case frames of predicate words, and outputting a translation result to an output device. - View Dependent Claims (5, 6)
-
-
7. A natural language generating system for translating a source language into a target language comprising:
-
means for inputting the source language into the system; means for outputting the target language out of the system; processing means for translating a source language text into a target language comprising first memory means for storing a source language analysis dictionary including relational data of a source word, a part of speech, a case frame and an associated concept;
a second memory means for storing a target language generation dictionary similarly including relational data of a target word, a part of speech, a case frame and an associated concept;
a third memory means for storing a co-occurrence relationship comprising a predicate word of target language, an argument concept, and a case thereof; and
a thesaurus memory for storing a relationship of a concept and a general concept thereof, said processing means being in communication with the memories for selected access thereto whereby a target language sentence is generated by identifying the part of speech, the concept and the case frame associated with a source word from the first memory means, identifying the predicate-argument structure of a source language sentence on the basis of the case frames of source words representing a predicate concept and parts of speech source words, identifying a plurality of candidates of target words, for a predicate concept, and the part of speech and the case frame associated with a target word from the second memory means, identifying general concepts of an argument concept from the thesaurus memory;
selecting a candidate word is a triplet comprising the candidate word, an argument concept or a general concept of an argument concept, and the case of the argument is retrieved from the third memory means; and
determining the syntactic structure of a target language sentence on the basis of the case frames of target words representing a predicate concept and the parts of speech of target words.
-
Specification