MACHINE TRANSLATION DEVICE AND MACHINE TRANSLATION METHOD IN WHICH A SYNTAX CONVERSION MODEL AND A WORD TRANSLATION MODEL ARE COMBINED
Abstract
The present invention relates to statistical machine translation (SMT) and provides a machine translation device and a machine translation method. Syntax conversion knowledge and word translation knowledge are extracted from a parallel corpus together with their respective conversion probabilities, and a creation probability for the target language is acquired from a single corpus. A translation model learning device learns from each item of conversion knowledge and each probability to build a weighted translation model. The translation model is then applied to a source sentence input in real time, and a target sentence is generated through the decoding processes of a syntax converter and a word translator. This resolves disadvantages of existing phrase-based SMT and syntax-based SMT while combining their advantages.
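The decoding step summarized in the abstract can be illustrated with a minimal sketch. It assumes a toy representation that does not come from the patent: syntax conversion knowledge is stored as a map from a source tag pattern to candidate target word orders with probabilities, and word translation knowledge as a map from a source word to candidate target words with probabilities. The names `decode`, `SYNTAX_MODEL`, and `WORD_MODEL`, the tags, the placeholder target words, and all numbers are hypothetical. The learned weights and the target-language creation probability from a single corpus are omitted for brevity.

```python
import math
from itertools import product

# Hypothetical toy models; every entry below is illustrative only.
# Syntax conversion model: source tag pattern -> [(target order, probability)]
SYNTAX_MODEL = {
    ("N", "V", "N"): [((0, 2, 1), 0.7), ((0, 1, 2), 0.3)],
}
# Word translation model: source word -> [(target word, probability)]
WORD_MODEL = {
    "he": [("HE", 1.0)],
    "reads": [("READS", 0.6), ("STUDIES", 0.4)],
    "books": [("BOOKS", 0.9), ("RESERVES", 0.1)],
}


def decode(source, syntax_model, word_model):
    """Return (target_words, probability) for the best-scoring candidate.

    source is a list of (word, tag) pairs. Scores are summed in log space,
    so all probabilities are assumed to be strictly positive.
    """
    tags = tuple(tag for _, tag in source)
    # Assumption: fall back to the source order when no pattern matches.
    identity = (tuple(range(len(source))), 1.0)
    best_words, best_score = None, -math.inf
    for order, p_syntax in syntax_model.get(tags, [identity]):
        # Step 1: syntax conversion fixes the target word order.
        reordered = [source[i][0] for i in order]
        # Step 2: word translation fills each position under that order.
        candidates = [word_model[word] for word in reordered]
        for choice in product(*candidates):
            # Step 3: combine the syntax and word probabilities.
            score = math.log(p_syntax) + sum(math.log(p) for _, p in choice)
            if score > best_score:
                best_words = [word for word, _ in choice]
                best_score = score
    return best_words, math.exp(best_score)


SOURCE = [("he", "N"), ("reads", "V"), ("books", "N")]
words, probability = decode(SOURCE, SYNTAX_MODEL, WORD_MODEL)
print(words, round(probability, 3))
```

The exhaustive enumeration is used only to keep the sketch short; a practical decoder would prune candidates, for example with beam search.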
19 Claims
1. A statistical machine translation device, comprising:
a translation model constructor extracting syntax conversion knowledge and word translation knowledge of a target sentence by using word reordering information and syntax analysis information between a source sentence and the target sentence in a plurality of parallel corpora, and calculating conversion probabilities with respect to the respective extracted knowledges;
a translation model learning device generating a syntax conversion model and a word translation model by learning the respective translation knowledges and conversion probabilities extracted through the translation model constructor; and
a translated sentence generator decoding the source sentence into the target sentence by applying the syntax conversion model and the word translation model learned through the translation model learning device with respect to a source sentence input in real time.
Dependent claims: 2, 3
4. A translation model constructing apparatus, comprising:
a syntax conversion knowledge extractor extracting the syntax conversion knowledge for a target sentence by using word reordering information between a source sentence and a target sentence, and syntax analysis information of the source sentence in a plurality of parallel corpora and calculating a conversion probability with respect to the extracted knowledge; and
a word translation knowledge extractor extracting the word translation knowledge by using the word reordering information between the source sentence and the target sentence, and the syntax analysis information of the source sentence in the plurality of parallel corpora and calculating the conversion probability with respect to the extracted knowledge.
Dependent claims: 5, 6, 7, 8
9. A translation sentence generating apparatus, comprising:
a syntax converter syntactically analyzing a source sentence input in real time, extracting syntax conversion knowledge of a target sentence from a syntax of the analyzed source sentence, and making the extracted knowledge to learn a conversion probability;
a word translator generating a target vocabulary string based on a word translation model in which a constraint condition is imposed to the syntax of the target sentence extracted through the syntax converter; and
a probability calculator combining a creation probability of the target vocabulary string generated through the word translator with the conversion probability learned through the syntax converter and thereafter, generating a target vocabulary string having the highest probability into a translation sentence.
Dependent claims: 10
11. A machine translation method, comprising:
(a) syntactically analyzing a source sentence input in real time and extracting syntax conversion knowledge and a conversion probability of a target sentence from a syntax of the analyzed source sentence;
(b) generating a target vocabulary string based on a word translation model in which a constraint condition is imposed to the syntax of the target sentence extracted from the syntax conversion knowledge of the target sentence; and
(c) generating a target vocabulary string having a high probability into a translation sentence by combining the syntax conversion probability of the target sentence and a creation probability of the target vocabulary string.
Dependent claims: 12, 13, 19
14. A translation model constructing method, comprising:
(a) extracting syntax conversion knowledge for a target sentence by using word reordering information between a source sentence and a target sentence and syntax analysis information of the source sentence in a plurality of parallel corpora;
(b) extracting word translation knowledge by using the word reordering information between the source sentence and the target sentence and the syntax analysis information of the source sentence in the plurality of parallel corpora; and
(c) calculating conversion probabilities for the syntax conversion knowledge and the word translation knowledge, respectively and making a weight to be learned with respect to each conversion probability.
Dependent claims: 15, 16, 17, 18
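The probability calculation recited in the translation model constructing claims can be illustrated with a minimal sketch. The claims do not state which estimator is used, so relative frequency over extracted (source, target) pairs is assumed here purely for illustration. The function name `estimate_conversion_probabilities`, the tag patterns, the placeholder target words, and the counts are all hypothetical, and the weight learning of step (c) is not covered.

```python
from collections import Counter, defaultdict


def estimate_conversion_probabilities(observations):
    """Estimate P(target | source) by relative frequency.

    observations is an iterable of (source_item, target_item) pairs that
    are assumed to have been extracted from word-aligned, parsed parallel
    sentences. Items only need to be hashable.
    """
    observations = list(observations)
    pair_counts = Counter(observations)
    source_counts = Counter(source for source, _ in observations)
    table = defaultdict(dict)
    for (source, target), count in pair_counts.items():
        table[source][target] = count / source_counts[source]
    return dict(table)


# Hypothetical syntax conversion observations:
# (source tag pattern, target word order as source indices).
syntax_observations = [
    (("N", "V", "N"), (0, 2, 1)),
    (("N", "V", "N"), (0, 2, 1)),
    (("N", "V", "N"), (0, 2, 1)),
    (("N", "V", "N"), (0, 1, 2)),
]
# Hypothetical word translation observations: (source word, target word).
word_observations = [
    ("reads", "READS"),
    ("reads", "READS"),
    ("reads", "STUDIES"),
]

syntax_table = estimate_conversion_probabilities(syntax_observations)
word_table = estimate_conversion_probabilities(word_observations)
print(syntax_table)
print(word_table)
```

The same function serves both knowledge types because each is reduced to counting how often a target-side item co-occurs with a source-side item; smoothing for unseen pairs is left out of this sketch.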
Specification