Language model using reverse translations
First Claim
1. A method comprising:
- accessing a translation system, the translation system configured to generate a machine translation of source material from a source language into a destination language, the translation system being trained using destination language training data and comprising:
a translation model configured to receive the source material and generating one or more destination language hypotheses for the source material, and
a language model configured to select one of the destination language hypotheses based on an analysis of the destination language training data;
- analyzing supplemental destination language training data for training the language model, the supplemental destination language training data comprising one or more of:
monolingual destination language material that has been previously machine translated from the source language, or
destination language material for which translation into the source language has been previously requested; and
- based on the analyzing, modifying the language model to account for the supplemental destination language training data.
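The flow recited in claim 1 (a translation model proposes hypotheses, a language model selects one, and the language model is then modified using supplemental destination language data) can be illustrated with a toy example. This is a minimal sketch under stated assumptions, not the patented implementation: the `BigramLM` class, the add-one smoothing, the `select_hypothesis` helper, and all sentences are hypothetical, and the hypotheses are hard-coded rather than produced by a real translation model.

```python
import math
from collections import Counter


class BigramLM:
    """Toy add-one-smoothed bigram language model (illustrative assumption)."""

    def __init__(self):
        self.unigrams = Counter()
        self.bigrams = Counter()

    def train(self, sentences):
        # Accumulate counts. Calling this again with supplemental data
        # modifies the model so that it accounts for that data.
        for sentence in sentences:
            tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
            self.unigrams.update(tokens)
            self.bigrams.update(zip(tokens, tokens[1:]))

    def log_prob(self, sentence):
        tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
        vocab = len(self.unigrams) + 1  # +1 reserves mass for unseen words
        return sum(
            math.log((self.bigrams[(a, b)] + 1) / (self.unigrams[a] + vocab))
            for a, b in zip(tokens, tokens[1:])
        )


def select_hypothesis(lm, hypotheses):
    """Pick the destination language hypothesis the language model scores highest."""
    return max(hypotheses, key=lm.log_prob)


# Hypothetical hypotheses, standing in for the translation model's output.
hypotheses = ["i am at the bank", "i am at the shore"]

# Initial destination language training data (hypothetical).
lm = BigramLM()
lm.train(["the bank is closed"])
before = select_hypothesis(lm, hypotheses)

# Supplemental data (hypothetical): destination language text that was
# previously machine translated, or submitted for reverse translation.
lm.train([
    "we walked along the shore",
    "the shore was quiet",
    "meet me at the shore",
])
after = select_hypothesis(lm, hypotheses)

print(before)
print(after)
```

In this sketch the selected hypothesis changes once the supplemental sentences are added, because they shift the bigram counts toward the vocabulary users actually translate. A production language model would use a far larger corpus and a stronger smoothing or neural approach.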
Abstract
Exemplary embodiments relate to techniques for improving machine translation systems. The machine translation system may apply one or more models for translating material from a source language into a destination language. The models are initially trained using training data. According to exemplary embodiments, supplemental training data is used to train the models, where the supplemental training data uses in-domain material to improve the quality of output translations. In-domain data may include data that relates to the same or similar topics as those expected to be encountered in a translation of material from the source language into the destination language. In-domain data may include material previously translated from the source language into the destination language, material similar to previous translations, and destination language material that has previously been the subject of a request for translation into the source language.
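As one way to picture how the in-domain supplemental data described in the abstract might be gathered, the sketch below filters a log of past translation requests. It is an assumption-laden illustration only: the `TranslationRequest` record, its field names, and the `collect_supplemental` helper are hypothetical and do not come from the patent text. It covers two of the listed sources (previous translations into the destination language, and destination language material submitted for translation into the source language) and leaves out "material similar to previous translations".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TranslationRequest:
    """Hypothetical log record for one past translation request."""
    source_lang: str
    target_lang: str
    input_text: str
    output_text: str


def collect_supplemental(log, src, dst):
    """Gather destination language text for a src -> dst translation system."""
    supplemental = []
    for request in log:
        if request.source_lang == src and request.target_lang == dst:
            # Material previously machine translated into the destination language.
            supplemental.append(request.output_text)
        elif request.source_lang == dst and request.target_lang == src:
            # Destination language material whose reverse translation was requested.
            supplemental.append(request.input_text)
    return supplemental


# Hypothetical log entries.
log = [
    TranslationRequest("es", "en", "feliz cumpleaños", "happy birthday"),
    TranslationRequest("en", "es", "see you at the game", "nos vemos en el partido"),
    TranslationRequest("fr", "de", "bonjour", "hallo"),
]

supplemental = collect_supplemental(log, src="es", dst="en")
print(supplemental)
```

The reverse-direction branch reflects the idea in the title: text that users asked to have translated out of the destination language is, by construction, authentic destination language material from the same domain.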
17 Claims
1. A method comprising:
- accessing a translation system, the translation system configured to generate a machine translation of source material from a source language into a destination language, the translation system being trained using destination language training data and comprising:
a translation model configured to receive the source material and generating one or more destination language hypotheses for the source material, and
a language model configured to select one of the destination language hypotheses based on an analysis of the destination language training data;
- analyzing supplemental destination language training data for training the language model, the supplemental destination language training data comprising one or more of:
monolingual destination language material that has been previously machine translated from the source language, or
destination language material for which translation into the source language has been previously requested; and
- based on the analyzing, modifying the language model to account for the supplemental destination language training data.
Dependent claims: 2, 3, 4, 5, 6
7. A non-transitory computer-readable medium storing instructions that, when executed by one or more processors, cause the one or more processors to:
- access a translation system, the translation system configured to generate a machine translation of source material from a source language into a destination language, the translation system being trained using destination language training data and comprising:
a translation model configured to receive the source material and generating one or more destination language hypotheses for the source material, and
a language model configured to select one of the destination language hypotheses based on an analysis of the destination language training data;
- analyze supplemental destination language training data for training the language model, the supplemental destination language training data comprising one or more of:
monolingual destination language material that has been previously machine translated from the source language, or
destination language material for which translation into the source language has been previously requested; and
- based on the analyzing, modify the language model to account for the supplemental destination language training data.
Dependent claims: 8, 9, 10, 11, 12
13. An apparatus comprising:
- a non-transitory computer-readable medium configured to store logic for implementing a translation system, the translation system configured to generate a machine translation of source material from a source language into a destination language, the translation system being trained using destination language training data and comprising:
a translation model configured to receive the source material and generating one or more destination language hypotheses for the source material, and
a language model configured to select one of the destination language hypotheses based on an analysis of the destination language training data;
- a processor configured to:
analyze supplemental destination language training data for training the language model, the supplemental destination language training data comprising one or more of:
monolingual destination language material that has been previously machine translated from the source language, or
destination language material for which translation into the source language has been previously requested; and
based on the analyzing, modify the language model to account for the supplemental destination language training data.
Dependent claims: 14, 15, 16, 17
Specification