System for parametric text to text language translation
First Claim
1. A text-to-text language translation system, comprising:
- a computer processor;
a memory having stored therein a plurality of models, wherein said models are used in text-to-text translation, said plurality of models including;
a parametric translation model for generating a modeled translation probability, wherein said parametric translation model is generated with reference to a translation model source training text and a translation model target training text, said parametric translation model including a first specification of parameters, anda parametric language model for generating a modeled probability, wherein said parametric language model is generated with reference to a language model training text, said parametric language model including a second specification of parameters; and
means for performing text-to-text language translation using said parametric translation model and said parametric language model.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention is a system for translating text from a first source language into a second target language. The system assigns probabilities or scores to various target-language translations and then displays or makes otherwise available the highest scoring translations. The source text is first transduced into one or more intermediate structural representations. From these intermediate source structures a set of intermediate target-structure hypotheses is generated. These hypotheses are scored by two different models: a language model which assigns a probability or score to an intermediate target structure, and a translation model which assigns a probability or score to the event that an intermediate target structure is translated into an intermediate source structure. Scores from the translation model and language model are combined into a combined score for each intermediate target-structure hypothesis. Finally, a set of target-text hypotheses is produced by transducing the highest scoring target-structure hypotheses into portions of text in the target language. The system can either run in batch mode, in which case it translates source-language text into a target language without human assistance, or it can function as an aid to a human translator. When functioning as an aid to a human translator, the human may simply select from the various translation hypotheses provided by the system, or he may optionally provide hints or constraints on how to perform one or more of the stages of source transduction, hypothesis generation and target transduction.
382 Citations
24 Claims
-
1. A text-to-text language translation system, comprising:
-
a computer processor; a memory having stored therein a plurality of models, wherein said models are used in text-to-text translation, said plurality of models including; a parametric translation model for generating a modeled translation probability, wherein said parametric translation model is generated with reference to a translation model source training text and a translation model target training text, said parametric translation model including a first specification of parameters, and a parametric language model for generating a modeled probability, wherein said parametric language model is generated with reference to a language model training text, said parametric language model including a second specification of parameters; and means for performing text-to-text language translation using said parametric translation model and said parametric language model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for text-to-text language translation, comprising the steps of:
-
building a parametric translation model to generate a modeled translation probability, comprising the steps of, storing a translation model source training text, storing a translation model target training text, and choosing a first specification of parameters for the translation model so that the modeled translation probability of the source and target training texts is a first unique local maximum value; building a parametric language model to generate a modeled probability, comprising the steps of, storing a language model raining text, and choosing a second specification of parameters for the language model so that the modeled probability of the given training text is a second unique local maximum value; and performing text-to-text language translation using said parametric translation model and said parametric language model. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A method for translating a first text in a first language into a second text in a second language using a lexical model, comprising the steps of:
-
inputting the first text into the lexical model, wherein the lexical model comprises a parametric translation model for generating a first probability and a parametric language model for generating a second probability; and determining, using the lexical model, the second text in the second language that yields a unique local maximum value of a product of the first probability of the parametric translation model and the second probability of the parametric language model.
-
Specification