Statistical translation system and method for fast sense disambiguation and translation of large corpora using fertility models and sense models
Abstract
A system and method for translating a series of source words in a first language to a series of target words in a second language are provided. The system includes an input device for inputting the series of source words. A fertility hypothesis generator operatively coupled to the input device generates at least one fertility hypothesis for a fertility of a source word, based on the source word and a context of the source word. A sense hypothesis generator operatively coupled to the input device generates sense hypotheses for a translation of the source word, based on the source word and the context of the source word. A fertility model operatively coupled to the fertility hypothesis generator determines a probability of the fertility of the source word, based on the source word and the context of the source word. A sense model operatively coupled to the sense hypothesis generator determines a probability of a target word being a correct translation of the source word, based on the source word and the context of the source word. A decoder operatively coupled to the fertility and sense models generates a list of target words for the translation of the source word, based on the probability calculated by the fertility model and the probability calculated by the sense model.
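The data flow among the five components named in the abstract can be illustrated with a minimal Python sketch. Everything concrete in it is an assumption made for illustration and is not taken from the patent: the toy probability tables, the function names, the representation of context as the other words in the sentence, and in particular the scoring rule, which multiplies the fertility probability by the sense probabilities of the chosen target words. The abstract states only that the decoder uses both probabilities, not how they are combined.

```python
from math import prod
from typing import Dict, List, Sequence, Tuple

Context = Tuple[str, ...]  # assumed: the other words of the source sentence

# Hypothetical toy tables; a real system would estimate these from corpora.
FERTILITY: Dict[str, Dict[int, float]] = {
    "the": {0: 0.1, 1: 0.9},
    "not": {1: 0.2, 2: 0.8},
}

def sense_table(word: str, context: Context) -> Dict[str, float]:
    """Toy context-dependent sense distribution (illustrative values only)."""
    if word == "bank":
        if "river" in context:
            return {"rive": 0.9, "banque": 0.1}
        return {"banque": 0.9, "rive": 0.1}
    fixed = {"the": {"le": 0.7, "la": 0.3}, "not": {"ne": 0.6, "pas": 0.4}}
    return fixed.get(word, {word: 1.0})  # unknown words pass through unchanged

def fertility_hypotheses(word: str, context: Context) -> List[int]:
    """Fertility hypothesis generator: candidate target-word counts."""
    return list(FERTILITY.get(word, {1: 1.0}))

def sense_hypotheses(word: str, context: Context) -> List[str]:
    """Sense hypothesis generator: candidate target words."""
    return list(sense_table(word, context))

def fertility_prob(n: int, word: str, context: Context) -> float:
    """Fertility model: probability that the word yields n target words."""
    return FERTILITY.get(word, {1: 1.0}).get(n, 0.0)

def sense_prob(target: str, word: str, context: Context) -> float:
    """Sense model: probability that target is a correct translation."""
    return sense_table(word, context).get(target, 0.0)

def decode_word(word: str, context: Context) -> List[str]:
    """Decoder: pick the fertility and target words with the best joint score.

    Assumed scoring rule: P(fertility) times the product of P(sense) over
    the n most probable target words.
    """
    ranked = sorted(sense_hypotheses(word, context),
                    key=lambda t: sense_prob(t, word, context), reverse=True)
    best_score, best_targets = -1.0, []
    for n in fertility_hypotheses(word, context):
        if n > len(ranked):
            continue  # not enough sense hypotheses to fill this fertility
        targets = ranked[:n]
        score = fertility_prob(n, word, context) * prod(
            sense_prob(t, word, context) for t in targets)
        if score > best_score:
            best_score, best_targets = score, targets
    return best_targets

def translate(words: Sequence[str]) -> List[List[str]]:
    """Run the decoder over each source word with its sentence context."""
    return [decode_word(w, tuple(words[:i]) + tuple(words[i + 1:]))
            for i, w in enumerate(words)]
```

In this sketch, `decode_word("bank", ("river",))` selects a different target word than `decode_word("bank", ("money",))`, which is the role the abstract assigns to context in the sense model. The sketch does no target-side reordering and handles each source word independently, a simplification that the abstract does not imply.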
37 Claims
1. A system for translating a series of source words in a first language to a series of target words in a second language, comprising:
- input means for inputting the series of source words;
- a fertility hypothesis generator operatively coupled to said input means for generating at least one fertility hypotheses for a fertility of a source word, based on the source word and a context of the source word;
- a sense hypothesis generator operatively coupled to said input means for generating sense hypotheses for a translation of the source word, based on the source word and the context of the source word;
- a fertility model operatively coupled to said fertility hypothesis generator for determining a probability of the fertility of the source word, based on the source word and the context of the source word;
- a sense model operatively coupled to said sense hypothesis generator for determining a probability of a target word being a correct translation of the source word, based on the source word and the context of the source word; and
- a decoder operatively coupled to said fertility and sense models for generating a list of target words for the translation of the source word, based on the probability calculated by said fertility model and the probability calculated by said sense model.
Dependent claims: 2-18.
19. A method for translating a series of source words in a first language to a series of target words in a second language, comprising the steps of:
- inputting the series of source words;
- generating at least one fertility hypotheses for a fertility of a source word, based on a source word and a context of the source word;
- generating sense hypotheses for a translation of the source word, based on the source word and the context of the source word;
- determining a probability of a fertility of a source word, based on the source word and the context of the source word;
- determining a probability of a target word being a correct translation of the source word, based on the source word and the context of the source word; and
- generating a list of target words for the translation of the source word, based on the probability of the fertility and the probability of the target word.
Dependent claims: 20-37.
Specification