Machine translation using elastic chunks
First Claim
1. A machine translation method for translating source text from a first:
- language to target text in a second language, comprising;
receiving the source text in the first language;
accessing a library of bi-fragments, each of the bi-fragments including a text fragment from the first language and a text fragment from the second language, at least some of the bi-fragments being modeled as elastic bi-fragments in which words of a fragment are spaced by a gap which is able to assume a variable size corresponding to a number of other words which are to occupy the gap;
retrieving text fragments from the second language from the library corresponding to text fragments in the source text;
generating at least one target hypothesis, each of said target hypotheses comprising text fragments selected from the second language; and
evaluating the target hypothesis with a translation scoring function which scores the target hypothesis according to a plurality of feature functions, at least one of the feature functions comprising a gap size scoring feature which favors hypotheses with statistically more probable gap sizes over hypotheses with statically less probable gap sizes.
1 Assignment
0 Petitions
Accused Products
Abstract
A machine translation method includes receiving source text in a first language and retrieving text fragments in a target language from a library of bi-fragments to generate a target hypothesis. Each bi-fragment includes a text fragment from the first language and a corresponding text fragment from the second language. Some of the bi-fragments are modeled as elastic bi-fragments where a gap between words is able to assume a variable size corresponding to a number of other words to occupy the gap. The target hypothesis is evaluated with a translation scoring function which scores the target hypothesis according to a plurality of feature functions, at least one of the feature functions comprising a gap size scoring feature which favors hypotheses with statistically more probable gap sizes over hypotheses with statically less probable gap sizes.
98 Citations
22 Claims
-
1. A machine translation method for translating source text from a first:
- language to target text in a second language, comprising;
receiving the source text in the first language;
accessing a library of bi-fragments, each of the bi-fragments including a text fragment from the first language and a text fragment from the second language, at least some of the bi-fragments being modeled as elastic bi-fragments in which words of a fragment are spaced by a gap which is able to assume a variable size corresponding to a number of other words which are to occupy the gap;
retrieving text fragments from the second language from the library corresponding to text fragments in the source text;
generating at least one target hypothesis, each of said target hypotheses comprising text fragments selected from the second language; and
evaluating the target hypothesis with a translation scoring function which scores the target hypothesis according to a plurality of feature functions, at least one of the feature functions comprising a gap size scoring feature which favors hypotheses with statistically more probable gap sizes over hypotheses with statically less probable gap sizes. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 22)
- language to target text in a second language, comprising;
-
17. A machine translation system for translating source text from a first language to target text in a second language, comprising:
-
a library of bi-fragments, each of the bi-fragments including a text fragment from the first language and a text fragment from the second language, at least some of the bi-fragments being modeled as elastic bi-fragments in which words of a fragment are spaced by a gap which is able to assume a variable size corresponding to a number of other words which are to occupy the gap, each of the elastic bi-fragments being associated in memory with a parameter representative of a gap size distribution for the bi-fragment; and
a processor which executes instructions for retrieving text fragments from the second language from the library corresponding to text fragments in the source text, generating at least one target hypothesis, each of said target hypotheses comprising text fragments selected from the second language, and evaluating the hypothesis with a translation scoring function which scores the hypothesis according to a plurality of features, at least one of the features comprising a gap size scoring feature which favors hypotheses comprising text fragments with statistically more probable gap sizes over hypotheses with statically less probable gap sizes. - View Dependent Claims (18, 19, 20, 21)
-
Specification