Example based machine translation system
First Claim
1. A method of performing machine translation of a source language (SL) input to a translation output in a target language (TL), comprising:
- matching fragments of the SL input to SL fragments of examples in an example base;
identifying all matched blocks in the SL input as blocks of terms in the SL input that are matched by one or more SL fragments in an example;
selecting block combinations of the matched blocks to cover one or more fragments of the SL input;
for each block in the selected block combinations, identifying an example associated with the block;
aligning TL portions of the identified example with SL portions of the identified example that match the one or more fragments of the SL input; and
providing the translation output based on the aligned portions.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention performs machine translation by matching fragments of a source language sentence to be translated to source language portions of an example in example base. When all relevant examples have been identified in the example base, the examples are subjected to phrase alignment in which fragments of the target language sentence in each example are aligned against the matched fragments of the source language sentence in the same example. A translation component then substitutes the aligned target language phrases from the matched examples for the matched fragments in the source language sentence.
-
Citations
34 Claims
-
1. A method of performing machine translation of a source language (SL) input to a translation output in a target language (TL), comprising:
-
matching fragments of the SL input to SL fragments of examples in an example base;
identifying all matched blocks in the SL input as blocks of terms in the SL input that are matched by one or more SL fragments in an example;
selecting block combinations of the matched blocks to cover one or more fragments of the SL input;
for each block in the selected block combinations, identifying an example associated with the block;
aligning TL portions of the identified example with SL portions of the identified example that match the one or more fragments of the SL input; and
providing the translation output based on the aligned portions. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A method of performing machine translation of a source language (SL) input to a translation output in a target language (TL), comprising:
-
selecting examples, from an example base, that correspond to fragments of the SL input;
aligning TL portions of the selected examples with SL portions that correspond to the fragments of the SL input by, for each example;
performing word alignment to identify anchor alignment points between the SL portion and the TL portion;
finding continuous alignments between the TL portion and the SL portion based on the anchor alignment points;
finding non-continuous alignments between the TL portion and the SL portion; and
translating the SL input to the translation output from the continuous and non-continuous alignments. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32)
-
-
33. A word alignment system in an example based machine translation system that translates a source language (SL) input to a translation output in a target language (TL) based on examples in an example base, comprising:
-
a bilingual dictionary aligner accessing a bilingual dictionary to obtain dictionary information indicative of word translations between a SL portion and a TL portion of a selected example;
a distortion model aligner coupled to the bilingual dictionary and configured to align words in the SL portion with words in the TL portion of the selected example based on word position;
a monolingual dictionary aligner coupled to the distortion model aligner and configured to merge characters in the TL portion into words and merge words in the TL portion into phrases; and
a statistical aligner coupled to the monolingual dictionary aligner and configured to perform statistical alignment between as yet unaligned words in the SL portion and the TL portion of the example. - View Dependent Claims (34)
-
Specification