AUGMENTING STATISTICAL MACHINE TRANSLATION WITH LINGUISTIC KNOWLEDGE
Abstract
A computer-implemented technique can include receiving, at a computing system including one or more processors, a translation model including a plurality of aligned pairs of phrases in first and second languages. The technique can include determining, at the computing system, one or more features for each of the plurality of pairs of phrases based on linguistic differences between the first and second languages to obtain a plurality of sets of features. The technique can include associating, at the computing system, the plurality of sets of features with the plurality of pairs of phrases, respectively, to obtain a modified translation model. The technique can also include performing, at the computing system, statistical machine translation from the first language to the second language using the modified translation model.
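By way of a non-limiting illustration, the technique summarized above can be sketched in a few lines of Python. Everything specific in the sketch is an assumption made for illustration and is not taken from this document: the `PhrasePair` structure, the English-to-Spanish example, the toy `TARGET_GENDER` lexicon, the single `gender_agreement` feature, and the additive scoring in `translate`. The sketch only shows the shape of the flow: receive aligned phrase pairs, determine features per pair, associate them to obtain a modified model, and translate with that model.

```python
from dataclasses import dataclass, field, replace

# Hypothetical toy lexicon: grammatical gender of a few Spanish words.
# Gender marking stands in for a "linguistic difference" between the
# first language (English) and the second language (Spanish).
TARGET_GENDER = {
    "la": "fem", "casa": "fem", "roja": "fem",
    "el": "masc", "rojo": "masc",
}


@dataclass
class PhrasePair:
    source: str    # first phrase (first language)
    target: str    # second phrase (second language), aligned with `source`
    score: float   # assumed base translation-model score (higher is better)
    features: dict = field(default_factory=dict)


def determine_features(pair):
    """Determine a set of features for one phrase pair (toy example)."""
    genders = [TARGET_GENDER[w] for w in pair.target.split() if w in TARGET_GENDER]
    # 1.0 when every gender-marked target word carries the same gender.
    return {"gender_agreement": 1.0 if len(set(genders)) <= 1 else 0.0}


def augment(model):
    """Associate each pair with its feature set to obtain a modified model."""
    return [replace(p, features=determine_features(p)) for p in model]


def translate(source, model, weights):
    """Pick the best-scoring target phrase for an exact source match."""
    candidates = [p for p in model if p.source == source]
    if not candidates:
        return None

    def total(p):
        return p.score + sum(weights.get(n, 0.0) * v for n, v in p.features.items())

    return max(candidates, key=total).target


model = [
    PhrasePair("the red house", "el casa rojo", -0.9),
    PhrasePair("the red house", "la casa roja", -1.0),
]
modified = augment(model)

print(translate("the red house", model, {}))                            # el casa rojo
print(translate("the red house", modified, {"gender_agreement": 0.5}))  # la casa roja
```

In this toy setting the unmodified model prefers the ungrammatical candidate because its base score is slightly higher; once the agreement feature is associated with each pair and given a positive weight, the grammatical candidate wins.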
50 Claims
1. A computer-implemented method comprising:
receiving, at a computing system including one or more processors, a translation model including a plurality of pairs of phrases, each of the plurality of pairs of phrases including a first phrase of one or more words in a first language and a second phrase of one or more words in a second language, wherein a specific first phrase is aligned with a specific second phrase for a specific pair of phrases;
determining, at the computing system, one or more features for each of the plurality of pairs of phrases based on linguistic differences between the first and second languages to obtain a plurality of sets of features;
associating, at the computing system, the plurality of sets of features with the plurality of pairs of phrases, respectively, to obtain a modified translation model; and
performing, at the computing system, statistical machine translation from the first language to the second language using the modified translation model.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A computer-implemented method comprising:
receiving, at a computing system including one or more processors, a translation model configured for translation between a first language and a second language;
receiving, at the computing system, a plurality of pairs of phrases, each of the plurality of pairs of phrases including a first phrase of one or more words in the first language and a second phrase of one or more words in the second language, wherein a specific first phrase is aligned with a specific second phrase for a specific pair of phrases;
receiving, at the computing system, a source phrase for translation from the first language to the second language;
determining, at the computing system, a translated phrase based on the source phrase using the translation model;
determining, at the computing system, a selected second phrase from the plurality of pairs of phrases based on the translated phrase;
predicting, at the computing system, one or more features for each word of the translated phrase based on the selected second phrase, a selected first phrase associated with the selected second phrase, and linguistic differences between the first and second languages;
modifying, at the computing system, the translated phrase based on the one or more features to obtain a modified translated phrase; and
outputting, from the computing system, the modified translated phrase.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
26. A system comprising:
one or more computing devices configured to perform operations including:
receiving a translation model including a plurality of pairs of phrases, each of the plurality of pairs of phrases including a first phrase of one or more words in a first language and a second phrase of one or more words in a second language, wherein a specific first phrase is aligned with a specific second phrase for a specific pair of phrases;
determining one or more features for each of the plurality of pairs of phrases based on linguistic differences between the first and second languages to obtain a plurality of sets of features;
associating the plurality of sets of features with the plurality of pairs of phrases, respectively, to obtain a modified translation model; and
performing statistical machine translation from the first language to the second language using the modified translation model.
Dependent claims: 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37
38. A system comprising:
one or more computing devices configured to perform operations including:
receiving a translation model configured for translation between a first language and a second language;
receiving a plurality of pairs of phrases, each of the plurality of pairs of phrases including a first phrase of one or more words in the first language and a second phrase of one or more words in the second language, wherein a specific first phrase is aligned with a specific second phrase for a specific pair of phrases;
receiving a source phrase for translation from the first language to the second language;
determining a translated phrase based on the source phrase using the translation model;
determining a selected second phrase from the plurality of pairs of phrases based on the translated phrase;
predicting one or more features for each word of the translated phrase based on the selected second phrase, a selected first phrase associated with the selected second phrase, and linguistic differences between the first and second languages;
modifying the translated phrase based on the one or more features to obtain a modified translated phrase; and
outputting the modified translated phrase.
Dependent claims: 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50
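By way of a non-limiting illustration, the post-translation flow recited in claims 13 and 38 can be loosely sketched in Python. All names and data below are hypothetical and are not taken from this document: `base_translate` is a stand-in for the received translation model, `PAIRS`, `GENDER`, and `INFLECT` are toy tables, string similarity from `difflib` is merely one assumed way to select a second phrase, and grammatical gender is an assumed example of a linguistic difference. As a simplification, the toy predictor reads the feature off the selected second phrase only; the selected first phrase is passed in but not otherwise used.

```python
import difflib
from collections import Counter

# Hypothetical aligned phrase pairs: (first phrase, second phrase).
PAIRS = [
    ("the red house", "la casa roja"),
    ("the old book", "el libro viejo"),
]

# Hypothetical lexicon of grammatical gender for second-language words.
GENDER = {
    "la": "fem", "casa": "fem", "roja": "fem",
    "el": "masc", "libro": "masc", "viejo": "masc",
}

# Hypothetical inflection table: (word, predicted gender) -> surface form.
INFLECT = {
    ("el", "fem"): "la", ("rojo", "fem"): "roja",
    ("la", "masc"): "el", ("roja", "masc"): "rojo",
}


def base_translate(source):
    """Stand-in for the translation model; output may be badly inflected."""
    return {"the red house": "el casa rojo"}.get(source, source)


def select_pair(translated, pairs):
    """Select the pair whose second phrase is most similar to the output."""
    return max(
        pairs,
        key=lambda p: difflib.SequenceMatcher(None, translated, p[1]).ratio(),
    )


def predict_features(translated, selected_first, selected_second):
    """Predict one feature dict per word of the translated phrase.

    Toy rule: use the most common gender in the selected second phrase.
    `selected_first` is unused in this simplified sketch.
    """
    genders = [GENDER[w] for w in selected_second.split() if w in GENDER]
    gender = Counter(genders).most_common(1)[0][0] if genders else None
    return [{"gender": gender} for _ in translated.split()]


def post_edit(source, pairs=PAIRS):
    translated = base_translate(source)
    first, second = select_pair(translated, pairs)
    features = predict_features(translated, first, second)
    # Modify each word according to its predicted feature, if a form is known.
    words = [
        INFLECT.get((word, feat["gender"]), word)
        for word, feat in zip(translated.split(), features)
    ]
    return " ".join(words)


print(post_edit("the red house"))  # la casa roja
```

Here the stand-in model outputs "el casa rojo"; the closest second phrase among the pairs is "la casa roja", the feminine gender is predicted for each word, and the article and adjective are re-inflected accordingly.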
Specification