Word alignment with bridge languages
First Claim
Patent Images
1. A method comprising:
- receiving multi-lingual parallel text associating a source language, a target language, and one or more bridge languages;
determining a direct alignment between the source language and the target language, wherein the direct alignment represents a word-for-word connection between source language and target language words;
determining, using one or more computing devices, a first bridge-based alignment between the source language and the target language using a first bridge language that is distinct from the source language and the target language, wherein the first bridge-based alignment is determined based in part on a first posterior probability matrix for a first alignment from the source language to the first bridge language and a second posterior probability matrix for a second alignment from the first bridge language to the target language;
using the direct alignment between the source language and the target language to generate a first candidate translation of an input text in the source language to the target language;
using the first bridge-based alignment to generate a second candidate translation of the input text in the source language to the target language; and
determining a consensus translation using the candidate translations.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods, and computer program products are provided for statistical machine translation. In some implementations a method is provided. The method includes receiving multi-lingual parallel text associating a source language, a target language, and one or more bridge languages, determining an alignment between the source language and the target language using a first bridge language that is distinct from the source language and the target language, and using the determined alignment to generate a candidate translation of an input text in the source language to the target language.
27 Citations
23 Claims
-
1. A method comprising:
-
receiving multi-lingual parallel text associating a source language, a target language, and one or more bridge languages; determining a direct alignment between the source language and the target language, wherein the direct alignment represents a word-for-word connection between source language and target language words; determining, using one or more computing devices, a first bridge-based alignment between the source language and the target language using a first bridge language that is distinct from the source language and the target language, wherein the first bridge-based alignment is determined based in part on a first posterior probability matrix for a first alignment from the source language to the first bridge language and a second posterior probability matrix for a second alignment from the first bridge language to the target language; using the direct alignment between the source language and the target language to generate a first candidate translation of an input text in the source language to the target language; using the first bridge-based alignment to generate a second candidate translation of the input text in the source language to the target language; and determining a consensus translation using the candidate translations. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer program product, encoded on a non-transitory computer-readable medium, operable to cause data processing apparatus to perform operations comprising:
-
identifying multi-lingual parallel text associating a source language, a target language, and one or more bridge languages; determining a direct alignment between the source language and the target language, wherein the direct alignment represents a word-for-word connection between source language and target language words; determining a first bridge-based alignment between the source language and the target language using a first bridge language that is distinct from the source language and the target language, wherein the first bridge-based alignment is determined based in part on a first posterior probability matrix for a first alignment from the source language to the first bridge language and a second posterior probability matrix for a second alignment from the first bridge language to the target language; using the direct alignment between the source language and the target language to generate a first candidate translation of an input text in the source language to the target language; using the first bridge-based alignment to generate a second candidate translation of the input text in the source language to the target language; and determining a consensus translation using the candidate translations. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
-
17. A system comprising:
-
one or more computing devices operable to perform operations including; identifying multi-lingual parallel text associating a source language, a target language, and one or more bridge languages; determining a direct alignment between the source language and the target language wherein the direct alignment represents a word-for-word connection between source language and target language words; determining a first bridge-based alignment between the source language and the target language using a first bridge language that is distinct from the source language and the target language, wherein the first bridge-based alignment is determined based in part on a first posterior probability matrix for a first alignment from the source language to the first bridge language and a second posterior probability matrix for a second alignment from the first bridge language to the target language; using the direct alignment between the source language and the target language to generate a first candidate translation of an input text in the source language to the target language; using the first bridge-based alignment to generate a second candidate translation of the input text in the source language to the target language; and determining a consensus translation using the candidate translations. - View Dependent Claims (18, 19, 20, 21, 22, 23)
-
Specification