Extraction-empowered machine translation
First Claim
Patent Images
1. A method for automatically translating a document from a first language to a second language comprising:
- receiving the document in the first language;
processing the document to extract elements of information;
determining, using a processor, a plurality of potential translations for each of the extracted elements of information using a first translation process and a likelihood value for each of the potential translations of the elements of information;
determining a plurality of potential translations of a remainder of the document using a second, different translation process and a likelihood value for each potential remainder translation;
generating a plurality of combinations by combining a plurality of the potential translations of the elements of information with a plurality of the potential remainder translations;
determining a likelihood value for respective ones of a plurality of the combinations based on a model of the second language and corresponding likelihood values of each of the potential element of information translations and remainder translations included in the respective combinations; and
forming a translated version of the document based on the likelihood values of the combinations.
4 Assignments
0 Petitions
Accused Products
Abstract
The invention relates to systems and methods for automatically translating documents from a first language to a second language. To carry out the translation of a document, elements of information are extracted from the document and are translated using one or more specialized translation processes. The remainder of the document is separately translated by a statistical translation process. The translated elements of information and the translated remainder are then merged into a final translated document.
-
Citations
16 Claims
-
1. A method for automatically translating a document from a first language to a second language comprising:
-
receiving the document in the first language; processing the document to extract elements of information; determining, using a processor, a plurality of potential translations for each of the extracted elements of information using a first translation process and a likelihood value for each of the potential translations of the elements of information; determining a plurality of potential translations of a remainder of the document using a second, different translation process and a likelihood value for each potential remainder translation; generating a plurality of combinations by combining a plurality of the potential translations of the elements of information with a plurality of the potential remainder translations; determining a likelihood value for respective ones of a plurality of the combinations based on a model of the second language and corresponding likelihood values of each of the potential element of information translations and remainder translations included in the respective combinations; and forming a translated version of the document based on the likelihood values of the combinations. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for automatically translating a document from a first language to a second language comprising:
-
receiving the document in the first language; determining, using a processor, a plurality of potential translations of a first portion of the document to the second language using a first translation process and a likelihood value for each potential first portion translation; determining a plurality of potential translations of a second portion of the document to the second language using a second, different translation process and a likelihood value for each potential second portion translation; generating a plurality of combinations by combining a plurality of the potential translations of the first portion with a plurality of the potential translations of the second portion; determining a likelihood value for respective ones of a plurality of the combinations based on a model of the second language and corresponding likelihood values of each of the potential translations of the first portion and the potential translations of the second portion; and forming a translated version of the document based on the likelihood values of the combinations. - View Dependent Claims (13, 14)
-
-
15. A system for automatically translating a document from a first language to a second language comprising:
-
a first processor for receiving the document in the first language and processing the document to extract elements of information; a second processor for determining a plurality of potential translations for each of the extracted elements of information using a first translation process and a likelihood value for each of the potential translations of the elements of information; a third processor for determining a plurality of potential translations of a remainder of the document using a second, different translation process and a likelihood value for each potential remainder translation; a fourth processor for generating a plurality of combinations by combining a plurality of the potential translations of the elements of information with a plurality of the potential remainder translations; a fifth processor for determining a likelihood value for respective ones of a plurality of the combinations based on a model of the second language and corresponding likelihood values of each of the potential element of information translations and remainder translations included in the respective combinations; and a sixth processor for forming a translated version of the document based on the likelihood values of the combinations.
-
-
16. A non-transitory computer-readable medium containing computer-executable instructions, which when executed by a computer carry out a method for translating a document from a first language to a second language, comprising:
-
receiving the document in the first language; processing the document to extract elements of information; determining, using a processor, a plurality of potential translations for each of the extracted elements of information using a first translation process and a likelihood value for each of the potential translations of the elements of information; determining a plurality of potential translations of a remainder of the document using a second, different translation process and a likelihood value for each potential remainder translation; generating a plurality of combinations by combining a plurality of the potential translations of the elements of information with a plurality of the potential remainder translations; determining a likelihood value for respective ones of a plurality of the combinations based on a model of the second language and corresponding likelihood values of each of the potential element of information translations and remainder translations included in the respective combinations; and forming a translated version of the document based on the likelihood values of the combinations.
-
Specification