Method and apparatus for example-based machine translation with learned word associations
First Claim
1. A method of translating words in a first language into words in a second language, the method comprising:
- using a first training text in a first language and a second training text in a second language to generate a word association list comprising entries, at least one of the entries comprising a word from the first language and a word from the second language; and
translating words in the first language into words in the second language based in part on the word association list without referring to a multi-lingual human-authored dictionary of content words wherein a content word comprises a word that is not a preposition, is not a conjunction, is not a pronoun and is not an article.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and computer-readable medium are provided that perform a series of steps associated with machine translation. These steps include using a first text in a first language and a second text in a second language, to produce an association list where words in the first language are associated with words in the second language. A first syntactic structure for a sentence from the first text is aligned with a second syntactic structure for a sentence in the second text based on the association list without referring to a bilingual dictionary of content words. The association list is also used during translations. Specifically, a word in the first language is translated into a word in the second language based on an entry in the association list without referring to a bilingual dictionary that contains content words. Thus, training and translation are performed without the need for a bilingual dictionary of content words.
98 Citations
16 Claims
-
1. A method of translating words in a first language into words in a second language, the method comprising:
-
using a first training text in a first language and a second training text in a second language to generate a word association list comprising entries, at least one of the entries comprising a word from the first language and a word from the second language; and
translating words in the first language into words in the second language based in part on the word association list without referring to a multi-lingual human-authored dictionary of content words wherein a content word comprises a word that is not a preposition, is not a conjunction, is not a pronoun and is not an article. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-readable medium having computer-executable instructions for performing steps comprising:
-
identifying associations between words in a first language and words in a second language based on a first text in the first language and a second text in the second language to produce an association list;
identifying a first syntactic structure for a sentence in a first language;
identifying a second syntactic structure for a sentence in a second language; and
aligning the first syntactic structure with the second syntactic structure based on the association list without referring to a human-authored bilingual dictionary of content words. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16)
-
Specification