Statistical machine translation
First Claim
Patent Images
1. A computer implemented method, the method comprising:
- assigning a part of speech identifier to each word in a source string, the source string in a first language;
detecting a first sequence of syntactic chunks in the source string, the syntactic chunks each comprising at least one of the words;
assigning a syntactic chunk label to each of the detected syntactic chunks in the source string;
defining connections between each of the detected syntactic chunks in the source string and at least one syntactic chunk of a sequence of syntactic chunks in a target string, the target string being a parallel translation in a second language of the source string, said defining comprising determining connections based on a chunk mapping table, the chunk mapping table using pre-defined connections based on the assigned syntactic chunk label;
mapping each word in the detected syntactic chunks in the source string to each word in the syntactic chunks in the target string, said mapping based on a word mapping table and the part of speech identifier;
translating by a computer an input string in the first language into a translation in the second language based on the chunk mapping table and the word mapping table, andoutputting the translation using the computer.
2 Assignments
0 Petitions
Accused Products
Abstract
A method includes detecting a syntactic chunk in a source string in a first language, assigning a syntactic label to the detected syntactic chunk in the source string, mapping the detected syntactic chunk in the source string to a syntactic chunk in a target string in a second language, said mapping based on the assigned syntactic label, and translating the source string into a possible translation in the second language.
69 Citations
18 Claims
-
1. A computer implemented method, the method comprising:
-
assigning a part of speech identifier to each word in a source string, the source string in a first language; detecting a first sequence of syntactic chunks in the source string, the syntactic chunks each comprising at least one of the words; assigning a syntactic chunk label to each of the detected syntactic chunks in the source string; defining connections between each of the detected syntactic chunks in the source string and at least one syntactic chunk of a sequence of syntactic chunks in a target string, the target string being a parallel translation in a second language of the source string, said defining comprising determining connections based on a chunk mapping table, the chunk mapping table using pre-defined connections based on the assigned syntactic chunk label; mapping each word in the detected syntactic chunks in the source string to each word in the syntactic chunks in the target string, said mapping based on a word mapping table and the part of speech identifier; translating by a computer an input string in the first language into a translation in the second language based on the chunk mapping table and the word mapping table, and outputting the translation using the computer. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer readable storage medium having stored thereon a program, the program being executable by a processor for performing a method, the method comprising:
-
assigning a part of speech identifier to each word in a source string, the source string in a first language; detecting a first sequence of syntactic chunks in the source string, the syntactic chunks each comprising at least one of the words; assigning a syntactic chunk label to each of the detected syntactic chunks in the source string; defining connections between each of the detected syntactic chunks in the source string and at least one syntactic chunk of a sequence of syntactic chunks in a target string, the target string being a parallel translation in a second language of the source string, said defining comprising determining connections based on a chunk mapping table, the chunk mapping table using pre-defined connections based on the assigned syntactic chunk label; mapping each word in the detected syntactic chunks in the source string to each word in the syntactic chunks in the target string, said mapping based on a word mapping table and the part of speech identifier; and translating by a computer an input string in the first language into a translation in the second language based on the chunk mapping table and the word mapping table. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification