Round-trip translation for automated grammatical error correction
First Claim
1. A computer-implemented method for correcting a grammatical error in a text sequence, the computer-implemented method comprising:
- translating a first text sequence in a first language to a second language with a processing system using a first machine translation algorithm to provide a first translated text;
translating the first text sequence to a third language with the processing system using a second machine translation algorithm to provide a second translated text, the third language being different from the second language;
translating the first translated text back to the first language with the processing system using a third machine translation algorithm to provide a first back translation, the translating of the first translated text back to the first language being performed without reference to the first text sequence;
translating the second translated text back to the first language with the processing system using a fourth machine translation algorithm to provide a second back translation, the translating of the second translated text back to the first language being performed without reference to the first text sequence;
determining a plurality of candidate text sequences that include features of the first back translation and the second back translation with the processing system, the plurality of candidate text sequences including alternative grammatical options for the first text sequence; and
scoring each of the plurality of candidate text sequences using a computer based scoring model with the processing system, the scoring providing a measure of a grammaticality of each of the candidate text sequences, wherein a computer-readable memory is encoded with instructions for commanding the processing system to execute the first, second, third, and fourth machine translation algorithms and to execute the determining of the plurality of candidate text sequences and the scoring of each of the plurality of candidate text sequences.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are provided for correcting a grammatical error in a text sequence. A first text sequence in a first language is received. The first text sequence is translated to a second language to provide a first translated text. The first text sequence is translated to a third language to provide a second translated text. The third language is different from the second language. The first translated text is translated to the first language to provide a first back translation. The second translated text is translated to the first language to provide a second back translation. A plurality of candidate text sequences that include features of the first back translation and the second back translation are determined. The plurality of candidate text sequences include alternative grammatical options for the first text sequence. The plurality of candidate text sequences are scored with the processing system.
13 Citations
21 Claims
-
1. A computer-implemented method for correcting a grammatical error in a text sequence, the computer-implemented method comprising:
-
translating a first text sequence in a first language to a second language with a processing system using a first machine translation algorithm to provide a first translated text; translating the first text sequence to a third language with the processing system using a second machine translation algorithm to provide a second translated text, the third language being different from the second language; translating the first translated text back to the first language with the processing system using a third machine translation algorithm to provide a first back translation, the translating of the first translated text back to the first language being performed without reference to the first text sequence; translating the second translated text back to the first language with the processing system using a fourth machine translation algorithm to provide a second back translation, the translating of the second translated text back to the first language being performed without reference to the first text sequence; determining a plurality of candidate text sequences that include features of the first back translation and the second back translation with the processing system, the plurality of candidate text sequences including alternative grammatical options for the first text sequence; and scoring each of the plurality of candidate text sequences using a computer based scoring model with the processing system, the scoring providing a measure of a grammaticality of each of the candidate text sequences, wherein a computer-readable memory is encoded with instructions for commanding the processing system to execute the first, second, third, and fourth machine translation algorithms and to execute the determining of the plurality of candidate text sequences and the scoring of each of the plurality of candidate text sequences. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for correcting a grammatical error in a text sequence, the computer-implemented method comprising:
-
a processing system; and computer-readable memory in communication with the processing system encoded with instructions for commanding the processing system to execute steps comprising; receiving a first text sequence in a first language; translating the first text sequence to a second language with the processing system using a first machine translation algorithm to provide a first translated text; translating the first text sequence to a third language with the processing system using a second machine translation algorithm to provide a second translated text, the third language being different from the second language; translating the first translated text back to the first language with the processing system using a third machine translation algorithm to provide a first back translation, the translating of the first translated text back to the first language being performed without reference to the first text sequence; translating the second translated text back to the first language with the processing system using a fourth machine translation algorithm to provide a second back translation, the translating of the second translated text back to the first language being performed without reference to the first text sequence; determining a plurality of candidate text sequences that include features of the first back translation and the second back translation with the processing system, the plurality of candidate text sequences including alternative grammatical options for the first text sequence; and scoring each of the plurality of candidate text sequences using a computer based scoring model with the processing system, the scoring providing a measure of a grammaticality of each of the candidate text sequences. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable storage medium for correcting a grammatical error in a text sequence, the computer-readable storage medium comprising computer executable instructions which, when executed, cause a processing system to execute steps comprising:
-
receiving a first text sequence in a first language; translating the first text sequence to a second language with the processing system using a first machine translation algorithm to provide a first translated text; translating the first text sequence to a third language with the processing system using a second machine translation algorithm to provide a second translated text, the third language being different from the second language; translating the first translated text back to the first language with the processing system using a third machine translation algorithm to provide a first back translation, the translating of the first translated text back to the first language being performed without reference to the first text sequence; translating the second translated text back to the first language with the processing system using a fourth machine translation algorithm to provide a second back translation, the translating of the second translated text back to the first language being performed without reference to the first text sequence; determining a plurality of candidate text sequences that include features of the first back translation and the second back translation with the processing system, the plurality of candidate text sequences including alternative grammatical options for the first text sequence; and scoring each of the plurality of candidate text sequences using a computer based scoring model with the processing system, the scoring providing a measure of a grammaticality of each of the candidate text sequences. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification