Confidence-driven rewriting of source texts for improved translation
First Claim
1. A method for rewriting source text, comprising:
- receiving source text comprising at least one source text string in a first natural language;
with a processor, for each of the at least one source text string;
translating the source text string with a machine translation system to generate a first target text string in a second natural language;
computing a first translation confidence for the source text string based on at least one feature that is based on at least one of the source text string and the first target text string;
providing for generating at least one alternative text string in the first natural language, the generating comprising automatically rewriting the source text string; and
for each of the at least one alternative text string;
translating the alternative text string with the machine translation system to generate a second target text string in the second natural language; and
computing a second translation confidence for the alternative text string based on at least one feature that is based on at least one of the alternative text string and the second target text string; and
based on the computed first and second translation confidences, providing for selecting one of the at least one alternative text strings as a replacement for the source text string in the source text.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for rewriting source text includes receiving source text including a source text string in a first natural language. The source text string is translated with a machine translation system to generate a first target text string in a second natural language. A translation confidence for the source text string is computed, based on the first target text string. At least one alternative text string is generated, where possible, in the first natural language by automatically rewriting the source string. Each alternative string is translated to generate a second target text string in the second natural language. A translation confidence is computed for the alternative text string based on the second target string. Based on the computed translation confidences, one of the alternative text strings may be selected as a candidate replacement for the source text string and may be proposed to a user on a graphical user interface.
-
Citations
21 Claims
-
1. A method for rewriting source text, comprising:
-
receiving source text comprising at least one source text string in a first natural language; with a processor, for each of the at least one source text string; translating the source text string with a machine translation system to generate a first target text string in a second natural language; computing a first translation confidence for the source text string based on at least one feature that is based on at least one of the source text string and the first target text string; providing for generating at least one alternative text string in the first natural language, the generating comprising automatically rewriting the source text string; and for each of the at least one alternative text string; translating the alternative text string with the machine translation system to generate a second target text string in the second natural language; and computing a second translation confidence for the alternative text string based on at least one feature that is based on at least one of the alternative text string and the second target text string; and based on the computed first and second translation confidences, providing for selecting one of the at least one alternative text strings as a replacement for the source text string in the source text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A system for rewriting source text, comprising:
-
a rewriting component configured for receiving source text comprising a source text string in a first natural language and generating at least one alternative text string in the first natural language by rewriting the source text string; a translation requester configured for requesting a first machine translation of the source text string and a second machine translation of the alternative text string; a confidence estimator for estimating a translation confidence for the source text string and alternative text string; a selection component for selecting alternative text strings as replacements for source text strings or selecting candidates for replacement of source text strings, based on the translation confidences; and a processor which implements at least one of the translation requester, confidence estimator, and selection component. - View Dependent Claims (20)
-
-
21. An interactive translation method comprising:
-
receiving source text comprising at least one source text string in a first natural language; with a processor, for each of the at least one source text string; translating the source text string with a machine translation system to generate a first target text string in a second natural language; computing a first translation confidence for the source text string; providing for retaining the source text string in the source text where the first translation confidence meets a confidence threshold and for the source text string in the source text and for generating at least one alternative text string in the first natural language based on the source text string where the first translation confidence does not meet the threshold; and for each of the at least one alternative text string; translating the alternative text string with the machine translation system to generate a second target text string in the second natural language; and computing a second translation confidence for the alternative text string; and providing for retaining the source text string in the source text where the first translation confidence at least meets the second translation confidence of each of the at least one alternative text strings and for presenting at least one of the at least one alternative text strings to a user as a candidate for replacement of the source string in the source text otherwise; and outputting a translation of the source text based on the source text strings that have been retained and the presented alternative source strings that have been selected by a user for replacement of a respective source string.
-
Specification