Identifying risky translations
First Claim
1. A method comprising:
- receiving a request at a translation system to translate source material from a source language into a destination language;
translating the source material into destination material, the translating comprising;
providing the source material to a translation model,identifying, with the translation model, a plurality of hypotheses representing translations of a source word or phrase into a destination word or phrase, the hypotheses comprising a first hypothesis and a second hypothesis different from the first hypothesis,outputting, from the translation model, a translation model score or rank for each of the plurality of hypotheses, wherein the first hypothesis has a higher translation model score or rank than the second hypothesis,evaluating, with a language model distinct from the translation model, the plurality of hypotheses in a context of the translated destination material,outputting, from the language model, a language model score or rank for each of the hypotheses, wherein the second hypothesis has a higher language model score or rank than the first hypothesis,combining, for each of the plurality of hypotheses, the translation model score or rank and the language model score or rank to select the first hypothesis or the second hypothesis as a most likely translation while identifying that the most likely translation is a questionable word or phrase based on the first hypothesis having a higher translation model score or rank and the second hypothesis having a higher language model score or rank; and
displaying the destination material, the displaying comprising visually distinguishing the questionable words or phrases in the destination material by differentiating between the most-likely translation and other words or phrases in the destination material that were translated with high confidence.
2 Assignments
0 Petitions
Accused Products
Abstract
Exemplary embodiments provide techniques for evaluating when words or phrases of a translation were generated with a low degree of confidence, and conveying this information when the translation is presented. For example, if a source language word is encountered in source material for translation, but the source language word was only encountered a few times (or not at all) in the training data used to train the translation system, then the resulting translation may be flagged as being of low confidence. Other situations, such as the generation of two equally-likely translations, or translation system model disagreement, may also indicate a questionable translation. When the translation is displayed, questionable words and phrases may be flagged, and possible alternative translations may be presented. If one of the alternatives is selected, this information may be used to update the translation system'"'"'s models in order to improve translation quality in the future.
55 Citations
18 Claims
-
1. A method comprising:
-
receiving a request at a translation system to translate source material from a source language into a destination language; translating the source material into destination material, the translating comprising; providing the source material to a translation model, identifying, with the translation model, a plurality of hypotheses representing translations of a source word or phrase into a destination word or phrase, the hypotheses comprising a first hypothesis and a second hypothesis different from the first hypothesis, outputting, from the translation model, a translation model score or rank for each of the plurality of hypotheses, wherein the first hypothesis has a higher translation model score or rank than the second hypothesis, evaluating, with a language model distinct from the translation model, the plurality of hypotheses in a context of the translated destination material, outputting, from the language model, a language model score or rank for each of the hypotheses, wherein the second hypothesis has a higher language model score or rank than the first hypothesis, combining, for each of the plurality of hypotheses, the translation model score or rank and the language model score or rank to select the first hypothesis or the second hypothesis as a most likely translation while identifying that the most likely translation is a questionable word or phrase based on the first hypothesis having a higher translation model score or rank and the second hypothesis having a higher language model score or rank; and displaying the destination material, the displaying comprising visually distinguishing the questionable words or phrases in the destination material by differentiating between the most-likely translation and other words or phrases in the destination material that were translated with high confidence. - View Dependent Claims (2, 3, 4, 5, 6, 18)
-
-
7. A non-transitory computer-readable medium storing instructions that, when executed by one or more processors, cause the one or more processors to:
-
receive a request at a translation system to translate source material from a source language into a destination language; translate the source material into destination material, the translating comprising; providing the source material to a translation model, identifying, with the translation model, a plurality of hypotheses representing translations of a source word or phrase into a destination word or phrase, the hypotheses comprising a first hypothesis and a second hypothesis different from the first hypothesis, outputting, from the translation model, a translation model score or rank for each of the plurality of hypotheses, wherein the first hypothesis has a higher translation model score or rank than the second hypothesis, evaluating, with a language model distinct from the translation model, the plurality of hypotheses in a context of the translated destination material, outputting, from the language model, a language model score or rank for each of the hypotheses, wherein the second hypothesis has a higher language model score or rank than the first hypothesis, combining, for each of the plurality of hypotheses, the translation model score or rank and the language model score or rank to select the first hypothesis or the second hypothesis as a most likely translation while identifying that the most likely translation is a questionable word or phrase based on the first hypothesis having a higher translation model score or rank and the second hypothesis having a higher language model score or rank; and display the destination material, the displaying comprising visually distinguishing the questionable words or phrases in the destination material by differentiating between the most-likely translation and other words or phrases in the destination material that were translated with high confidence. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. An apparatus comprising:
-
a network adapter configured to receive a request to translate source material from a source language into a destination language; translation system logic, at least a portion of which is implemented in hardware, to implement a translation system, the translation system logic comprising logic to; translate the source material into destination material, the translating comprising; providing the source material to a translation model, identifying, with the translation model, a plurality of hypotheses representing translations of a source word or phrase into a destination word or phrase, the hypotheses comprising a first hypothesis and a second hypothesis different from the first hypothesis, outputting, from the translation model, a translation model score or rank for each of the plurality of hypotheses, wherein the first hypothesis has a higher translation model score or rank than the second hypothesis, evaluating, with a language model distinct from the translation model, the plurality of hypotheses in a context of the translated destination material, outputting, from the language model, a language model score or rank for each of the hypotheses, wherein the second hypothesis has a higher language model score or rank than the first hypothesis, combining, for each of the plurality of hypotheses, the translation model score or rank and the language model score or rank to select the first hypothesis or the second hypothesis as a most likely translation while identifying that the most likely translation is a questionable word or phrase based on the first hypothesis having a higher translation model score or rank and the second hypothesis having a higher language model score or rank; and display the destination material, the displaying comprising visually distinguishing the questionable words or phrases in the destination material by differentiating between the most-likely translation and other words or phrases in the destination material that were translated with high confidence. - View Dependent Claims (14, 15, 16, 17)
-
Specification