MACHINE TRANSLATION SYSTEM EMPLOYING CLASSIFIER
First Claim
1. A method comprising:
- accessing a machine translation system configured to translate an input in a source language to an output in a destination language;
analyzing information associated with at least one of the machine translation system or the output to determine that the output of the machine translation system comprises one or more target words or phrases; and
automatically modifying at least one of the machine translation system or the output so that the one or more target words or phrases are not automatically presented to an output device, wherein the modifying comprises filtering bilingual training data that trains the machine translation system.
2 Assignments
0 Petitions
Accused Products
Abstract
Exemplary embodiments relate to detecting, removing, and/or replacing objectionable words and phrases in a machine-generated translation. A classifier identifies translations containing target words or phrases. The classifier may be applied to the output translation to remove target words and phrases from the translation, or to prevent target words and phrases from being automatically presented. Further, the classifier may be applied to a translation model to prevent the target words and phrases from appearing in the output translation. Still further, the classifier may be applied to training data so that the translation model is not trained using the target words of phrases. The classifier may remove target words or phrases only when the target words or phrases appear in the output translation but not the source language input data. The classifier may be provided as a standalone service, or may be employed in the context of a machine translation system.
-
Citations
20 Claims
-
1. A method comprising:
-
accessing a machine translation system configured to translate an input in a source language to an output in a destination language; analyzing information associated with at least one of the machine translation system or the output to determine that the output of the machine translation system comprises one or more target words or phrases; and automatically modifying at least one of the machine translation system or the output so that the one or more target words or phrases are not automatically presented to an output device, wherein the modifying comprises filtering bilingual training data that trains the machine translation system. - View Dependent Claims (2, 4, 5, 6, 7)
-
-
3. (canceled)
-
8. A non-transitory computer-readable medium storing instructions that, when executed by one or more processors, cause the one or more processors to:
-
access a machine translation system configured to translate an input in a source language to an output in a destination language; analyze information associated with at least one of the machine translation system or the output to determine that the output of the machine translation system comprises one or more target words or phrases; and automatically modify at least one of the machine translation system or the output so that the one or more target words or phrases are not automatically presented to an output device, wherein the modifying comprises filtering a trained phrase table that the machine translation system uses to translate the source language into the destination language. - View Dependent Claims (9, 10, 12, 13, 14)
-
-
11. (canceled)
-
15. An apparatus comprising:
-
a non-transitory computer readable medium storing logic for a machine translation system configured to translate an input in a source language to an output in a destination language; a classifier configured to analyze information associated with at least one of the machine translation system or the output to determine that the output of the machine translation system comprises one or more target words or phrases; and a processor configured to automatically modify at least one of the machine translation system or the output so that the one or more target words or phrases are not automatically presented to an output device, wherein the modifying comprises;
filtering bilingual training data that trains the machine translation system or filtering a trained phrase table that the machine translation system uses to translate the source language into the destination language. - View Dependent Claims (16, 19, 20)
-
-
17-18. -18. (canceled)
Specification