Predicting the cost associated with translating textual content

  • US 10,417,646 B2
  • Filed: 03/09/2010
  • Issued: 09/17/2019
  • Est. Priority Date: 03/09/2010
  • Status: Active Grant
First Claim

1. A method comprising:

    receiving an estimate of a first quantity of a first type of textual content of documents of a source language expected to be translated in the future, the first type of textual content including text expected to be translated via human translation for which a near-perfect translation is desired;

    receiving an estimate of a second quantity of a second type of textual content of the documents expected to be translated in the future, the second type of textual content different from the first type of textual content, the second type of textual content including text expected to be translated via machine-generated translation for which a potentially imperfect translation is acceptable;

    executing instructions stored in a memory by a processor to obtain an indication of a target language, the source language and the target language forming a language pair;

    training a machine translation system for the language pair using a training dataset associated with similar subject matter as documents of the second type of textual content expected to be translated in the future;

    comparing a machine-generated target-language corpus with a human-generated target-language corpus, relative to the second type of textual content of the documents expected to be translated in the future;

    mapping features such as similarities and differences between the machine-generated target-language corpus and the human-generated target-language corpus using the comparison;

    predicting a quality level of the trained machine translation system associated with translational accuracy of future machine-generated translations relative to human-generated translations using the mapping;

    executing instructions stored in a memory by the processor to determine a prediction of a first cost associated with translating the first type of textual content and a second cost associated with the second type of textual content from the source language to the target language, the prediction based at least in part on the first quantity estimation, the second quantity estimation, the predicted quality level, and the language pair;

    selecting the trained machine translation system to perform machine translations of the second type of textual content based on the predicted quality level of translation of the second type of textual content by the machine translation system and the predicted first and second cost;

    receiving text for translation;

    receiving a selection of a quantity of the received text that is of the second type of textual content to be translated by a machine translation system rather than by a human; and

    performing a machine translation of the selected quantity of the received text of the second type of textual content using the selected machine translation system.
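
The quality-prediction and cost-prediction steps recited above can be illustrated with a minimal sketch. It is not the patented implementation: the per-word rates, the token-overlap quality proxy, the post-editing surcharge, and every name below (`predict_quality`, `predict_costs`, `HUMAN_RATE`, `MACHINE_RATE`, `POST_EDIT_RATE`) are hypothetical and chosen only to show how the two quantity estimates, a predicted quality level, and a language pair could combine into a first and a second cost.

```python
from dataclasses import dataclass

# Hypothetical per-word rates keyed by (source, target) language pair.
# These figures are illustrative assumptions, not values from the patent.
HUMAN_RATE = {("en", "de"): 0.20, ("en", "ja"): 0.30}
MACHINE_RATE = {("en", "de"): 0.02, ("en", "ja"): 0.03}
POST_EDIT_RATE = {("en", "de"): 0.05, ("en", "ja"): 0.08}


@dataclass
class CostPrediction:
    first_cost: float   # human translation of the first type of content
    second_cost: float  # machine translation of the second type of content


def predict_quality(machine_corpus, human_corpus):
    """Crude quality proxy (an assumption): mean Jaccard token overlap
    between paired machine-generated and human-generated sentences."""
    pairs = list(zip(machine_corpus, human_corpus))
    if not pairs:
        raise ValueError("need at least one sentence pair")
    total = 0.0
    for machine_sentence, human_sentence in pairs:
        machine_tokens = set(machine_sentence.lower().split())
        human_tokens = set(human_sentence.lower().split())
        union = machine_tokens | human_tokens
        # An empty pair contributes a score of 0.0.
        if union:
            total += len(machine_tokens & human_tokens) / len(union)
    return total / len(pairs)


def predict_costs(first_words, second_words, quality, pair):
    """Combine both quantity estimates, the predicted quality level, and the
    language pair. Assumption: lower predicted quality adds a proportional
    post-editing surcharge to the machine-translated portion."""
    if not 0.0 <= quality <= 1.0:
        raise ValueError("quality must be between 0 and 1")
    first_cost = first_words * HUMAN_RATE[pair]
    second_cost = second_words * (
        MACHINE_RATE[pair] + (1.0 - quality) * POST_EDIT_RATE[pair]
    )
    return CostPrediction(first_cost, second_cost)


if __name__ == "__main__":
    quality = predict_quality(
        ["the contract ends in may", "please restart the device"],
        ["the contract expires in may", "please restart the device"],
    )
    print(quality, predict_costs(1000, 5000, quality, ("en", "de")))
```

A real system would likely replace the overlap score with an established metric and learn the rates from historical translation data; the sketch only fixes the shape of the inputs and outputs.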