Please download the dossier by clicking on the dossier button x
×

MACHINE TRANSLATION USING OVERLAPPING BIPHRASE ALIGNMENTS AND SAMPLING

  • US 20120101804A1
  • Filed: 10/25/2010
  • Published: 04/26/2012
  • Est. Priority Date: 10/25/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method for training a translation scoring model for a statistical machine translation system, comprising:

  • receiving a set of source sentences, each of the source sentences comprising source words in a source language;

    for each source sentence, generating a target sentence comprising target words in a target language which are translations of the source words;

    generating a plurality of translation neighbors based on the target sentence, each translation neighbor comprising at least some of the target words of the target sentence;

    for each of the plurality of translation neighbors, computing a phrase alignment between the source sentence and the translation neighbor;

    scoring each translation neighbor with a translation scoring model, based on the computed phrase alignment between the source sentence and the translation neighbor;

    ranking a plurality of the translation neighbors based on the translation model scores;

    updating parameters of the model based on a comparison of the ranking based on the translation model scores with an external ranking of the plurality of translation neighbors; and

    iterating, at least once more, the generating of translation neighbors, scoring, ranking, and updating parameters, wherein in the generating of the plurality of translation neighbors of the target sentence, the target sentence is a translation neighbor from a prior iteration.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×