×

Dynamic bi-phrases for statistical machine translation

  • US 9,552,355 B2
  • Filed: 05/20/2010
  • Issued: 01/24/2017
  • Est. Priority Date: 05/20/2010
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for phrase-based translation comprising:

  • receiving an input of source text in a source language to be translated into target text in a target language;

    providing at least one dynamic bi-phrase rule to be used in generation of dynamic bi-phrases for translation of the source text;

    for a sentence of the source text;

    after receiving the source text, applying the at least one rule to the source text to generate a dynamic bi-phrase;

    associating a value of at least one dynamic feature with the at least one dynamic bi-phrase;

    retrieving static bi-phrases from a static bi-phrase table stored in memory which each include at least one word of the source text, each of the static bi-phrases being associated with a value of at least one static feature, each static bi-phrase in the bi-phrase table including a pair of phrases, each phrase comprising a sequence of at least one word, one of the phrases in the pair being from the source language and the other phrase being from the target language, the static bi-phrases having been automatically extracted from a training corpus of bisentences;

    retrieving any of the dynamic bi-phrases which each cover at least one word of the source text, the retrieved static bi-phrases and dynamic bi-phrases forming a set of active bi-phrases;

    generating translation hypotheses for at least a part of the source sentence using active bi-phrases from the set;

    scoring the translation hypotheses with a translation scoring model which takes into account the static feature values of static bi-phrases in the hypothesis and dynamic feature values of dynamic-bi-phrases in the hypothesis, wherein in scoring of a hypothesis, the scoring model considers static bi-phrases in the active set of bi-phrases in which the source phrase of the respective static bi-phrase covers at least one source word of the source sentence, wherein the static bi-phrase and a dynamic bi-phrase in the active set of bi-phrases both cover a same source word, such that each source word of the hypothesis is covered by exactly one of the retrieved bi-phrases in the active set of bi-phrases; and

    outputting a translation of the source text sentence based on the scoring of the hypotheses;

    wherein at least one of the generating at least one dynamic bi-phrase, associating at least one dynamic feature value of a dynamic feature with the at least one dynamic bi-phrase, retrieving static bi-phrases from a static bi-phrase table stored in memory, retrieving any of the dynamic bi-phrases which each cover at least one word of the source text, generating translation hypotheses for at least a part of the source sentence using active bi-phrases from the set, and scoring the translation hypotheses with a translation scoring model is implemented by a computer processor.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×