×

SYNTAX-BASED AUGMENTATION OF STATISTICAL MACHINE TRANSLATION PHRASE TABLES

  • US 20120296633A1
  • Filed: 05/20/2011
  • Published: 11/22/2012
  • Est. Priority Date: 05/20/2011
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented process for augmenting a machine translation phrase table with additional phrase pairs each pair of which associates a phrase in a source language with a phrase in a target language, comprising:

  • using a computer to perform the following process actions;

    inputting one or more syntactic transfer patterns, each of said patterns defining the syntax of a translation to the target language of a different phrase structure in the source language, wherein each source language phrase type represents a phase having a particular syntactic structure that is different from the other source language phrase types;

    for each inputted syntactic transfer pattern,synthesizing phrases in the source language of the type associated with the pattern under consideration using a lexicon of the source language,eliminating synthesized phrases not found in a monolingual corpus of the source language,for each remaining synthesized phrase, translating the synthesized phrase into the target language using the syntactic transfer pattern under consideration, a bilingual source-to-target language dictionary, and a morphological synthesizer to properly inflect the words of each translated phrase, andfor each translated phrase, adding a phrase pair comprising the translated phrase and its corresponding source language phrase to the current version of the phrase table to produce an augmented version of the phrase table.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×