Terminological adaptation of statistical machine translation system through automatic generation of phrasal contexts for bilingual terms

  • US 9,367,541 B1
  • Filed: 01/20/2015
  • Issued: 06/14/2016
  • Est. Priority Date: 01/20/2015
  • Status: Expired due to Fees
First Claim

1. A method for terminological adaptation of a machine translation system comprising:

    receiving a set of vocabulary pairs, each vocabulary pair including a source term in a source language and a target term in a target language, each term being in a class which includes a set of sequences;

    extracting contextual phrase pairs from a bilingual training corpus, each of the contextual phrase pairs including a source phrase and a target phrase, the source phrase including a source sequence of a same class as the source term of one of the vocabulary pairs and associated source context, the target phrase including a target sequence of a same class as the target term of the one of the vocabulary pairs and associated target context;

    generating templates based on the extracted contextual phrase pairs, each template replacing the source and target sequences of a contextual phrase pair with respective source and target placeholders denoting the class;

    generating at least one candidate phrase pair from each template, each candidate phrase pair replacing the source and target placeholders of the template with respective source and target terms of a vocabulary pair of the same class; and

    incorporating at least some of the candidate phrase pairs into a phrase table of a machine translation system, wherein at least one of the extracting contextual phrase pairs, generating templates, and generating candidate phrase pairs is performed with a processor.
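
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that contextual phrase pairs have already been extracted from the bilingual corpus as plain strings, that each class is given as an explicit list of known source/target sequence pairs, and that the phrase table is a simple dictionary with a placeholder score. The names `VocabPair`, `generate_templates`, `generate_candidates`, `adapt_phrase_table`, and the class label `NOUN_F` are all hypothetical.

```python
from typing import NamedTuple


class VocabPair(NamedTuple):
    """A bilingual term with its class (hypothetical structure)."""
    source: str
    target: str
    term_class: str


def find_span(tokens, seq):
    """Return the start index of the first occurrence of seq in tokens, or -1."""
    n = len(seq)
    for i in range(len(tokens) - n + 1):
        if tokens[i:i + n] == seq:
            return i
    return -1


def generate_templates(phrase_pairs, class_sequences):
    """Replace known class sequences in phrase pairs with a class placeholder.

    phrase_pairs: iterable of (source_phrase, target_phrase) strings.
    class_sequences: dict mapping a class name to (source_seq, target_seq) pairs.
    Assumption: a phrase pair qualifies only if both sides contain the sequence
    plus at least one context token.
    """
    templates = set()
    for src_phrase, tgt_phrase in phrase_pairs:
        src_toks, tgt_toks = src_phrase.split(), tgt_phrase.split()
        for term_class, seq_pairs in class_sequences.items():
            slot = [f"<{term_class}>"]
            for src_seq, tgt_seq in seq_pairs:
                s, t = src_seq.split(), tgt_seq.split()
                i, j = find_span(src_toks, s), find_span(tgt_toks, t)
                if i < 0 or j < 0:
                    continue  # sequence missing on one side
                if len(src_toks) == len(s) or len(tgt_toks) == len(t):
                    continue  # no surrounding context
                src_tpl = src_toks[:i] + slot + src_toks[i + len(s):]
                tgt_tpl = tgt_toks[:j] + slot + tgt_toks[j + len(t):]
                templates.add((term_class, " ".join(src_tpl), " ".join(tgt_tpl)))
    return templates


def generate_candidates(templates, vocab_pairs):
    """Fill each template's placeholders with vocabulary terms of the same class."""
    candidates = set()
    for term_class, src_tpl, tgt_tpl in templates:
        slot = f"<{term_class}>"
        for vp in vocab_pairs:
            if vp.term_class == term_class:
                candidates.add((src_tpl.replace(slot, vp.source),
                                tgt_tpl.replace(slot, vp.target)))
    return candidates


def adapt_phrase_table(phrase_table, candidates, default_score=1.0):
    """Add candidates to the table without overwriting existing entries.

    The default score is a placeholder assumption, not a claimed scoring method.
    """
    for pair in candidates:
        phrase_table.setdefault(pair, default_score)
    return phrase_table


# Usage with toy English-French data (illustrative only).
templates = generate_templates(
    [("the red car", "la voiture rouge")],
    {"NOUN_F": [("car", "voiture")]},
)
candidates = generate_candidates(
    templates, [VocabPair("tablet", "tablette", "NOUN_F")]
)
phrase_table = adapt_phrase_table({}, candidates)
print(phrase_table)
```

Using a class-level placeholder in the templates is what lets a new term inherit phrasal contexts it never appeared in within the training corpus. A real system would also need to filter and score the candidates, which this sketch leaves out.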
