WORD ALIGNMENT METHOD AND SYSTEM FOR IMPROVED VOCABULARY COVERAGE IN STATISTICAL MACHINE TRANSLATION

  • US 20110307245A1
  • Filed: 06/14/2010
  • Published: 12/15/2011
  • Est. Priority Date: 06/14/2010
  • Status: Active Grant
First Claim

1. A method for generating word alignments from pairs of aligned text strings comprising:

  • from a corpus of text strings, receiving a pair of text strings comprising a first text string in a first language and a second text string in a second language;

    with a first alignment tool, generating a first alignment between the first and second text strings which creates links between the first and second text strings, each link linking a single token of the first text string to a single token of the second text string, the tokens of the first and second text strings including words;

    with a second alignment tool, generating a second alignment between the first and second text strings which creates links between the first and second text strings, each link linking at least one token of the first text string to at least one token of the second text string; and

    generating a modified first alignment by selectively modifying links in the first alignment which include a word which is infrequent in the corpus, based on links generated in the second alignment.
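
The claimed steps can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names (`count_tokens`, `modify_alignment`), the `max_count` frequency threshold, and the replacement policy (drop every first-alignment link that touches an infrequent word and borrow the second alignment's links for that word) are all assumptions made for illustration. Alignments are represented as sets of `(source_index, target_index)` pairs, so a many-to-many link from the second tool appears as several pairs.

```python
from collections import Counter


def count_tokens(corpus):
    """Count token frequencies per language over a corpus of (src, tgt) token lists."""
    src_counts, tgt_counts = Counter(), Counter()
    for src, tgt in corpus:
        src_counts.update(src)
        tgt_counts.update(tgt)
    return src_counts, tgt_counts


def modify_alignment(src, tgt, first, second, src_counts, tgt_counts, max_count=1):
    """Return a modified copy of `first`.

    Hypothetical policy: a word is "infrequent" if its corpus count is
    <= max_count. Links in `first` that touch an infrequent word are
    discarded and replaced by the links in `second` that touch one.
    """
    rare_src = {i for i, w in enumerate(src) if src_counts[w] <= max_count}
    rare_tgt = {j for j, w in enumerate(tgt) if tgt_counts[w] <= max_count}

    def touches_rare(link):
        i, j = link
        return i in rare_src or j in rare_tgt

    kept = {link for link in first if not touches_rare(link)}
    borrowed = {link for link in second if touches_rare(link)}
    return kept | borrowed


# Toy corpus: "collapsed", "s'est" and "effondrée" each occur only once.
corpus = [
    (["the", "house"], ["la", "maison"]),
    (["the", "house", "collapsed"], ["la", "maison", "s'est", "effondrée"]),
]
src_counts, tgt_counts = count_tokens(corpus)
src, tgt = corpus[1]

# Made-up outputs of the two alignment tools for the second sentence pair.
first = {(0, 0), (1, 1), (2, 1)}           # rare word mislinked to "maison"
second = {(0, 0), (1, 1), (2, 2), (2, 3)}  # rare word linked to two tokens

modified = modify_alignment(src, tgt, first, second, src_counts, tgt_counts)
print(sorted(modified))
```

In this toy case the links for the frequent words are kept from the first alignment, while the link for the infrequent word "collapsed" is replaced by the two links the second alignment proposes for it.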
