Constructing a translation lexicon from comparable, non-parallel corpora

  • US 20030204400A1
  • Filed: 03/26/2003
  • Published: 10/30/2003
  • Est. Priority Date: 03/26/2002
  • Status: Active Grant
First Claim

1. A method for building a translation lexicon from non-parallel corpora, the method comprising:

    identifying identically spelled words in a first corpus and a second corpus, the first corpus including words in a first language and the second corpus including words in a second language;

    generating a seed lexicon including identically spelled words; and

    expanding the seed lexicon by identifying possible translations of words in the first and second corpora using one or more clues.
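
The three steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the claim does not specify which clues are used, so this sketch assumes a single hypothetical clue, spelling similarity, scored with the standard library's `difflib.SequenceMatcher`. The function names, the 0.7 threshold, the greedy one-to-one matching, and the toy German/English word lists are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def build_seed_lexicon(corpus_a, corpus_b):
    """Steps 1 and 2: collect words spelled identically in both corpora."""
    vocab_a = {w.lower() for w in corpus_a}
    vocab_b = {w.lower() for w in corpus_b}
    # Each identically spelled word is assumed to translate to itself.
    return {w: w for w in vocab_a & vocab_b}


def expand_lexicon(seed, corpus_a, corpus_b, threshold=0.7):
    """Step 3: add candidate translations using one clue.

    The clue here (spelling similarity) and the threshold are
    assumptions; the claim only says "one or more clues".
    """
    lexicon = dict(seed)
    used_targets = set(lexicon.values())
    candidates_a = sorted({w.lower() for w in corpus_a} - set(lexicon))
    candidates_b = sorted({w.lower() for w in corpus_b} - used_targets)

    # Score every remaining word pair and keep those above the threshold.
    scored = []
    for a in candidates_a:
        for b in candidates_b:
            score = SequenceMatcher(None, a, b).ratio()
            if score >= threshold:
                scored.append((score, a, b))

    # Greedy best-first matching: each word is paired at most once.
    scored.sort(key=lambda t: (-t[0], t[1], t[2]))
    for _score, a, b in scored:
        if a not in lexicon and b not in used_targets:
            lexicon[a] = b
            used_targets.add(b)
    return lexicon


# Toy word lists standing in for the two monolingual corpora.
german = ["Computer", "Internet", "elektrisch", "Haus"]
english = ["computer", "internet", "electric", "house"]

seed = build_seed_lexicon(german, english)
lexicon = expand_lexicon(seed, german, english)
print(lexicon)
```

In this toy run the seed holds the two identically spelled words, and the expansion step adds the pair "elektrisch"/"electric". "Haus" and "house" fall below the assumed threshold, which shows why a single spelling clue is limited and why the claim leaves room for more than one clue.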

  • 2 Assignments