EXTRACTING TREELET TRANSLATION PAIRS

  • US 20090271177A1
  • Filed: 07/08/2009
  • Published: 10/29/2009
  • Est. Priority Date: 11/04/2004
  • Status: Active Grant
First Claim

1. A method of identifying treelet translation pairs for use in a machine translation system that translates a source language input into a target language output, the method comprising:

  • accessing a corpus of pairs of aligned, parallel syntactic dependency structures, each pair including a source language dependency structure having nodes that represent lexical items, the nodes being aligned with nodes representing lexical items in a target language dependency structure;

    enumerating individual source nodes and combinations of source nodes connected in the source language dependency structure as possible source treelets;

    identifying lexical items, and corresponding dependencies, in the target language dependency structure, that are aligned with the enumerated nodes and combinations of connected nodes, as possible target treelets corresponding to the possible source treelets;

    extracting well formed treelet translation pairs from the possible source treelets and possible target treelets; and

    storing the treelet translation pairs in a data store.
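
The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The representation (a list of head indices per sentence, with -1 for the root), the function names, the `max_size` cap, and in particular the well-formedness test are assumptions made for illustration; the claim itself does not define "well formed". Here a pair is treated as well formed when every source node in the treelet is aligned, no aligned target node is shared with a source node outside the treelet, and the aligned target nodes are connected in the target dependency structure.

```python
def children_of(heads):
    """Map each node index to the list of its dependents."""
    kids = {i: [] for i in range(len(heads))}
    for node, head in enumerate(heads):
        if head >= 0:
            kids[head].append(node)
    return kids


def rooted_treelets(node, kids, max_size):
    """All connected node sets (up to max_size) whose topmost node is `node`."""
    results = [frozenset([node])]
    for child in kids[node]:
        # Treelets hanging off this child; none fit if the budget is 1.
        options = rooted_treelets(child, kids, max_size - 1) if max_size > 1 else []
        extended = [base | opt for base in results for opt in options
                    if len(base) + len(opt) <= max_size]
        results.extend(extended)
    return results


def is_connected(nodes, heads):
    """A node set in a tree is connected iff exactly one node has its head outside the set."""
    tops = [n for n in nodes if heads[n] not in nodes]
    return len(tops) == 1


def extract_pairs(src_heads, tgt_heads, alignment, max_size=2):
    """Enumerate source treelets and keep those whose aligned target nodes
    pass the (assumed) well-formedness test. `alignment` maps a source node
    index to a set of target node indices."""
    kids = children_of(src_heads)
    pairs = []
    for root in range(len(src_heads)):
        for src in rooted_treelets(root, kids, max_size):
            # Assumption: every source node in the treelet must be aligned.
            if any(not alignment.get(s) for s in src):
                continue
            tgt = frozenset(t for s in src for t in alignment[s])
            # Assumption: target nodes may not also align outside the treelet.
            shared = any(t in tgt
                         for s, ts in alignment.items() if s not in src
                         for t in ts)
            if shared:
                continue
            if is_connected(tgt, tgt_heads):
                pairs.append((src, tgt))
    return pairs


# Hypothetical example: "the old man" / "le vieil homme", with both
# determiners and adjectives depending on the noun (index 2).
src_heads = [2, 2, -1]
tgt_heads = [2, 2, -1]
alignment = {0: {0}, 1: {1}, 2: {2}}
treelet_pairs = extract_pairs(src_heads, tgt_heads, alignment, max_size=2)
```

The sketch returns node sets only; the corresponding dependencies can be read back from the head lists, and the returned list stands in for the data store of the final step. Capping treelet size keeps the enumeration tractable, since the number of connected node sets grows quickly with the number of dependents per node.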
