×

Adaptive pattern learning for bilingual data mining

  • US 8,275,604 B2
  • Filed: 03/18/2009
  • Issued: 09/25/2012
  • Est. Priority Date: 03/18/2009
  • Status: Active Grant
First Claim
Patent Images

1. A system, comprising:

  • one or more processors; and

    memory that includes a plurality of computer-executable components executable by the one or more processors, the plurality of computer-executable components comprising;

    a pre-processing component to process a bilingual web page into a Document Object Model (DOM) tree that includes at least one node;

    a seed mining component to link bilingual snippet pairs of the at least one node into a plurality of translation snippet pairs;

    a pattern learning component to determine one or more best fit candidate patterns based on the plurality of translation snippet pairs via a Support Vector Machine (SVM) classifier;

    a data mining component to mine one or more translation pairs from the bilingual web page using the one or more best fit candidate patterns; and

    a data storage component to store the one or more translation pairs, wherein the one or more translation pairs including at least one of a term pair, a phrase pair, or a sentence pair.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×