System for identifying paraphrases using machine translation

  • US 7,412,385 B2
  • Filed: 11/12/2003
  • Issued: 08/12/2008
  • Est. Priority Date: 11/12/2003
  • Status: Active Grant
First Claim

1. A method of training a paraphrase processing system, comprising:

    i. accessing a plurality of documents;

    ii. identifying, from the plurality of documents, a cluster of related texts that are written by different authors about a common subject, wherein the related texts are further identified as being from different news agencies and about a common event;

    iii. receiving the cluster of related texts;

    iv. selecting a set of text segments from the cluster, wherein selecting comprises grouping desired text segments of the related texts into a set of related text segments;

    v. using textual alignment to identify paraphrase relationships between texts in the text segments included in the set of related text segments; and

    vi. wherein textual alignment comprises:

        using statistical textual alignment to align words in the text segments in the set; and

        identifying the paraphrase relationships based on the aligned words.
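
The alignment steps of the claim can be illustrated with a minimal sketch. The claim says only "statistical textual alignment"; IBM Model 1 is used here as one well-known statistical word aligner and is an assumption, not necessarily the technique the patent employs. The function names (`pair_segments`, `train_model1`, `align`, `paraphrase_candidates`) and the toy cluster are hypothetical, and the cluster is assumed to have already been identified as related texts from different sources about one event.

```python
from collections import defaultdict
from itertools import permutations


def pair_segments(cluster):
    """Pair every sentence with every sentence from a different document."""
    pairs = []
    for doc_a, doc_b in permutations(cluster, 2):
        for src in doc_a:
            for tgt in doc_b:
                pairs.append((src.lower().split(), tgt.lower().split()))
    return pairs


def train_model1(pairs, iterations=10):
    """Estimate t[(tgt_word, src_word)] = P(tgt_word | src_word) with EM."""
    t = defaultdict(lambda: 1.0)  # flat start; EM normalises on the first pass
    for _ in range(iterations):
        count = defaultdict(float)
        total = defaultdict(float)
        for src, tgt in pairs:
            for w in tgt:
                norm = sum(t[(w, s)] for s in src)
                for s in src:
                    frac = t[(w, s)] / norm  # expected count of the link s -> w
                    count[(w, s)] += frac
                    total[s] += frac
        t = defaultdict(float, {k: c / total[k[1]] for k, c in count.items()})
    return t


def align(src, tgt, t):
    """Link each target word to its most probable source word."""
    return [(max(src, key=lambda s: t[(w, s)]), w) for w in tgt]


def paraphrase_candidates(pairs, t):
    """Collect aligned word pairs whose surface forms differ."""
    found = set()
    for src, tgt in pairs:
        found.update((s, w) for s, w in align(src, tgt, t) if s != w)
    return found


# Toy cluster (invented data): two documents, each a list of sentences.
cluster = [
    ["the storm hit the coast", "officials closed the port"],
    ["the hurricane struck the coast", "authorities shut the port"],
]
pairs = pair_segments(cluster)
table = train_model1(pairs)
candidates = paraphrase_candidates(pairs, table)
```

Here `candidates` holds word pairs that the aligner linked across documents and that differ in surface form, which is one simple way to read "identifying the paraphrase relationships based on the aligned words". On a corpus this small the links are noisy; a practical system would presumably need far more data and some filtering of the aligned pairs.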

  • 2 Assignments