×

Longest-common-subsequence detection for common synonyms

  • US 8,001,136 B1
  • Filed: 07/02/2008
  • Issued: 08/16/2011
  • Est. Priority Date: 07/10/2007
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for identifying synonym candidates, the method comprising:

  • receiving a first term and a second term;

    identifying, using one or more processors, a longest subsequence that is common to the first term and the second term;

    determining which of the first term and the second term are longer;

    determining a ratio between a length of the longest subsequence and a length of the longer of the first term and the second term; and

    determining that the ratio between the length of the longest subsequence and the length of the longer of the first term and the second term meets a first threshold;

    computing an edit distance between the first term and the second term;

    comparing the edit distance to a second threshold;

    determining that the edit distance meets the second threshold; and

    designating the first term and the second term as synonym candidates based on determining that the ratio between the length of the longest subsequence and the length of the longer of the first term and the second term meets the first threshold and based on determining that the edit distance meets the second threshold.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×