×

Automatically finding acronyms and synonyms in a corpus

  • US 8,316,007 B2
  • Filed: 06/28/2007
  • Issued: 11/20/2012
  • Est. Priority Date: 06/28/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method in a computer system for identifying acronym and synonym pairs for a selected target corpus, the method comprising:

  • analyzing each sentence in a target corpus to identify possible acronym and synonym pairs;

    determining, using a processor associated with a computer system, an occurrence frequency of each identified possible acronym and synonym pair from among a plurality of possible acronym and synonym pairs;

    determining a maximum possible length for each identified possible acronym and synonym pair;

    receiving a user-selected relative weighting factor from a user for weighting an occurrence frequency relative to a maximum possible length;

    scoring each identified possible acronym and synonym pair based on the user-selected weighting factor, occurrence frequency and maximum possible length, and wherein the scoring of each identified possible acronym and synonym pair further includes only scoring pairs with a longer maximum length higher than terms with a shorter maximum length when those pairs have substantially the same occurrence frequency;

    determining that at least one of the identified acronym and synonym pairs includes a pair in which a longer maximum length higher than terms with a shorter maximum length when those pairs have substantially the same occurrence frequency;

    only ranking the at least one identified acronym and synonym pair with the longer maximum length, such that only one of those pairs that had substantially the same occurrence frequency is ranked, wherein each of the acronym and synonym pairs are ranked relative to the plurality of ranked acronym and synonym pairs; and

    displaying the ranked acronym and synonym pairs from among the plurality of ranked acronym and synonym pairs.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×