Systems and methods for lexicon generation

  • US 8,527,513 B2
  • Filed: 08/26/2010
  • Issued: 09/03/2013
  • Est. Priority Date: 08/26/2010
  • Status: Active Grant
First Claim

1. A method for lexicon generation, comprising the steps of:

    determining a corpus term from a plurality of documents;

    generating a candidate term from the corpus term, wherein generating the candidate term comprises generating a linguistic variant of the corpus term;

    generating a plurality of equivalent terms from the candidate term;

    validating the plurality of equivalent terms by comparing the plurality of equivalent terms to frequency of occurrence of the candidate term;

    linking each of the plurality of equivalent terms to the candidate term to create respective equivalent term pairs;

    determining whether any of the equivalent term pairs are equivalent and, in response to determining that at least two of equivalent term pairs are equivalent, merging the equivalent term pairs to create a group of equivalent terms;

    selecting a normalized term from the group of equivalent terms; and

    storing the group of equivalent terms.
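
The steps of claim 1 can be sketched in code. The following is a minimal illustration under stated assumptions, not the patented implementation: the `singularize` helper, the `synonym_table` input, the `min_ratio` threshold, and the choice of the most frequent member as the normalized term are all hypothetical stand-ins for details the claim leaves open.

```python
import re
from collections import Counter, defaultdict


def singularize(term):
    """Naive linguistic variant (assumption): strip a trailing 's'."""
    return term[:-1] if term.endswith("s") and len(term) > 3 else term


def build_lexicon(documents, synonym_table, min_ratio=0.1):
    # Determine corpus terms and reduce each one to a candidate term,
    # counting how often each candidate occurs across the documents.
    freq = Counter(
        singularize(tok)
        for doc in documents
        for tok in re.findall(r"[a-z]+", doc.lower())
    )

    # Generate equivalent terms from a hypothetical synonym table, validate
    # each against the candidate's frequency (assumed rule: the equivalent
    # must occur, and at least min_ratio times as often as the candidate),
    # and link the survivors to the candidate as equivalent term pairs.
    pairs = []
    for candidate, cand_count in freq.items():
        for equiv in synonym_table.get(candidate, ()):
            if freq[equiv] > 0 and freq[equiv] >= min_ratio * cand_count:
                pairs.append((candidate, equiv))

    # Merge pairs that share a term into groups, using union-find.
    parent = {}

    def find(t):
        parent.setdefault(t, t)
        while parent[t] != t:
            parent[t] = parent[parent[t]]  # path halving
            t = parent[t]
        return t

    for a, b in pairs:
        parent[find(a)] = find(b)

    groups = defaultdict(set)
    for t in list(parent):
        groups[find(t)].add(t)

    # Select a normalized term per group (assumption: most frequent member,
    # ties broken alphabetically) and store the group under it.
    lexicon = {}
    for members in groups.values():
        normalized = min(members, key=lambda t: (-freq[t], t))
        lexicon[normalized] = sorted(members)
    return lexicon


docs = [
    "The car stopped. Cars and automobiles were parked.",
    "An automobile or a vehicle; the auto was fast.",
]
table = {"car": ["automobile", "auto", "motorcar"], "automobile": ["vehicle"]}
lexicon = build_lexicon(docs, table)
# lexicon == {"automobile": ["auto", "automobile", "car", "vehicle"]}
```

In this example "motorcar" is rejected at the validation step because it never occurs in the documents, and the pairs (car, automobile), (car, auto), and (automobile, vehicle) merge into one group because they share terms. An in-memory dictionary stands in for the storing step; a real system would presumably persist the groups.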

  • 2 Assignments