×

Methods and systems for augmenting a token lexicon

  • US 8,051,096 B1
  • Filed: 09/30/2004
  • Issued: 11/01/2011
  • Est. Priority Date: 09/30/2004
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method, comprising:

  • receiving a character string in an alphanumeric format having no token-delineating breaks and comprising one or more tokens in the alphanumeric format; and

    for each of the one or more tokens, parsing the received character string into a first portion containing a first token and a second portion containing the remaining tokens;

    identifying the first token in one or more logs associated with multiple previously received search requests;

    determining a frequency with which the identified first token appears in the one or more logs;

    determining whether the determined frequency with which the identified first token appears in the one or more logs exceeds a first threshold level; and

    storing the identified first token in a lexicon data storage based on the determination of whether the determined frequency with which the identified first token appears in the one or more logs exceeds the first threshold level, wherein the lexicon data storage comprises an ontology associating at least one of a misspelling of the first token with a correct spelling, or an alternate spelling of the first token with a different spelling.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×