×

Linguistic key normalization

  • US 8,521,516 B2
  • Filed: 03/25/2009
  • Issued: 08/27/2013
  • Est. Priority Date: 03/26/2008
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method executed by one or more processors, the method comprising:

  • receiving a collection of phrases;

    normalizing a plurality of phrases of the collection of phrases, the normalizing being based at least in part on lexicographic normalizing rules;

    generating a normalized phrase table including a plurality of key-value pairs, each key-value pair having a key that includes a normalized phrase and a value that includes one or more un-normalized phrases associated with the normalized phrase of the key and one or more parameters associated with each un-normalized phrase, the one or more parameters including a translation corresponding to the normalized phrase and a probability for the translation given the normalized phrase;

    receiving a training phrase;

    normalizing the training phrase according to one or more lexicographic normalization rules;

    locating the normalized training phrase in a normalized phrase table, the normalized phrase table including a plurality of key-value pairs, each key-value pair having a key that includes a normalized phrase and a value that includes one or more un-normalized phrases associated with the normalized phrase of the key and one or more parameters associated with each un-normalized phrase;

    associating one or more weights to one or more un-normalized phrases associated with the key-value pair for the identified normalized training phrase in the normalized phrase table based on a relation of each associated un-normalized phrase to the received training phrase; and

    determining a degree of match between the received training phrase and a specific un-normalized phrase associated with the located normalized training phrase, the degree of match being determined according to a similarity measure, wherein associating one or more weights comprises;

    associating a first weight to the specific un-normalized phrase when the training phrase has a high degree of match with the specific un-normalized phrase, andassociating a second weight to the specific un-normalized phrase when the training phrase has a low degree of match with the specific un-normalized phrase.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×