×

Adaptation of exponential models

  • US 7,860,314 B2
  • Filed: 10/29/2004
  • Issued: 12/28/2010
  • Est. Priority Date: 07/21/2004
  • Status: Expired due to Fees
First Claim
Patent Images

1. A non-transitory computer storage medium having computer-executable instructions that when executed by a processor cause the processor to perform steps comprising:

  • for each of a first set of feature threshold counts, performing steps comprising;

    selecting a set of features from background data, where each feature in the set appears in the background data more than a number of times represented by the feature threshold count;

    for each of a first set of variances of a prior model, performing steps comprising;

    training a set of weights comprising a separate weight for each feature in the selected set of features from the background data such that the set of weights maximizes the likelihood of the set of background data using update equations for the weights that are based on an exponential probability model and relative frequencies in the background data of co-occurrences of contexts and capitalization tags, wherein each trained set of weights and respective selected set of features from the background data represent a separate model;

    applying each separate model to a set of background development data and selecting the model with the best accuracy as an initial model having an initial set of weights and an initial set of features from the background data;

    for each of a second set of feature threshold counts performing steps comprising;

    selecting a set of features from adaptation data, where each feature in the set of features appears in the adaptation data more than a number of times represented by the feature threshold count from the second set of feature threshold counts, wherein the adaptation data is smaller than the background data;

    for each of a second set of variances of the prior model, performing steps comprising;

    the processor determining an adapted set of weights comprising a separate weight for each feature in a union of the selected set of features from the adaptation data and the initial set of features from the background data such that the set of weights maximize the likelihood of a set of adaptation data, and such that a weight for a feature that is present in the initial set of features from the background data but that is not present in the selected set of features from the adaptation data is updated when determining an adapted set of weights, wherein the likelihood of the set of adaptation data is based on;

    a second exponential probability model;



    a prior model for the set of weights that comprises means with values equal to the initial set of weights for features that are present in the initial set of features from the background data and means with values equal to zero for features that are not present in the initial set of features from the background data but that are present in the set of features from the adaptation data; and

    relative frequencies in the adaptation data of co-occurrences of contexts and capitalization tags;

    selecting a set of adapted weights as a final adapted model be determining which set of adapted weights provides the highest likelihood for a asset of adaptation development data.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×