×

Language Modeling Using Entities

  • US 20150340024A1
  • Filed: 05/11/2015
  • Published: 11/26/2015
  • Est. Priority Date: 05/23/2014
  • Status: Abandoned Application
First Claim
Patent Images

1. A computer-implemented method, comprising:

  • obtaining a plurality of text samples;

    for each of one or more text samples in the plurality of text samples;

    determining that at least one term in the text sample corresponds to a first entity in a data structure of entities, wherein the data structure includes representations of a plurality of entities and defines relationships among particular ones of the plurality of entities;

    determining classes to which the first entity within the data structure of entities belongs; and

    annotating the text sample with one or more labels that indicate respective classes to which the first entity corresponding to the at least one term belongs;

    generating a class-based training set of text samples by substituting the one or more terms in the one or more text samples with respective class identifiers for the one or more terms that correspond to the respective labels for the one or more terms;

    training a class-based language model using the class-based training set of text samples;

    training a plurality of class-specific language models; and

    performing speech recognition on an utterance using the class-based language model and at least one class-specific language model from among the plurality of class-specific language models.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×