×

Method and apparatus for context-dependent estimation of multiple probability distributions of phonetic classes with multilayer perceptrons in a speech recognition system

  • US 5,317,673 A
  • Filed: 06/22/1992
  • Issued: 05/31/1994
  • Est. Priority Date: 06/22/1992
  • Status: Expired due to Term
First Claim
Patent Images

1. In a speech recognition apparatus having a hidden Markov model speech recognizer, a method for using a multilayer perceptron (MLP) for recognizing speech by context-dependent estimation of a plurality of state-dependent observation probability distributions of phonetic (phone) classes which has weights that have been obtained based on a training set of speech vectors, wherein said training set of said speech vectors has been used to create context-dependent phone classes for use in said method, said speech vectors being characterized by phone classes, the method comprising the steps of:

  • applying input speech vectors containing unknown data to a single input layer of a multilayer perceptron, said multilayer perceptron having a single input layer, a single hidden layer, a single set of weights between said input layer and said hidden layer, and a plurality of output layers with an associated plurality of sets of weights between said hidden layer and said output layers, each one of said output layers having a plurality of output units for storing a plurality of probability values;

    forward propagating each input speech vector through said multilayer perceptron to produce an activation level representative of a probability value at each output unit within each one of said output layers;

    determining likelihood of observing each said input speech vector, assuming a specific state of a hidden Markov model by factoring, according to Bayes rule, said likelihood of observing being in terms of posterior probabilities of phone classes of the speech vector assuming context and the input speech vector, thereby obtaining values representative of context-dependent estimation; and

    employing as input to said hidden Markov model speech recognizer said values representative of context-dependent estimation as state-dependent observation probabilities to identify a specific estimated word sequence from said input speech vectors.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×