×

Multi-frame prediction for hybrid neural network/hidden Markov models

  • US 8,442,821 B1
  • Filed: 07/27/2012
  • Issued: 05/14/2013
  • Est. Priority Date: 07/27/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • transforming an audio input signal, using one or more processors of a system, into a first time sequence of feature vectors, each respective feature vector of the first time sequence bearing quantitative measures of acoustic properties of a corresponding, respective temporal frame of a first sequence of temporal frames of the audio input signal;

    providing, at a first time step, the first time sequence of feature vectors as input to a neural network (NN) implemented by the one or more processors of the system;

    by concurrently processing the feature vectors in the first time sequence with the NN during the first time step, concurrently determining emission probabilities corresponding to all the temporal frames of the first sequence of temporal frames, wherein concurrently determining the emission probabilities corresponding to all the temporal frames of the first sequence of temporal frames comprises;

    during a common time interval, determining for each feature vector in the first time sequence a respective set of emission probabilities for a first plurality of hidden Markov models (HMMs); and

    concurrently applying the emission probabilities determined at the first time step for the feature vectors in the first time sequence to the first plurality of HMMs to determine speech content corresponding to the first sequence of temporal frames of the audio input signal.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×