System and method for likelihood computation in multi-stream HMM based speech recognition
First Claim
Patent Images
1. A method for speech recognition, comprising the steps of:
- determining active Gaussians related to a first feature stream and a second feature stream by labeling at least one of the first and second streams;
determining active Gaussians co-occurring in the first stream and the second stream based upon joint probability;
reducing a number of Gaussians computed for the second stream based upon Gaussians already computed for the first stream and a number of Gaussians co-occurring in the second stream; and
decoding speech based on the Gaussians computed for the first and second streams.
2 Assignments
0 Petitions
Accused Products
Abstract
A system and method for speech recognition includes determining active Gaussians related to a first feature stream and a second feature stream by labeling at least one of the first and second streams, and determining active Gaussians co-occurring in the first stream and the second stream based upon joint probability. A number of Gaussians computed is reduced based upon Gaussians already computed for the first stream and a number of Gaussians co-occurring in the second stream. Speech is decoded based on the Gaussians computed for the first and second streams.
9 Citations
22 Claims
-
1. A method for speech recognition, comprising the steps of:
-
determining active Gaussians related to a first feature stream and a second feature stream by labeling at least one of the first and second streams;
determining active Gaussians co-occurring in the first stream and the second stream based upon joint probability;
reducing a number of Gaussians computed for the second stream based upon Gaussians already computed for the first stream and a number of Gaussians co-occurring in the second stream; and
decoding speech based on the Gaussians computed for the first and second streams. - View Dependent Claims (2, 3, 4, 5, 6, 7, 13)
-
-
8. A method for speech recognition based upon a plurality of feature streams, comprising the steps of:
-
determining active Gaussians related to a first feature stream by hierarchically labeling the first feature stream;
determining active Gaussians co-occurring in the feature streams other than the first feature stream based upon joint probability using cooccurence statistics such that a number of Gaussians computed for subsequent feature streams are reduced based upon co-occurring Gaussians already computed for at least one other feature stream; and
decoding speech based on the Gaussians computed for the plurality of feature streams. - View Dependent Claims (9, 10, 11, 12, 14)
-
-
15. A speech recognition system, comprising:
-
a first front end, which extracts features from a first stream to generate likelihoods of the features of the first stream;
a second front end, which extracts features from a second stream associated with the first stream for generating likelihoods of the features of the second stream;
a processing module, which determines active Gaussians used to compute the likelihoods of the features of the first stream and finds active Gaussians co-occurring in the second stream to generate the likelihoods of the features of the second stream such that a number of Gaussians computed for the second stream is reduced based upon Gaussians already computed for the first stream; and
a speech decoder which decodes speech based on the Gaussians computed for the first and second streams. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22)
-
Specification