Method for likelihood computation in multi-stream HMM based speech recognition
First Claim
1. A method for speech recognition, comprising the steps of:
determining active Gaussians related to a first feature stream and a second feature stream by hierarchically labeling at least one of the first and second streams by surveying Gaussians in multiple resolutions;
determining active Gaussians co-occurring in the first stream and the second stream based upon joint probability wherein the first stream includes an audio stream and the second stream includes a video stream;
reducing a number of Gaussians computed for the second stream based upon Gaussians already computed for the first stream and a number of Gaussians co-occurring in the second stream; and
decoding speech based on the Gaussians computed for the first and second streams by employing multi-stream hidden Markov models.
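The hierarchical labeling step of the claim can be illustrated with a minimal sketch. It is not the patented implementation: it assumes a two-level hierarchy of diagonal-covariance Gaussians, and the names `log_gauss`, `hierarchical_label`, `TREE`, and `beam`, as well as all numeric values, are hypothetical. A frame is first scored against a few coarse Gaussians, and only the fine Gaussians under the best-scoring coarse ones are evaluated and treated as active.

```python
import math


def log_gauss(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def hierarchical_label(x, tree, beam=1):
    """Return {fine Gaussian id: log-likelihood} for the active Gaussians.

    `tree` is a list of (coarse_mean, coarse_var, children) tuples, where
    `children` maps a fine Gaussian id to its (mean, var). Only the children
    of the `beam` best-scoring coarse Gaussians are evaluated (assumption:
    two resolution levels; a real system may use more).
    """
    ranked = sorted(
        tree, key=lambda node: log_gauss(x, node[0], node[1]), reverse=True
    )
    active = {}
    for _, _, children in ranked[:beam]:
        for gid, (mean, var) in children.items():
            active[gid] = log_gauss(x, mean, var)
    return active


# Toy hierarchy; every value here is made up for illustration.
TREE = [
    ([0.0, 0.0], [4.0, 4.0],
     {"g0": ([-1.0, 0.0], [1.0, 1.0]), "g1": ([1.0, 0.0], [1.0, 1.0])}),
    ([10.0, 10.0], [4.0, 4.0],
     {"g2": ([9.0, 10.0], [1.0, 1.0]), "g3": ([11.0, 10.0], [1.0, 1.0])}),
]

active = hierarchical_label([0.8, 0.1], TREE, beam=1)
best = max(active, key=active.get)
```

With `beam=1`, only two of the four fine Gaussians are scored for this frame, which is the kind of saving the multi-resolution survey is meant to provide.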
Abstract
A method for speech recognition includes determining active Gaussians related to a first feature stream and a second feature stream by labeling at least one of the first and second streams, and determining active Gaussians co-occurring in the first stream and the second stream based upon joint probability. A number of Gaussians computed is reduced based upon Gaussians already computed for the first stream and a number of Gaussians co-occurring in the second stream. Speech is decoded based on the Gaussians computed for the first and second streams.
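The reduction step described in the abstract can be sketched as follows. This is an illustration under stated assumptions, not the method as specified: it assumes a precomputed table of joint probabilities between audio and video Gaussians, and the names `video_shortlist`, `COOC`, and `min_joint`, along with the threshold and table values, are hypothetical. Given the Gaussians already found active in the first (audio) stream, only the second-stream (video) Gaussians that co-occur with them with sufficient joint probability are kept for computation.

```python
def video_shortlist(active_audio, cooccurrence, min_joint=0.01):
    """Return the set of video Gaussian ids worth computing for this frame.

    `cooccurrence[a][v]` is an assumed joint probability that audio
    Gaussian `a` and video Gaussian `v` are active together. Video
    Gaussians whose joint probability with every active audio Gaussian
    falls below `min_joint` are skipped.
    """
    shortlist = set()
    for a in active_audio:
        for v, p in cooccurrence.get(a, {}).items():
            if p >= min_joint:
                shortlist.add(v)
    return shortlist


# Toy co-occurrence table; all values are made up for illustration.
COOC = {
    "a0": {"v0": 0.20, "v1": 0.05, "v2": 0.001},
    "a1": {"v1": 0.15, "v3": 0.002},
    "a2": {"v4": 0.30},
}

shortlist = video_shortlist(["a0", "a1"], COOC, min_joint=0.01)
```

In this toy table there are five video Gaussians, and the shortlist for the two active audio Gaussians contains two of them, so the remaining three need not be evaluated for the frame.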
Specification