Source normalization training for HMM modeling of speech
First Claim
1. An improved speech recognition system comprising:
a speech recognizer; and
a source normalization model coupled to said recognizer for recognizing incoming speech;
said model derived by a method of source normalization training for HMM modeling comprising the steps of:
a) providing an initial speech recognition model and
b) performing on said initial speech recognition model the following steps to get a new speech recognition model;
b1) estimation of intermediate quantities;
b2) performing re-estimation to determine probabilities;
b3) deriving mean vector and bias vector; and
b4) solving jointly for mean vector and bias vector.
Abstract
A maximum likelihood (ML) linear regression (LR) solution to environment normalization is provided where the environment is modeled as a hidden (non-observable) variable. By applying an expectation-maximization algorithm and extending the Baum-Welch forward and backward variables (Steps 23a–23d), source normalization is achieved without the need to label a database in terms of environment, such as speaker identity, channel, microphone, and noise type.
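The idea of treating the environment as a hidden variable can be illustrated with a toy expectation-maximization loop. The sketch below is not the patented procedure: it assumes scalar features, unit variances, a known state alignment (so no Baum-Welch forward and backward pass is needed), a bias-only environment effect with no regression matrix, and the first bias pinned to zero so that means and biases are identifiable. The function names (`solve`, `e_step`, `m_step`, `train`) and the data in `UTTS` are hypothetical. Each utterance carries no environment label; the E-step infers a posterior over environments, and the M-step solves one linear system for the state means and environment biases together.

```python
import math

def solve(A, r):
    """Solve A x = r by Gaussian elimination with partial pivoting."""
    n = len(r)
    M = [A[i][:] + [r[i]] for i in range(n)]
    for c in range(n):
        p = max(range(c, n), key=lambda i: abs(M[i][c]))
        M[c], M[p] = M[p], M[c]
        for i in range(c + 1, n):
            f = M[i][c] / M[c][c]
            for j in range(c, n + 1):
                M[i][j] -= f * M[c][j]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def e_step(utts, mu, b, prior):
    """Posterior over the hidden environment of each utterance."""
    gammas, loglik = [], 0.0
    for states, obs in utts:
        scores = []
        for e in range(len(b)):
            # Assumed model: obs ~ Normal(mu[state] + b[env], 1).
            ss = sum((o - mu[s] - b[e]) ** 2 for s, o in zip(states, obs))
            scores.append(math.log(prior[e]) - 0.5 * ss)
        top = max(scores)
        norm = top + math.log(sum(math.exp(v - top) for v in scores))
        gammas.append([math.exp(v - norm) for v in scores])
        loglik += norm  # log-likelihood up to an additive constant
    return gammas, loglik

def m_step(utts, gammas, n_states, n_envs):
    """Solve the weighted normal equations for means and biases at once."""
    n = n_states + n_envs - 1  # b[0] is pinned to 0 (assumption)
    A = [[0.0] * n for _ in range(n)]
    r = [0.0] * n
    for (states, obs), g in zip(utts, gammas):
        for e in range(n_envs):
            for s, o in zip(states, obs):
                idx = [s] + ([n_states + e - 1] if e > 0 else [])
                for i in idx:
                    r[i] += g[e] * o
                    for j in idx:
                        A[i][j] += g[e]
    theta = solve(A, r)
    mu = theta[:n_states]
    b = [0.0] + theta[n_states:]
    prior = [sum(g[e] for g in gammas) / len(gammas) for e in range(n_envs)]
    return mu, b, prior

def train(utts, mu, b, iters=10):
    """Run EM from an initial model; return parameters and likelihood trace."""
    prior = [1.0 / len(b)] * len(b)
    history = []
    for _ in range(iters):
        gammas, ll = e_step(utts, mu, b, prior)
        history.append(ll)
        mu, b, prior = m_step(utts, gammas, len(mu), len(b))
    return mu, b, prior, history

# Hypothetical data: (state alignment, observations), no environment labels.
UTTS = [
    ([0, 1, 0, 1], [0.0, 5.0, 0.0, 5.0]),
    ([0, 1, 0, 1], [0.1, 4.9, -0.1, 5.1]),
    ([0, 1, 0, 1], [2.0, 7.0, 2.0, 7.0]),
    ([0, 1, 0, 1], [2.1, 6.9, 1.9, 7.1]),
]
mu, b, prior, history = train(UTTS, mu=[1.0, 6.0], b=[0.0, 1.0])
print(mu, b, prior)
```

The initial means here are the pooled per-state averages, which is roughly what a model trained without normalization would learn. On this toy data the estimates should move toward state means near 0 and 5 and a second-environment bias near 2, which shows how an environment offset can be separated from the state means without labels. The initial biases must differ from each other; identical biases leave the environments indistinguishable and EM cannot break the symmetry.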
56 Citations
14 Claims
1. An improved speech recognition system comprising:
a speech recognizer; and
a source normalization model coupled to said recognizer for recognizing incoming speech;
said model derived by a method of source normalization training for HMM modeling comprising the steps of:
a) providing an initial speech recognition model and
b) performing on said initial speech recognition model the following steps to get a new speech recognition model;
b1) estimation of intermediate quantities;
b2) performing re-estimation to determine probabilities;
b3) deriving mean vector and bias vector; and
b4) solving jointly for mean vector and bias vector.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A method of source normalization for modeling of speech comprising the steps of:
a) providing an initial speech recognition model and
b) performing on said initial speech recognition model the following steps to get a new speech recognition model;
b1) estimation of intermediate quantities;
b2) performing re-estimation to determine probabilities;
b3) deriving mean vector and bias vector; and
b4) solving jointly for mean vector and bias vector.
Dependent claims: 9, 10, 11, 12, 13, 14
Specification