Method and apparatus for training a text independent speaker recognition system using speech data with text labels
Abstract
There is provided an apparatus for providing a Text Independent (TI) speaker recognition mode in a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition system and/or a Text Constrained (TC) HMM speaker recognition system. The apparatus includes a Gaussian Mixture Model (GMM) generator and a Gaussian weight normalizer. The GMM generator is for creating a GMM by pooling Gaussians from a plurality of HMM states. The Gaussian weight normalizer is for normalizing Gaussian weights with respect to the plurality of HMM states.
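The pooling and normalization described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the `Gaussian` record, the `pool_states_to_gmm` function, the diagonal covariance, and the optional `state_priors` argument (uniform by default) are all assumptions introduced here for illustration. The source only states that Gaussians are pooled from a plurality of HMM states and that their weights are normalized with respect to those states.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class Gaussian:
    mean: List[float]
    var: List[float]   # diagonal covariance (an assumption of this sketch)
    weight: float      # mixture weight inside its originating HMM state


def pool_states_to_gmm(
    states: Sequence[Sequence[Gaussian]],
    state_priors: Optional[Sequence[float]] = None,
) -> List[Gaussian]:
    """Pool the Gaussians of all HMM states into one GMM.

    Each pooled weight is the in-state weight scaled by a per-state prior
    (uniform if none is given, which is an assumption), then the whole set
    is renormalized so the pooled weights sum to 1.
    """
    if not states:
        raise ValueError("at least one HMM state is required")
    if state_priors is None:
        state_priors = [1.0 / len(states)] * len(states)
    if len(state_priors) != len(states):
        raise ValueError("one prior per HMM state is required")

    pooled: List[Gaussian] = []
    for prior, gaussians in zip(state_priors, states):
        state_total = sum(g.weight for g in gaussians)
        if state_total <= 0.0:
            raise ValueError("each state needs positive total weight")
        for g in gaussians:
            pooled.append(
                Gaussian(list(g.mean), list(g.var), prior * g.weight / state_total)
            )

    # Final renormalization guards against priors that do not sum to 1.
    total = sum(g.weight for g in pooled)
    return [Gaussian(g.mean, g.var, g.weight / total) for g in pooled]
```

With uniform priors, every HMM state contributes the same total mass to the pooled GMM regardless of how many Gaussians it holds. Passing state occupancy estimates as `state_priors` would be an alternative choice that the source neither confirms nor excludes.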
14 Claims
1. A method, comprising the steps of:
- providing a Text Independent (TI) speaker recognition mode in one of a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition system and a Text Constrained (TC) HMM speaker recognition system, wherein said providing step comprises:
creating a Gaussian Mixture Model (GMM) by pooling Gaussians from a plurality of HMM states; and
normalizing Gaussian weights with respect to the plurality of HMM states.
Dependent claims: 2, 3, 4
5. A method, comprising the steps of:
- providing one of a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition mode and a Text Constrained (TC) HMM speaker recognition mode in a Text Independent (TI) Gaussian Mixture Model (GMM) speaker recognition system, wherein said providing step comprises:
creating an HMM by assigning states to Gaussians from a GMM; and
calculating state transition probabilities and Gaussian weights with respect to a plurality of HMM states.
Dependent claims: 6, 7, 8
9. A method, comprising the steps of:
- providing one of a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition mode and a Text Constrained (TC) HMM speaker recognition mode in another one of a TD HMM speaker recognition system and a TC HMM speaker recognition system, wherein said providing step comprises:
creating an HMM with one of a smaller number of states and a larger number of states by one of pooling Gaussians from a plurality of HMM states into a single HMM state and splitting the Gaussians from the plurality of HMM states into different HMM states, respectively; and
normalizing Gaussian weights with respect to the HMM states.
Dependent claims: 10, 11, 12, 13, 14
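The state-reduction branch of claim 9 (pooling Gaussians from several HMM states into a single state, then normalizing weights per resulting state) can be sketched as follows. This is an illustrative sketch only: the tuple layout, the `merge_states` function, the `groups` argument, and the equal split of mass among the merged source states are assumptions not stated in the source. Transition probabilities and the state-splitting branch are not covered.

```python
from typing import List, Sequence, Tuple

# (weight, mean, diagonal variance); this layout is an assumption of the sketch.
Gaussian = Tuple[float, List[float], List[float]]


def merge_states(
    states: Sequence[Sequence[Gaussian]],
    groups: Sequence[Sequence[int]],
) -> List[List[Gaussian]]:
    """Build an HMM with fewer states by pooling Gaussians per group.

    `groups[k]` lists the indices of the original states that are pooled
    into new state k. Weights are renormalized so that each new state's
    mixture weights sum to 1, with equal mass per source state (assumed).
    """
    merged: List[List[Gaussian]] = []
    for group in groups:
        if not group:
            raise ValueError("each new state needs at least one source state")
        pooled: List[Gaussian] = []
        for idx in group:
            state_total = sum(w for w, _, _ in states[idx])
            if state_total <= 0.0:
                raise ValueError("each state needs positive total weight")
            for w, mean, var in states[idx]:
                # Normalize within the source state, then share the new
                # state's unit mass equally among its source states.
                pooled.append((w / state_total / len(group), mean, var))
        merged.append(pooled)
    return merged
```

For example, a three-state model with `groups=[[0, 1], [2]]` would yield a two-state model whose first state holds every Gaussian from the original states 0 and 1.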
Specification