Method and apparatus for training a text independent speaker recognition system using speech data with text labels
First Claim
1. An apparatus for providing a Text Independent (TI) speaker recognition mode in one of a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition system and a Text Constrained (TC) HMM speaker recognition system, comprising:
- a Gaussian Mixture Model (GMM) generator for creating a GMM by pooling Gaussians from a plurality of HMM states; and
a Gaussian weight normalizer for normalizing Gaussian weights with respect to the plurality of HMM states.
3 Assignments
0 Petitions
Accused Products
Abstract
There is provided an apparatus for providing a Text Independent (TI) speaker recognition mode in a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition system and/or a Text Constrained (TC) HMM speaker recognition system. The apparatus includes a Gaussian Mixture Model (GMM) generator and a Gaussian weight normalizer. The GMM generator is for creating a GMM by pooling Gaussians from a plurality of HMM states. The Gaussian weight normalizer is for normalizing Gaussian weights with respect to the plurality of HMM states.
43 Citations
27 Claims
-
1. An apparatus for providing a Text Independent (TI) speaker recognition mode in one of a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition system and a Text Constrained (TC) HMM speaker recognition system, comprising:
-
a Gaussian Mixture Model (GMM) generator for creating a GMM by pooling Gaussians from a plurality of HMM states; and
a Gaussian weight normalizer for normalizing Gaussian weights with respect to the plurality of HMM states. - View Dependent Claims (2, 3, 4)
-
-
5. An apparatus for providing one of a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition mode and a Text Constrained (TC) HMM speaker recognition mode in a Text Independent (TI) Gaussian Mixture Model (GMM) speaker recognition system, comprising:
-
an HMM generator for creating an HMM by assigning states to Gaussians from a GMM; and
a probability and weight calculator for calculating state transition probabilities and Gaussian weights with respect to a plurality of HMM states. - View Dependent Claims (6, 7, 8)
-
-
9. An apparatus for providing one of a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition mode and a Text Constrained (TC) HMM speaker recognition mode in another one of a TD HMM speaker recognition system and a TC HMM speaker recognition system, comprising:
-
an HMM generator for creating an HMM with one of a smaller number of states and a larger number of states by one of pooling Gaussians from a plurality of HMM states into a single HMM state and splitting the Gaussians from the plurality of HMM states into different HMM states, respectively; and
a Gaussian weight normalizer for normalizing Gaussian weights with respect to the HMM states. - View Dependent Claims (10, 11, 12, 13)
-
-
14. A method for providing a Text Independent (TI) speaker recognition mode in one of a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition system and a Text Constrained (TC) HMM speaker recognition system, comprising the steps of:
-
creating a Gaussian Mixture Model (GMM) by pooling Gaussians from a plurality of HMM states; and
normalizing Gaussian weights with respect to the plurality of HMM states. - View Dependent Claims (15, 16, 17)
-
-
18. A method for providing one of a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition mode and a Text Constrained (TC) HMM speaker recognition mode in a Text Independent (TI) Gaussian Mixture Model (GMM) speaker recognition system, comprising the steps of:
-
creating an HMM by assigning states to Gaussians from a GMM; and
calculating state transition probabilities and Gaussian weights with respect to a plurality of HMM states. - View Dependent Claims (19, 20, 21)
-
-
22. A method for providing one of a Text Dependent (TD) Hidden Markov Model (HMM) speaker recognition mode and a Text Constrained (TC) HMM speaker recognition mode in another one of a TD HMM speaker recognition system and a TC HMM speaker recognition system, comprising the steps of:
-
creating an HMM with one of a smaller number of states and a larger number of states by one of pooling Gaussians from a plurality of HMM states into a single HMM state and splitting the Gaussians from the plurality of HMM states into different HMM states, respectively; and
normalizing Gaussian weights with respect to the HMM states. - View Dependent Claims (23, 24, 25, 26, 27)
-
Specification