Transformation and combination of hidden Markov models for speaker selection training
First Claim
Patent Images
1. A method of transforming and combining a plurality of models representing training speakers into a model for a test speaker, comprising:
- selecting a set of cohort speakers from the training speakers;
transforming a plurality of models representing the cohort speakers based on speech data from the test speaker; and
combining the plurality of transformed models to form the model for the test speaker.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention is directed to a 3-stage adaptation framework based on speaker selection training. First a subset of cohort speakers is selected for a test speaker. Then cohort models are transformed to be closer to the test speaker. Finally the adapted model for the test speaker is obtained by combining these transformed cohort models. Combination weights as well as bias items can be adaptively learned from adaptation data.
22 Citations
19 Claims
-
1. A method of transforming and combining a plurality of models representing training speakers into a model for a test speaker, comprising:
-
selecting a set of cohort speakers from the training speakers;
transforming a plurality of models representing the cohort speakers based on speech data from the test speaker; and
combining the plurality of transformed models to form the model for the test speaker. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system for transforming and combining a plurality of models representing training speakers into a model for a test speaker, comprising:
-
an adaptation component configured to receive a speech input that is generated from a plurality of training speakers;
a Gaussian Mixture Model (GMM) generating component configured to generate Gaussian Mixture Models for each of the training speakers;
a speaker selection component configured to select a cohort of speakers based on a relative probability of each of the training speakers of matching the test speaker;
a model transformation component transforms the models for the cohort speakers to more closely match the test speaker; and
a model combination component configured to combine the transformed models of the cohort speakers to more closely match that of the test speaker. - View Dependent Claims (16, 17, 18, 19)
-
Specification