Method and apparatus for providing a speaker adapted speech recognition model set
First Claim
1. A method, comprising:
- providing a speaker independent speech recognition model set to be used when recognizing speech;
providing a speaker independent speech feature space model that is at least partially different from the speaker independent speech recognition model;
receiving speech from a particular speaker;
using the speech to provide a corresponding speaker dependent speech feature space model;
using the speaker independent speech feature space model and the speaker dependent speech feature space model to provide at least one resultant set of alignment indices;
using the at least one resultant set of alignment indices to modify the speaker independent speech recognition model to provide a speaker adapted speech recognition model set.
4 Assignments
0 Petitions
Accused Products
Abstract
Speech feature vectors (10) are provided and utilized to develop a corresponding estimated speaker dependent speech feature space model (20) (in one embodiment, it is not necessary that this model (20) have defined correlations with the verbal content of the represented speech itself). A model alignment unit (21) then contrasts this model (20) against the contents of a speaker independent speech feature space model (24) to provide alignment indices to a transformation estimation unit (23). In one embodiment, these alignment indices are based, as least in part, upon a measure of the differences between likelihoods of occurrence for the elements that comprise the constituency of these models. The transformation estimation unit (23) utilizes these alignment indices to provide transformation parameters to a model transformation unit (25) that uses such parameters to transform a speaker independent speech recognition model set (26) and yield a resultant speaker adapted speech recognition model set (27).
68 Citations
24 Claims
-
1. A method, comprising:
-
providing a speaker independent speech recognition model set to be used when recognizing speech;
providing a speaker independent speech feature space model that is at least partially different from the speaker independent speech recognition model;
receiving speech from a particular speaker;
using the speech to provide a corresponding speaker dependent speech feature space model;
using the speaker independent speech feature space model and the speaker dependent speech feature space model to provide at least one resultant set of alignment indices;
using the at least one resultant set of alignment indices to modify the speaker independent speech recognition model to provide a speaker adapted speech recognition model set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method comprising:
-
providing a speaker independent speech recognition model set to be used when recognizing speech;
providing a speaker independent speech feature space model;
receiving speech from a particular speaker;
using the speech to provide a corresponding speaker dependent speech feature space model;
using the speaker independent speech feature space model and the speaker dependent speech feature space model to provide at least one resultant set of alignment indices as a function, at least in part, of a likelihood that a given speaker dependent speech feature space model will occur;
using the at least one resultant set of alignment indices to modify the speaker independent speech recognition model to provide a speaker adapted speech recognition model set. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. An apparatus comprising:
-
a speech feature vector input;
a speaker dependent speech feature space model estimation unit that is operably coupled to the speech feature vector input and having an output providing speaker dependent acoustic feature space model information;
speaker independent acoustic feature space model information;
a speech feature model alignment unit responsive to the speaker dependent acoustic feature space model information and the speaker independent acoustic feature space model information and having an output providing model alignment indices that correspond to differences between speaker dependent feature space models and speaker independent feature space models that correspond to one another as a function, at least in part, of a probability of occurrence of each such model;
a transformation estimation unit responsive to the model alignment indices and having an output providing model transformation parameters;
a model transformation unit responsive to the model transformation parameters and to a speaker independent speech recognition model set and having an output providing a speaker adapted speech recognition model set. - View Dependent Claims (23, 24)
-
Specification