Method and system of adapting speech recognition models to speaker environment
First Claim
1. A method of recognizing speech comprising the steps of:
- receiving a spoken password utterance for access to a speaker environment;
getting a set of speaker independent(SI) speech recognition models;
determining a mapping sequence between the SI speech recognition models and speech input frames in the spoken password utterance that comprise recognition of the utterance;
determining a transform between the SI speech recognition models and the spoken password utterance using the mapping sequence;
generating speaker adapted (SA) speech recognition models by applying the transform to SI speech recognition models; and
recognizing a nonpassword speech utterance in said speaker environment by applying the SA speech recognition models.
1 Assignment
0 Petitions
Accused Products
Abstract
The method and system of adapting speech recognition models to a speaker environment may comprise receiving a spoken password (52) and getting a set of speaker independent (SI) speech recognition models (54). A mapping sequence may be determined for the spoken password (56). Using the mapping sequence, a speaker ID may be identified (58). A transform may be determined (66) between the SI speech recognition models and the spoken password using the mapping sequence. Speaker adapted (SA) speech recognition models may be generated (68) by applying the transform to SI speech recognition models. A speech input may be recognized (70) by applying the SA speech recognition models.
-
Citations
23 Claims
-
1. A method of recognizing speech comprising the steps of:
-
receiving a spoken password utterance for access to a speaker environment; getting a set of speaker independent(SI) speech recognition models; determining a mapping sequence between the SI speech recognition models and speech input frames in the spoken password utterance that comprise recognition of the utterance; determining a transform between the SI speech recognition models and the spoken password utterance using the mapping sequence; generating speaker adapted (SA) speech recognition models by applying the transform to SI speech recognition models; and recognizing a nonpassword speech utterance in said speaker environment by applying the SA speech recognition models. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of recognizing speech, comprising the steps of:
-
receiving a spoken password utterance for access to a speaker environment; getting a set of speaker independent (SI) speech recognition models; determining a mapping sequence between the SI speech recognition models and speech input frames for the spoken password utterance identifying a speaker ID from the mapping sequence between the SI speech recognition models and the spoken password utterance; determining a transform between the SI speech recognition models and the spoken password utterance using the mapping sequence; generating speaker adapted (SA) speech recognition models by applying the transform to SI speech recognition models; and recognizing a nonpassword speech utterance in said speaker environment by applying the SA speech recognition models to the nonpassword speech utterance. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
-
-
19. A speech recognition system, comprising:
-
a recognition engine having an identification module and an adaption module; a database having a set of speaker independent (SI) speech recognition models; the identification module operable to receive a spoken password utterance, determine a mapping sequence of the spoken password utterance in a speaker environment to SI speech recognition models, and identify the speaker from the mapping sequence; the adaption module operable to determine a transform between the SI speech recognition models and the spoken password utterance using the mapping sequence and to generate a speaker adapted (SA) speech recognition model by applying the transform to SI speech recognition models; and the recognition engine operable to recognize a nonpassword speech utterance in said speaker environment by applying the SA speech recognition model. - View Dependent Claims (20)
-
-
21. A speech recognition system, comprising:
-
a recognition engine having an identification module and an adaption module; a database having a set of speaker independent (SI) speech recognition models; the identification module operable to receive a spoken password utterance determine a mapping sequence of the spoken password utterance in a speaker environment to SI speech recognition models, and identify the speaker from the mapping sequence; the adaption module operable to determine a transform between the SI speech recognition models and the spoken password utterance using the mapping sequence and to generate a speaker adapted (SA) speech recognition model by applying the transform to SI speech recognition models; and the recognition engine operable to recognize a nonpassword speech utterance in said speaker environment by applying the SA speech recognition model to the nonpassword speech utterance. - View Dependent Claims (22)
-
-
23. A method of recognizing speech comprising the steps of:
-
receiving a spoken keyword utterance in a speaker environment; getting a set of speaker independent (SI) speech recognition models; determining a mapping sequence between the SI speech recognition models and the speech input frames in the spoken keyword utterance; determining a transform between the SI speech recognition models and the spoken keyword utterance using the mapping sequence; generating speaker adapted (SA) speech recognition models by applying the transform to SI speech recognition models; and recognizing a nonkeyword speech utterance in said speaker environment by applying the SA speech recognition models.
-
Specification