System and method for personalization of acoustic models for automatic speech recognition
First Claim
1. A method comprising:
- starting a current automatic speech recognition session for recognizing speech received from a user via a device;
identifying, via a processor, a group of speech recognition models comprising a speaker independent model and a speaker dependent model;
recognizing the speech via each model in the group of speech recognition models, to yield recognition results;
selecting, based on the recognition results, a dominant speech model from the group of speech recognition models to yield a remainder set of dropped speech recognition models; and
continuously using only the dominant speech model, without applying the remainder set of dropped speech recognition models, to recognize additional speech received from the user.
3 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are methods, systems, and computer-readable storage media for automatic speech recognition. The method includes selecting a speaker independent model, and selecting a quantity of speaker dependent models, the quantity of speaker dependent models being based on available computing resources, the selected models including the speaker independent model and the quantity of speaker dependent models. The method also includes recognizing an utterance using each of the selected models in parallel, and selecting a dominant speech model from the selected models based on recognition accuracy using the group of selected models. The system includes a processor and modules configured to control the processor to perform the method. The computer-readable storage medium includes instructions for causing a computing device to perform the steps of the method.
-
Citations
20 Claims
-
1. A method comprising:
-
starting a current automatic speech recognition session for recognizing speech received from a user via a device; identifying, via a processor, a group of speech recognition models comprising a speaker independent model and a speaker dependent model; recognizing the speech via each model in the group of speech recognition models, to yield recognition results; selecting, based on the recognition results, a dominant speech model from the group of speech recognition models to yield a remainder set of dropped speech recognition models; and continuously using only the dominant speech model, without applying the remainder set of dropped speech recognition models, to recognize additional speech received from the user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising; starting a current automatic speech recognition session for recognizing speech received from a user via a device; identifying a group of speech recognition models comprising a speaker independent model and a speaker dependent model; recognizing the speech via each model in the group of speech recognition models, to yield recognition results; selecting, based on the recognition results, a dominant speech model from the group of speech recognition models to yield a remainder set of dropped speech recognition models; and continuously using only the dominant speech model, without applying the remainder set of dropped speech recognition models, to recognize additional speech received from the user. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A computer-readable storage device having instructions stored which, when executed by a computing device, result in the computing device performing operations comprising:
-
starting a current automatic speech recognition session for recognizing speech received from a user via a device; identifying a group of speech recognition models comprising a speaker independent model and a speaker dependent model; recognizing the speech via each model in the group of speech recognition models, to yield recognition results; selecting, based on the recognition results, a dominant speech model from the group of speech recognition models to yield a remainder set of dropped speech recognition models; and continuously using only the dominant speech model, without applying the remainder set of dropped speech recognition models, to recognize additional speech received from the user.
-
Specification