SYSTEM AND METHOD FOR RAPID CUSTOMIZATION OF SPEECH RECOGNITION MODELS
First Claim
1. A method of generating a domain-specific speech recognition model, the method comprising:
identifying a speech recognition domain;
combining a plurality of speech recognition models to yield a combined speech recognition model, each speech recognition model of the plurality of speech recognition models being from a respective speech recognition domain;
receiving an amount of data specific to the speech recognition domain, wherein the amount of data is less than a minimum threshold to create a new domain-specific model; and
tuning the combined speech recognition model for the speech recognition domain based on the amount of data.
Abstract
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating domain-specific speech recognition models for a domain of interest by combining and tuning existing speech recognition models when a speech recognizer does not have access to a speech recognition model for that domain of interest and when available domain-specific data is below a minimum desired threshold to create a new domain-specific speech recognition model. A system configured to practice the method identifies a speech recognition domain and combines a set of speech recognition models, each speech recognition model of the set of speech recognition models being from a respective speech recognition domain. The system receives an amount of data specific to the speech recognition domain, wherein the amount of data is less than a minimum threshold to create a new domain-specific model, and tunes the combined speech recognition model for the speech recognition domain based on the data.
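The combine-then-tune flow described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the method the patent claims. The source does not say how models are combined or tuned, so the sketch assumes unigram language models, linear interpolation for combining them, and EM re-estimation of the interpolation weights for tuning. The names `MIN_TOKENS_FOR_NEW_MODEL`, `combine`, `tune_weights`, and `build_domain_model`, and the threshold value itself, are hypothetical.

```python
from collections import Counter

# Hypothetical threshold: with fewer in-domain tokens than this, we assume
# there is not enough data to train a new domain-specific model from scratch.
MIN_TOKENS_FOR_NEW_MODEL = 10_000


def combine(models, weights):
    """Linearly interpolate unigram models (dicts mapping word -> probability)."""
    vocab = set().union(*models)
    return {
        word: sum(lam * m.get(word, 0.0) for lam, m in zip(weights, models))
        for word in vocab
    }


def tune_weights(models, domain_tokens, iterations=20, floor=1e-9):
    """Re-estimate interpolation weights with EM on the in-domain tokens.

    Assumption: tuning only adjusts the mixture weights; the component
    models themselves are left unchanged.
    """
    k = len(models)
    weights = [1.0 / k] * k
    counts = Counter(domain_tokens)
    total = sum(counts.values())
    for _ in range(iterations):
        expected = [0.0] * k
        for word, n in counts.items():
            # E-step: share of this word attributed to each component model.
            parts = [
                lam * max(m.get(word, 0.0), floor)
                for lam, m in zip(weights, models)
            ]
            z = sum(parts)
            for i, p in enumerate(parts):
                expected[i] += n * p / z
        # M-step: new weights are the normalized expected counts.
        weights = [e / total for e in expected]
    return weights


def build_domain_model(models, domain_tokens):
    """Combine existing models and tune the mixture on sparse in-domain data."""
    if len(domain_tokens) >= MIN_TOKENS_FOR_NEW_MODEL:
        raise ValueError("enough data to train a dedicated model instead")
    weights = tune_weights(models, domain_tokens)
    return combine(models, weights), weights


# Toy example: two existing domain models and four in-domain tokens.
banking = {"account": 0.5, "balance": 0.4, "flight": 0.1}
travel = {"flight": 0.6, "hotel": 0.3, "account": 0.1}
sample = ["flight", "hotel", "flight", "account"]
model, weights = build_domain_model([banking, travel], sample)
```

In this toy run the in-domain sample looks more like the travel model, so the tuned weights favor it. A real recognizer would apply the same idea to far richer acoustic or language models.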
20 Claims
1. A method of generating a domain-specific speech recognition model, the method comprising:
identifying a speech recognition domain;
combining a plurality of speech recognition models to yield a combined speech recognition model, each speech recognition model of the plurality of speech recognition models being from a respective speech recognition domain;
receiving an amount of data specific to the speech recognition domain, wherein the amount of data is less than a minimum threshold to create a new domain-specific model; and
tuning the combined speech recognition model for the speech recognition domain based on the amount of data.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A system for recognizing speech, the system comprising:
a processor;
a first module configured to control the processor to identify a speech recognition domain;
a second module configured to control the processor to combine a plurality of speech recognition models to yield a combined speech recognition model, each speech recognition model of the plurality of speech recognition models being from a respective speech recognition domain;
a third module configured to control the processor to receive an amount of data specific to the speech recognition domain, wherein the amount of data is less than a minimum threshold to create a new domain-specific model;
a fourth module configured to control the processor to tune the combined speech recognition model for the speech recognition domain based on the amount of data; and
a fifth module configured to control the processor to recognize speech using the combined speech recognition model.
Dependent claims: 12, 13, 14, 15
16. A non-transitory computer-readable storage medium storing instructions which, when executed by a computing device, cause the computing device to generate a speech recognition model for a specific recognition domain, the instructions comprising:
combining a plurality of speech recognition models to yield a combined speech recognition model, each speech recognition model of the plurality of speech recognition models being from a respective speech recognition domain;
receiving an amount of data specific to a speech recognition domain, wherein the amount of data is less than a minimum threshold to create a new domain-specific model; and
tuning the combined speech recognition model for the speech recognition domain based on the amount of data.
Dependent claims: 17, 18, 19, 20
Specification