Speech recognition semantic classification training
First Claim
1. A method for developing an automated speech input semantic classification system, the method comprising:
- defining a set of semantic classifications for classification of input speech utterances, each semantic classification representing a specific semantic classification of the speech input;
training the semantic classification system from training data having little or no in-domain manually transcribed training data;
operating the semantic classification system to assign input speech utterances to the defined semantic classifications;
collecting adaptation training data based on input speech utterances with manually assigned semantic labels; and
when the adaptation training data satisfies a pre-determined adaptation criteria, automatically retraining the semantic classification system based on the adaptation training data.
2 Assignments
0 Petitions
Accused Products
Abstract
An automated method is described for developing an automated speech input semantic classification system such as a call routing system. A set of semantic classifications is defined for classification of input speech utterances, where each semantic classification represents a specific semantic classification of the speech input. The semantic classification system is trained from training data having little or no in-domain manually transcribed training data, and then operated to assign input speech utterances to the defined semantic classifications. Adaptation training data based on input speech utterances is collected with manually assigned semantic labels. When the adaptation training data satisfies a pre-determined adaptation criteria, the semantic classification system is automatically retrained based on the adaptation training data.
217 Citations
18 Claims
-
1. A method for developing an automated speech input semantic classification system, the method comprising:
-
defining a set of semantic classifications for classification of input speech utterances, each semantic classification representing a specific semantic classification of the speech input; training the semantic classification system from training data having little or no in-domain manually transcribed training data; operating the semantic classification system to assign input speech utterances to the defined semantic classifications; collecting adaptation training data based on input speech utterances with manually assigned semantic labels; and when the adaptation training data satisfies a pre-determined adaptation criteria, automatically retraining the semantic classification system based on the adaptation training data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method of automated training for a language model in a semantic classification system, the method comprising:
-
performing first-pass recognition of available training data using an initial language model to develop first pass recognition results corresponding to the available training data; dividing the first pass recognition results into a plurality of first pass subsets; for each first pass subset, developing a subset language model from the first pass subset recognition results; performing second-pass recognition of the first pass subsets using cross-adapted subset language models to develop second pass recognition results corresponding to the available training data; and computing a final adapted language model from the second pass recognition results. - View Dependent Claims (10, 11, 12)
-
-
13. A method of automated training for a confidence engine in a semantic classification system, the method comprising:
-
operating an under-trained semantic classification system on a set of input speech utterances with manually assigned semantic labels to extract a set of corresponding feature data; and training the confidence engine on the extracted feature data.
-
-
14. A method of automated training for a confidence engine in a semantic classification system, the method comprising:
-
operating a semantic classification system on a set of input speech utterances with manually assigned semantic labels to extract a set of corresponding feature data; and training on the extracted feature data the confidence engine for a different semantic classification application.
-
-
15. A method of automated training for a router confidence engine in a call routing system, the method comprising:
-
defining a set of call routes for processing input speech utterances in a call routing system, each call route representing a specific call processing path; performing initial router confidence training based on an initial set of training data; operating the call routing system to assign input speech utterances to the defined call routes; collecting adaptation training data based on processed input speech utterances; and re-performing router confidence training based on the adaptation training data.
-
-
16. A method of training for an automated call routing system for improving system accuracy at early deployment stages, the method comprising:
-
defining a set of call routes for processing input speech utterances in a call routing system, each call route representing a specific call processing path; performing a first training of the call routing system from training data having little or no in-domain manually transcribed training data and a large amount of un-transcribed in-domain data; operating the call routing system to assign input speech utterances to the defined call routes; collecting adaptation training data based on manually processed input speech utterances; and automatically retraining the call routing system based on the adaptation training data to improve system accuracy.
-
-
17. A method of training for an automated call routing system for improving system accuracy at early deployment stages, the method comprising:
-
defining a set of call routes for processing input speech utterances in a call routing system, each call route representing a specific call processing path; training the call routing system based on a generic confidence engine; operating the call routing system to assign input speech utterances to the defined call routes; collecting adaptation training data based on manually processed input speech utterances; and automatically retraining the call routing system based on the adaptation training data to improve system accuracy.
-
-
18. A method of training for an automated call routing system for improving system accuracy at early deployment stages, the method comprising:
-
defining a set of call routes for processing input speech utterances in a call routing system, each call route representing a specific call processing path; training the call routing system based on a generic confidence engine using little or no in-domain manually transcribed training data and a large amount of un-transcribed in-domain data; operating the call routing system to assign input speech utterances to the defined call routes; collecting adaptation training data based on manually processed input speech utterances; and automatically retraining the call routing system based on the adaptation training data to improve system accuracy.
-
Specification