System and method for targeted tuning module of a speech recognition system
First Claim
1. A method of tuning a speech system comprising:
- accessing, from a database, information representing a plurality of utterances for at least one speech-enabled application, the plurality of utterances comprising at least a first type of utterance and a second type of utterance;
accessing, from the database, interpretive information representing an assigned interpretation for at least a portion of the plurality of utterances;
determining, by a training tool subsystem, an appropriate interpretation for the portion of the plurality of utterances;
comparing, by the training tool subsystem, the assigned interpretation for the portion of the plurality of utterances to the appropriate interpretation for the portion of the plurality of utterances;
determining, by the training tool subsystem, a frequency value for the second type of utterance that represents the percentage of occurrence of the second type of utterance in the plurality of utterances;
determining, by the training tool subsystem, that the speech-enabled application more accurately responds to the first type of utterance; and
electing, by the training tool subsystem, to apply a targeted tuning to the speech-enabled application to improve recognition of the second type of utterance when the frequency value for the second type of utterance is greater than a frequency threshold value.
16 Assignments
0 Petitions
Accused Products
Abstract
A system and method are disclosed for targeted tuning of a speech recognition system. A method incorporating teachings of the present disclosure may include deploying a speech recognition module to apply an appropriate interpretation to a plurality of utterance types. The method may also include accessing information representing a collection of recorded utterances and assigned interpretation for each of the plurality of recorded utterances. The assigned interpretation for each of the plurality of recorded utterances may then be compared to an accurate interpretation for each of the plurality of utterance, and a separate accuracy value may be determined for each of the plurality of utterance types. With some implementations, if the separate accuracy value for a given type of utterance is too low, a selection of utterances having the given type may be used to tune the speech recognition module.
-
Citations
26 Claims
-
1. A method of tuning a speech system comprising:
-
accessing, from a database, information representing a plurality of utterances for at least one speech-enabled application, the plurality of utterances comprising at least a first type of utterance and a second type of utterance; accessing, from the database, interpretive information representing an assigned interpretation for at least a portion of the plurality of utterances; determining, by a training tool subsystem, an appropriate interpretation for the portion of the plurality of utterances; comparing, by the training tool subsystem, the assigned interpretation for the portion of the plurality of utterances to the appropriate interpretation for the portion of the plurality of utterances; determining, by the training tool subsystem, a frequency value for the second type of utterance that represents the percentage of occurrence of the second type of utterance in the plurality of utterances; determining, by the training tool subsystem, that the speech-enabled application more accurately responds to the first type of utterance; and electing, by the training tool subsystem, to apply a targeted tuning to the speech-enabled application to improve recognition of the second type of utterance when the frequency value for the second type of utterance is greater than a frequency threshold value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A speech tuning system, comprising:
-
a repository comprising a memory to store a sample of captured utterances from an implemented speech-enabled application and an assigned utterance type for each of the captured utterances; an accuracy engine communicatively coupled to the repository and operable to determine if an assigned utterance type for a given captured utterance represents an accurate interpretation of the given captured utterance; a targeting engine communicatively coupled to the accuracy engine and operable to determine a first accuracy level of the speech-enabled application in identifying a first type of utterance and a second accuracy level of the speech-enabled application in identifying a second type of utterance; and a tuning engine operable to feed the speech-enabled application with a collection of utterances having the first type when the first accuracy level is lower than the second accuracy level and when a frequency of occurrence of the first type of utterance in the sample of captured utterances is greater than a frequency threshold value. - View Dependent Claims (18, 19, 20, 21, 22)
-
-
23. A method of tuning a speech-enabled application comprising:
-
deploying a speech-recognition module to apply an appropriate interpretation to a plurality of utterance types; accessing, from a database, information representing a collection of recorded utterances and assigned interpretation for each of the plurality of recorded utterances; comparing, by an accuracy engine, the assigned interpretation for each of the plurality of recorded utterances to an accurate interpretation for each of the plurality of utterances; determining, by the accuracy engine, a separate accuracy value for each of the plurality of utterance types; and feeding the speech-recognition module with a selection of utterances having a given type when the separate accuracy value for the given type is lower than an accuracy threshold value and when a frequency of occurrence of the given type of utterance in the plurality of recorded utterances is greater than a frequency threshold value. - View Dependent Claims (24, 25, 26)
-
Specification