Automated learning for speech-based applications
First Claim
1. A method comprising:
- receiving, by a computer-based speech recognition system, a speech input and a first task associated with the speech input, the first task being determined by processing the speech input using an original set of internal representations for the computer-based speech recognition system, the original set of internal representations comprising one or more parameters for recognizing speech in the speech input, the computer-based speech recognition system comprising at least one processor and at least one memory device;
comparing the first task with a second task associated with the speech input, the second task being identified by a human in response to hearing the speech input;
based at least in part on the comparison, modifying the original set of internal representations to create a modified set of internal representations;
processing, by the at least one processor of the computer-based speech recognition system and based at least partly on the modified set of internal representations, the speech input to identify a third task for the speech input;
comparing the third task to the second task to determine that the third task is within an acceptable margin of error to the second task;
in response to determining that the third task is within an acceptable margin of error to the second task, replacing the original set of internal representations of the computer-based speech recognition system with the modified set of internal representations;
receiving, by the computer-based speech recognition system, another speech input; and
processing, by the at least one processor and based at least partly on the modified set of internal representations, the other speech input to identify a particular task for the other speech input.
3 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for modifying a computer-based speech recognition system. A speech utterance is processed with the computer-based speech recognition system using a set of internal representations, which may comprise parameters for recognizing speech in a speech utterance, such as parameters of an acoustic model and/or a language model. The computer-based speech recognition system may perform a first task in response to the processed speech utterance. The utterance may also be provided to a human who performs a second task based on the utterance. Data indicative of the first task, performed by the computer system, is compared to data indicative of a second task, performed by the human in response to the speech utterance. Based on the comparison, the set of internal representations may be updated or modified to improve the speech recognition performance and capabilities of the speech recognition system.
-
Citations
19 Claims
-
1. A method comprising:
-
receiving, by a computer-based speech recognition system, a speech input and a first task associated with the speech input, the first task being determined by processing the speech input using an original set of internal representations for the computer-based speech recognition system, the original set of internal representations comprising one or more parameters for recognizing speech in the speech input, the computer-based speech recognition system comprising at least one processor and at least one memory device; comparing the first task with a second task associated with the speech input, the second task being identified by a human in response to hearing the speech input; based at least in part on the comparison, modifying the original set of internal representations to create a modified set of internal representations; processing, by the at least one processor of the computer-based speech recognition system and based at least partly on the modified set of internal representations, the speech input to identify a third task for the speech input; comparing the third task to the second task to determine that the third task is within an acceptable margin of error to the second task; in response to determining that the third task is within an acceptable margin of error to the second task, replacing the original set of internal representations of the computer-based speech recognition system with the modified set of internal representations; receiving, by the computer-based speech recognition system, another speech input; and processing, by the at least one processor and based at least partly on the modified set of internal representations, the other speech input to identify a particular task for the other speech input. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system, comprising:
-
one or more processors; and memory, communicatively coupled to the one or more processors, containing instructions that, when executed, configure the one or more processors to perform operations comprising; receiving a speech utterance and a first task for the speech utterance, the first task being determined by processing the speech utterance using an original set of statistical representations stored in the memory of the system, the original set of statistical representations comprising one or more parameters for recognizing speech in the speech utterance; comparing the first task with a second task for the speech utterance, the second task being identified by a human in response to hearing the speech utterance; based at least in part on the comparison, updating the original set of statistical representations to create an updated set of statistical representations; processing, with the updated set of statistical representations, the speech input to identify a third task for the speech utterance; comparing the third task to the second task to determine that the third task is within an acceptable margin of error to the second task; in response to determining that the third task is within an acceptable margin of error, replacing the original set of statistical representations of the system with the updated set of statistical representations; receiving another speech utterance; and processing, by the one or more processors and based at least partly on the updated set of statistical representations, the other speech utterance to identify a particular task for the other speech utterance. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. One or more non-transitory computer-readable storage media storing instructions that, when executed by one or more processors, configure the processor to perform acts comprising:
-
identifying a speech input and a first task associated with the speech input, the first task being determined by processing the speech input using an original set of internal representations of a computer-based speech recognition system, the original set of internal representations comprising one or more parameters for recognizing speech in the speech input; comparing the first task with a second task associated with the speech input, the second task being identified by a human in response to hearing the speech input; based at least in part on the comparison, modifying the original set of internal representations to create a modified set of internal representations; processing, by the one or more processors and using the modified set of internal representations, the speech input to identify a third task for the speech input; comparing the third task to the second task to determine whether the third task is within an acceptable margin of error to the second task; in response to determining that the third task is within an acceptable margin of error, replacing the original set of internal representations of the computer-based speech recognition system with the modified set of internal representations; receiving another speech input; and processing, by the one or more processors and using the modified set of internal representations, the other speech input to identify a particular task for the other speech input. - View Dependent Claims (16, 17, 18, 19)
-
Specification