REDUCING FALSE POSITIVES IN SPEECH RECOGNITION SYSTEMS
First Claim
1. A method comprising:
- receiving a spoken utterance;
processing the spoken utterance in a speech recognizer to generate a recognition result;
determining consistencies of one or more parameters of component sounds of the spoken utterance, wherein the parameters are selected from the group consisting of duration, energy, and pitch, and wherein each component sound of the spoken utterance has a corresponding value of said parameter; and
validating the recognition result based on the consistency of at least one of said parameters.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments of the present invention improve methods of performing speech recognition. In one embodiment, the present invention includes a method comprising receiving a spoken utterance, processing the spoken utterance in a speech recognizer to generate a recognition result, determining consistencies of one or more parameters of component sounds of the spoken utterance, wherein the parameters are selected from the group consisting of duration, energy, and pitch, and wherein each component sound of the spoken utterance has a corresponding value of said parameter, and validating the recognition result based on the consistency of at least one of said parameters.
25 Citations
23 Claims
-
1. A method comprising:
-
receiving a spoken utterance; processing the spoken utterance in a speech recognizer to generate a recognition result; determining consistencies of one or more parameters of component sounds of the spoken utterance, wherein the parameters are selected from the group consisting of duration, energy, and pitch, and wherein each component sound of the spoken utterance has a corresponding value of said parameter; and validating the recognition result based on the consistency of at least one of said parameters. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A system comprising:
-
a processor; and a memory, wherein the processor is configured to; receive a spoken utterance; process the spoken utterance in a speech recognizer to generate a recognition result; determine consistencies of one or more parameters of component sounds of the spoken utterance, wherein the parameters are selected from the group consisting of duration, energy, and pitch, and wherein each component sound of the spoken utterance has a corresponding value of said parameter; and validate the recognition result based on the consistency of at least one of said parameters.
-
Specification