Dynamic speech recognition data evaluation
First Claim
1. A method of dynamically providing speech recognition data from a client computing device to a server computing device, the method comprising:
- receiving audio input at the client computing device;
processing the audio input to generate the speech recognition data;
determining a first estimated confidence level for a first identified portion of the speech recognition data comprising a first feature vector, wherein the first estimated confidence level exceeds a predetermined confidence threshold that corresponds to a valid result;
based on determining that the first estimated confidence level corresponds to the valid result, continuing to process the speech recognition data with the first identified portion;
determining a second estimated confidence level for a second identified portion of the speech recognition data comprising a second feature vector, wherein the second estimated confidence level also exceeds the predetermined confidence threshold that corresponds to the valid result;
identifying at least one statistically improbable characteristic associated with the second feature vector;
determining that the client computing device comprises a first feature extractor;
comparing the first feature extractor of the client computing device with a second feature extractor utilized by the server computing device;
based on comparing the first feature extractor of the client computing device with the second feature extractor utilized by the server computing device, determining that the second feature extractor of the server computing device is different from the first feature extractor;
based on (1) determining the second estimated confidence level corresponds to the valid result, (2) identifying the at least one statistically improbable characteristic, and (3) determining that the server computing device comprises the second feature extractor different from the first feature extractor, providing the second feature vector to the server computing device for an evaluation of the second feature vector by the second, different feature extractor.
1 Assignment
0 Petitions
Accused Products
Abstract
Computing devices and methods for providing speech recognition data from one computing device to another device are disclosed. In one disclosed embodiment, audio input is received at a client device and processed to generate speech recognition data. An estimated confidence level is determined for a portion of the data, where the confidence level exceeds a predetermined confidence threshold corresponding to a valid result. At least one statistically improbable characteristic associated with the portion of data is identified. Based on identifying the statistically improbable characteristic, the portion of data is provided to a server computing device for evaluation.
38 Citations
16 Claims
-
1. A method of dynamically providing speech recognition data from a client computing device to a server computing device, the method comprising:
-
receiving audio input at the client computing device; processing the audio input to generate the speech recognition data; determining a first estimated confidence level for a first identified portion of the speech recognition data comprising a first feature vector, wherein the first estimated confidence level exceeds a predetermined confidence threshold that corresponds to a valid result; based on determining that the first estimated confidence level corresponds to the valid result, continuing to process the speech recognition data with the first identified portion; determining a second estimated confidence level for a second identified portion of the speech recognition data comprising a second feature vector, wherein the second estimated confidence level also exceeds the predetermined confidence threshold that corresponds to the valid result; identifying at least one statistically improbable characteristic associated with the second feature vector; determining that the client computing device comprises a first feature extractor; comparing the first feature extractor of the client computing device with a second feature extractor utilized by the server computing device; based on comparing the first feature extractor of the client computing device with the second feature extractor utilized by the server computing device, determining that the second feature extractor of the server computing device is different from the first feature extractor; based on (1) determining the second estimated confidence level corresponds to the valid result, (2) identifying the at least one statistically improbable characteristic, and (3) determining that the server computing device comprises the second feature extractor different from the first feature extractor, providing the second feature vector to the server computing device for an evaluation of the second feature vector by the second, different feature extractor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computing device, comprising:
-
a processor; a mass storage device; and a speech recognition program stored in the mass storage device, the speech recognition program comprising instructions executable by the processor to; receive audio input; process the audio input to generate speech recognition data; determine a first estimated confidence level for a first identified portion of the speech recognition data comprising a first feature vector, wherein the first estimated confidence level exceeds a predetermined confidence threshold that corresponds to a valid result; based on determining that the first estimated confidence level corresponds to the valid result, continue to process the speech recognition data with the first identified portion; determine a second estimated confidence level for a second identified portion of the speech recognition data comprising a second feature vector, wherein the second estimated confidence level also exceeds the predetermined confidence threshold that corresponds to the valid result; identify at least one statistically improbable characteristic associated with the second feature vector; determine that the client computing device comprises a first feature extractor; compare the first feature extractor of the computing device with a second feature extractor utilized by a different computing device; based on comparing the first feature extractor of the computing device with the second feature extractor utilized by the different computing device, determine that the second feature extractor of the different computing device is different from the first feature extractor; based on (1) determining the second estimated confidence level corresponds to the valid result, (2) identifying the at least one statistically improbable characteristic, and (3) determining that the different computing device comprises the second feature extractor different from the first feature extractor, provide the second feature vector to the different computing device for an evaluation of the second feature vector by the second, different feature extractor. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A computing device, comprising:
-
a processor; a mass storage device; and a speech recognition program stored in the mass storage device, the speech recognition program comprising instructions executable by the processor to; receive audio input; process the audio input to generate speech recognition data, wherein the speech recognition data comprises one or more of audio data, feature vectors, speech components, and recognized text; determine a first estimated confidence level for a first identified portion of the speech recognition data comprising a first feature vector, wherein the first estimated confidence level exceeds a predetermined confidence threshold that corresponds to a valid result; based on determining that the first estimated confidence level corresponds to the valid result, continue to process the speech recognition data with the first identified portion; determine a second estimated confidence level for a second identified portion of the speech recognition data comprising a second feature vector, wherein the second estimated confidence level also exceeds the predetermined confidence threshold that corresponds to the valid result; use one or more machine learning techniques to identify at least one statistically improbable characteristic associated with the second feature vector determine that the computing device comprises a first feature extractor; compare the first feature extractor of the computing device with a second feature extractor utilized by a different computing device; based on comparing the first feature extractor of the computing device with the second feature extractor utilized by the different computing device, determine that the second feature extractor of the different computing device is different from the first feature extractor; based on (1) determining the second estimated confidence level corresponds to the valid result, (2) identifying the at least one statistically improbable characteristic, and (3) determining that the different computing device comprises the second feature extractor different from the first feature extractor, provide the second feature vector to the different computing device for an evaluation of the second feature vector by the second, different feature extractor; receive from the different computing device weighting information derived from the evaluation of the second feature vector; and use the weighting information to bias a speech recognition engine of the speech recognition program. - View Dependent Claims (16)
-
Specification