Voice Recognition Accuracy in High Noise Conditions
Abstract
Systems and methods for voice recognition determine energy levels for speech and noise and generate adaptive thresholds based on the determined energy levels. The adaptive thresholds are applied to determine the presence of speech and to generate noise-dependent triggers for indicating the presence of speech during high-noise conditions. In an embodiment, the signal energy is averaged in the presence of speech and in the presence of background noise. Audio energy calculations may be made by averaging via a sliding window or via a memory filter.
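The two averaging approaches named in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names, the `window` and `alpha` parameters, and the reading of "memory filter" as a first-order recursive (exponential) average are all assumptions made for this example.

```python
from collections import deque


def sliding_window_energy(samples, window):
    """Mean squared amplitude over the most recent `window` samples (assumed definition)."""
    buf = deque(maxlen=window)
    total = 0.0
    out = []
    for x in samples:
        if len(buf) == window:
            total -= buf[0]  # this value is evicted by the append below
        buf.append(x * x)
        total += x * x
        out.append(total / len(buf))
    return out


def memory_filter_energy(samples, alpha=0.9):
    """Recursive average e[n] = alpha * e[n-1] + (1 - alpha) * x[n]^2 (assumed form)."""
    energy = 0.0
    out = []
    for x in samples:
        energy = alpha * energy + (1.0 - alpha) * x * x
        out.append(energy)
    return out
```

The sliding window stores `window` past values and forgets older samples entirely, while the recursive form stores a single value and lets older samples fade gradually at a rate set by `alpha`.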
20 Claims
1. A method of detecting a human utterance comprising:
receiving an audio signal containing noise;
determining a noise energy level and a speech energy level in the audio signal;
modifying a prior speech energy level threshold based at least in part on the determined noise energy level and speech energy level to generate a modified speech energy level threshold;
comparing the determined speech energy level to the modified speech energy level threshold; and
producing a presence signal indicating the presence of speech in the audio signal when the determined speech energy level exceeds the modified speech energy level threshold.
Dependent claims: 2-13
14. A portable electronic device comprising:
an audio input receiver;
a user interface output; and
a processor configured to receive an audio signal containing noise at the audio input receiver, determine a noise energy level and a speech energy level of the audio signal, modify a speech energy level threshold based on the determined noise energy level and speech energy level to generate a modified speech energy level threshold, compare the determined speech energy level to the modified speech energy level threshold, and produce a presence signal indicating the presence of speech in the audio signal when the determined speech energy level exceeds the modified speech energy level threshold.
Dependent claims: 15-19
20. A method of detecting human speech comprising:
setting a speech energy threshold to identify a speech energy level at which human speech is said to be present;
receiving an audio signal and determining a noise energy level and a speech energy level in the audio signal;
modifying the speech energy level threshold based on the noise energy level and speech energy level to generate a modified speech energy level threshold; and
comparing the speech energy level to the modified speech energy level threshold to detect the presence of speech in the audio signal.
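The steps recited in the method claims (determine energies, modify the prior threshold, compare, signal presence) can be illustrated with a short sketch. The claims do not specify how the threshold is modified, so the update rule below is purely hypothetical: it moves the prior threshold part of the way toward a target lying between the noise energy and the speech energy. The names `update_threshold` and `detect_speech` and the parameters `rate` and `fraction` are assumptions made for illustration.

```python
def update_threshold(prior_threshold, noise_energy, speech_energy,
                     rate=0.5, fraction=0.5):
    """Hypothetical rule: blend the prior threshold with a noise-dependent target."""
    # Target sits between the noise floor and the measured speech level.
    target = noise_energy + fraction * max(speech_energy - noise_energy, 0.0)
    # Move the prior threshold part of the way toward the target.
    return (1.0 - rate) * prior_threshold + rate * target


def detect_speech(prior_threshold, noise_energy, speech_energy):
    """Return (presence signal, modified threshold) for one measurement."""
    threshold = update_threshold(prior_threshold, noise_energy, speech_energy)
    present = speech_energy > threshold
    return present, threshold
```

For example, `detect_speech(1.0, 1.0, 4.0)` raises the threshold from 1.0 to 1.75 and reports speech, while `detect_speech(4.0, 2.0, 2.5)` lowers the threshold to 3.125 and reports no speech because 2.5 does not exceed it. Feeding the returned threshold back in as `prior_threshold` on the next measurement lets the trigger level track changing noise conditions.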
Specification