Communication device for screening speech recognizer input
First Claim
1. A communication device capable of screening speech recognizer input, comprising:
- at least one microprocessor having a speech/noise classifier, wherein the at least one microprocessor analyzes a speech signal to determine speech waveform parameters within a speech acquisition window, wherein the at least one microprocessor compares speech waveform parameters to determine whether an error exists in the signal format of the speech signal, and wherein the at least one microprocessor provides error information when an error exists in the signal format of the speech signal;
a microphone for providing the speech signal to the at least one microprocessor; and
means, operatively connected to the at least one microprocessor, for communicating the error information from the at least one microprocessor.
4 Assignments
0 Petitions
Accused Products
Abstract
A communication device capable of screening speech recognizer input includes a microprocessor (110) connected to communication interface circuitry (115), memory (120), audio circuitry (130), an optional keypad (140), a display (150), and a vibrator/buzzer (160). Audio circuitry (130) is connected to microphone (133) and speaker (135). Microprocessor (110) includes a speech/noise classifier and speech recognition technology. Microprocessor (110) analyzes a speech signal to determine speech waveform parameters within a speech acquisition window. Microprocessor (110) compares the speech waveform parameters to determine whether an error exists in the signal format of the speech signal. Microprocessor (110) informs the user when an error exists in the signal format and instructs the user how to correct the signal format to eliminate the error.
75 Citations
34 Claims
-
1. A communication device capable of screening speech recognizer input, comprising:
-
at least one microprocessor having a speech/noise classifier, wherein the at least one microprocessor analyzes a speech signal to determine speech waveform parameters within a speech acquisition window, wherein the at least one microprocessor compares speech waveform parameters to determine whether an error exists in the signal format of the speech signal, and wherein the at least one microprocessor provides error information when an error exists in the signal format of the speech signal;
a microphone for providing the speech signal to the at least one microprocessor; and
means, operatively connected to the at least one microprocessor, for communicating the error information from the at least one microprocessor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
wherein the at least one microprocessor provides instructions for correcting the error, and the communication device comprises means for communicating the instructions from the at least one microprocessor. -
3. A communication device capable of screening speech recognizer input according to claim 2, wherein the means for communicating the error information and the means for communicating the instructions are at least one communication output mechanism.
-
4. A communication device capable of screening speech recognizer input according to claim 3, wherein the at least one communication output mechanism is a speaker.
-
5. A communication device capable of screening speech recognizer input according to claim 3, wherein the at least one communication output mechanism is a display.
-
6. A communication device capable of screening speech recognizer input according to claim 1, wherein the error comprises the user speaking over the start of the speech acquisition window.
-
7. A communication device capable of screening speech recognizer input according to claim 1, wherein the error comprises the user speaking over the end of the speech acquisition window.
-
8. A communication device capable of screening speech recognizer input according to claim 1, wherein the speech signal comprises noise and speech communication.
-
9. A communication device capable of screening speech recognizer input according to claim 8, wherein the error comprises the noise obscuring the speech communication when a ratio of the speech communication to the noise is less than a threshold.
-
10. A communication device capable of screening speech recognizer input according to claim 1, wherein the means for communicating the error information comprises a speaker.
-
11. A communication device capable of screening speech recognizer input according to claim 1, wherein the means for communicating the error information is a display.
-
12. A communication device capable of screening speech recognizer input according to claim 1, wherein the means for communicating the error information comprises a vibrator/buzzer.
-
13. A communication device capable of screening speech recognizer input according to claim 1, wherein the means for communicating the error information comprises a display and a speaker.
-
14. A communication device capable of screening speech recognizer input according to claim 1, further comprising:
audio circuitry operatively connected to the microphone and at least one microprocessor, the audio circuitry having an analog-to-digital converter.
-
15. A communication device capable of screening speech recognizer input according to claim 14, wherein the error comprises at least one speech sample clipped by the analog-to-digital converter.
-
16. A communication device capable of screening speech recognizer input according to claim 1, further comprising a memory operatively connected to the at least one microprocessor.
-
17. A communication device capable of screening speech recognizer input according to claim 1,
wherein the at least one microprocessor has speech recognition technology, and wherein the at least one microprocessor uses the speech recognition technology to produce a speech recognition signal from the speech signal. -
18. A communication device capable of screening speech recognizer input according to claim 17, further comprising:
communication interface circuitry operatively connected to receive the speech recognition signal from the at least one microprocessor.
-
-
19. A method for screening speech recognizer input, comprising the steps of:
-
(a) analyzing a speech signal to determine speech waveform parameters within a speech acquisition window;
(b) comparing the speech waveform parameters to determine whether an error exists in the signal format of the speech signal; and
(c) when an error exists in the signal format of the speech signal, providing error information. - View Dependent Claims (20, 21, 22, 23, 24, 25)
(c1) deactivating the speech recognition process;
(c2) prompting the user to reactivate the speech recognition process with instructions to correct the error in the signal format of the speech signal.
-
-
22. A method for screening speech recognizer input according to claim 19, wherein step (c) further comprises the substeps of:
-
(c1) halting the speech recognition process;
(c2) prompting the user to provide a corrected speech signal with instructions for correcting the error in the signal format of the speech signal;
(c3) repeating steps (a), (b), and (c) for the corrected speech signal.
-
-
23. A method for screening speech recognizer input according to claim 19, wherein the speech waveform parameters in step (a) include speech energy, noise energy, start energy, end energy, and a percentage of clipped speech samples within the speech acquisition window.
-
24. A method for screening speech recognizer input according to claim 23, wherein the step (b) of comparing the speech waveform parameters comprises the substeps of:
-
(b1) determining whether the ratio of the speech energy to the start energy is less than a first threshold and whether the ratio of the start energy to the end energy is greater than a second threshold;
(b2) determining whether the ratio of the speech energy to the end energy is less than a third threshold and whether the ratio of the end energy to the start energy is greater than a fourth threshold;
(b3) determining whether the percentage of clipped speech samples is greater than a fifth threshold; and
(b4) determining whether the ratio of the speech energy to the noise energy is less than a sixth threshold.
-
-
25. A method for screening speech recognizer input according to claim 19, wherein the substeps (b1), (b2), (b3), and (b4) are performed sequentially to provide a hierarchical decision structure.
-
26. A radiotelephone, comprising:
-
at least one microprocessor for screening speech recognizer input, the at least one microprocessor having a speech/noise classifier, wherein the at least one microprocessor analyzes a speech signal to determine speech waveform parameters within a speech acquisition window, wherein the speech waveform parameters include speech energy, noise energy, start energy, end energy, and a percentage of clipped speech samples within the speech acquisition window, wherein the at least one microprocessor compares speech waveform parameters to determine whether an error exists in the signal format of the speech signal, wherein the at least one microprocessor provides error information when an error exists in the signal format of the speech signal, and wherein the at least one microprocessor provides instructions for correcting the error;
a microphone for providing the speech signal to the at least one microprocessor;
audio circuitry operatively connected to the microphone and at least one microprocessor, the audio circuitry having an analog-to-digital converter;
a memory operatively connected to the at least one microprocessor; and
means, operatively connected to the at least one microprocessor, for communicating error information and instructions for correcting the error. - View Dependent Claims (27, 28, 29, 30, 31, 32, 33, 34)
wherein the at least one microprocessor compares the speech waveform parameters to determine whether the ratio of the speech energy to the start energy is less than a first threshold and whether the ratio of the start energy to the end energy is greater than a second threshold, wherein the at least one microprocessor compares the speech waveform parameters to determine whether the ratio of the speech energy to the end energy is less than a third threshold and whether the ratio of the end energy to the start energy is greater than a fourth threshold, wherein the at least one microprocessor compares the speech waveform parameters to determine whether the percentage of clipped speech samples is greater than a fifth threshold, and wherein the at least one microprocessor compares the speech waveform parameters to determine whether the ratio of the speech energy to the noise energy is less than a sixth threshold. -
28. A radiotelephone according to claim 27, wherein the at least one microprocessor compares the speech waveform parameters according to the sequence in claim 27.
-
29. A radiotelephone according to claim 26, further comprising means for tactile data input.
-
30. A radiotelephone according to claim 29, wherein the means for tactile data input comprises a keypad.
-
31. A radiotelephone according to claim 26, wherein the means for communicating comprises a speaker.
-
32. A radiotelephone according to claim 26, wherein the means for communicating comprises a display.
-
33. A radiotelephone according to claim 26,
wherein the at least one microprocessor has speech recognition technology, and wherein the at least one microprocessor uses the speech recognition technology to produce a speech recognition signal from the speech signal. -
34. A radiotelephone according to claim 33, further comprising:
- communication interface circuitry operatively connected to receive the speech recognition signal from the at least one microprocessor.
-
Specification