Signal injection coupling into the human vocal tract for robust audible and inaudible voice recognition
Abstract
A means and method are provided for enhancing or replacing the natural excitation of the human vocal tract by artificial excitation means, wherein the artificially created acoustics present additional spectral, temporal, or phase data useful for (1) enhancing the machine recognition robustness of audible speech or (2) enabling more robust machine-recognition of relatively inaudible mouthed or whispered speech. The artificial excitation (a) may be arranged to be audible or inaudible, (b) may be designed to be non-interfering with another user's similar means, (c) may be used in one or both of a vocal content-enhancement mode or a complementary vocal tract-probing mode, and/or (d) may be used for the recognition of audible or inaudible continuous speech or isolated spoken commands.
59 Claims
1. A speech recognition system for processing sounds emanating from a living body's vocal tract, said sounds including sounds or sound components excited by at least one artificial exciter coupled, either directly or indirectly, into said vocal tract to introduce artificial excitations, said at least one artificial excitation modified or modulated by said vocal tract and emanating therefrom.
23. A speech recognition system for processing sounds emanating from a living body's vocal tract, said sounds including sounds excited by at least one artificial exciter coupled, either directly or indirectly, into said vocal tract to introduce artificial excitations, said at least one artificial excitation modified or modulated by said vocal tract and emanating therefrom, said speech recognition system including:
means for representation, modeling or classification, and searching of artificially excited speech signals or signal components;
means for representation, modeling or classification, and searching of naturally excited speech signals or signal components;
at least one of said searching means having access to at least one of an acoustic model, lexical model or language model; and
at least one training means.
Dependent claims: 24, 25, 26, 27, 28, 29, 30, 31, 32, 33
34. A method of performing speech recognition on silently-mouthed, silently-articulated or whispered speech from a living body's vocal tract, comprising:
providing a source of artificial acoustic excitation;
coupling said artificial acoustic excitation, directly or indirectly, into said vocal tract of a speaker;
allowing said artificial acoustic excitation to be modified or modulated by said speaker's mouthing, articulation or whispering action by a state of at least a portion of said speaker's vocal tract; and
performing speech-recognition processing on at least a portion of or component of said modified acoustic excitation to contribute to the identification of said speech or utterance.
Dependent claims: 35, 36, 37, 38, 39, 40, 41
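As an illustration only, the four steps of claim 34 can be sketched as a toy source-filter simulation. Everything concrete here is an assumption not found in the claim: the 8 kHz sample rate, the 100 Hz pulse train standing in for the exciter, a single two-pole resonator standing in for the vocal tract, the hypothetical `TEMPLATES` frequencies, and a Goertzel energy comparison standing in for the speech-recognition processing.

```python
import math

SAMPLE_RATE = 8000  # Hz; assumed for this sketch


def pulse_train(n_samples, pitch_hz=100.0, sample_rate=SAMPLE_RATE):
    """Artificial excitation: unit impulses at a fixed rate (stand-in for an exciter)."""
    period = int(round(sample_rate / pitch_hz))
    return [1.0 if i % period == 0 else 0.0 for i in range(n_samples)]


def resonator(signal, center_hz, bandwidth_hz=100.0, sample_rate=SAMPLE_RATE):
    """Two-pole resonator: a toy model of one vocal-tract resonance."""
    r = math.exp(-math.pi * bandwidth_hz / sample_rate)
    a1 = 2.0 * r * math.cos(2.0 * math.pi * center_hz / sample_rate)
    a2 = -r * r
    y1 = y2 = 0.0
    out = []
    for x in signal:
        y = x + a1 * y1 + a2 * y2
        out.append(y)
        y2, y1 = y1, y
    return out


def goertzel_power(signal, freq_hz, sample_rate=SAMPLE_RATE):
    """Squared spectral magnitude of `signal` at one frequency (Goertzel recurrence)."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq_hz / sample_rate)
    s1 = s2 = 0.0
    for x in signal:
        s0 = x + coeff * s1 - s2
        s2, s1 = s1, s0
    return s1 * s1 + s2 * s2 - coeff * s1 * s2


# Hypothetical articulation templates: one resonance frequency per mouthed sound.
TEMPLATES = {"ee": 300.0, "ah": 700.0, "oo": 1100.0}


def classify_articulation(signal, templates):
    """Pick the template whose resonance frequency carries the most energy."""
    return max(templates, key=lambda name: goertzel_power(signal, templates[name]))


# The exciter output is shaped by the (simulated) tract, then classified.
modified = resonator(pulse_train(800), center_hz=700.0)
print(classify_articulation(modified, TEMPLATES))
```

The speaker supplies no voicing in this sketch; only the shape of the simulated tract changes the excitation, which is the situation the claim addresses for mouthed or whispered speech.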
42. A method of enhancing the accuracy or speed of speech recognition of the speech or utterances emanating from a living body's vocal tract, comprising:
coupling artificial acoustic excitation, directly or indirectly, into said vocal tract of a speaker;
allowing said speaker to audibly speak;
at least during portions of said audible speech, allowing said artificial acoustic excitation to be modified or modulated by said speaker's mouthing, articulation or whispering action by a state of at least a portion of said speaker's vocal tract to provide an artificially excited output of said speaker; and
performing speech-recognition processing on at least a portion of said artificially excited output of said speaker, to thereby provide enhanced accuracy or speed of said speech or utterance recognition.
Dependent claims: 43, 44, 45, 46, 47, 48, 49, 50
51. A method of minimizing degradation in the accuracy or speed of speech-recognition of a first speaker's speech or utterance caused by at least one second interfering background speaker or voice comprising:
coupling artificial acoustic excitation, directly or indirectly, into the vocal tract of the first speaker;
allowing said first speaker to audibly speak in the potential acoustic presence of said at least one second background speaker, thereby modifying or modulating said first speaker's artificial acoustic excitation as well as said first speaker's natural excitation; and
processing at least a portion of said first speaker's artificially-produced acoustic output by a speech recognition means;
wherein said first speaker's output is known to be that of said first speaker due to its identifiable artificial acoustic content;
or wherein said second speaker's interfering output is ignored or rejected because it does not contain the first speaker's identifying artificial excitations.
Dependent claims: 52, 53, 54, 55, 56, 57
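The rejection logic in the two "wherein" clauses of claim 51 can be sketched as a gate that forwards an audio frame to the recognizer only when it carries the first speaker's artificial excitation. The claim does not say how that content is identified; the fixed 3 kHz signature tone (`SIGNATURE_HZ`), the 8 kHz sample rate, the sine-wave stand-ins for speech, and the 0.02 threshold are all assumptions made for this sketch.

```python
import math

SAMPLE_RATE = 8000     # Hz; assumed
SIGNATURE_HZ = 3000.0  # hypothetical tone identifying the first speaker's exciter


def tone(freq_hz, amplitude, n_samples, sample_rate=SAMPLE_RATE):
    """Sine wave used here as a stand-in for speech or exciter output."""
    return [amplitude * math.sin(2.0 * math.pi * freq_hz * i / sample_rate)
            for i in range(n_samples)]


def mix(*signals):
    """Sample-wise sum of equal-length signals."""
    return [sum(samples) for samples in zip(*signals)]


def goertzel_power(frame, freq_hz, sample_rate=SAMPLE_RATE):
    """Squared spectral magnitude of `frame` at one frequency (Goertzel recurrence)."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq_hz / sample_rate)
    s1 = s2 = 0.0
    for x in frame:
        s0 = x + coeff * s1 - s2
        s2, s1 = s1, s0
    return s1 * s1 + s2 * s2 - coeff * s1 * s2


def signature_fraction(frame, freq_hz=SIGNATURE_HZ):
    """Approximate share of the frame's energy at the signature frequency (0 to 1)."""
    energy = sum(x * x for x in frame)
    if energy == 0.0:
        return 0.0
    return 2.0 * goertzel_power(frame, freq_hz) / (len(frame) * energy)


def frames_to_recognize(frames, threshold=0.02):
    """Keep frames that carry the signature; drop frames that lack it."""
    return [frame for frame in frames if signature_fraction(frame) >= threshold]


# First speaker: natural voicing plus the exciter's signature tone.
first_speaker = mix(tone(200.0, 1.0, 400), tone(SIGNATURE_HZ, 0.3, 400))
# Background speaker: natural voicing only.
background = tone(300.0, 1.0, 400)

kept = frames_to_recognize([first_speaker, background])
print(len(kept))
```

A real exciter signal would be modulated by the vocal tract rather than passed through unchanged, so a practical detector would need a signature that survives that modulation; the sketch only shows the accept-or-reject decision.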
58. A method of providing a speech-recognition based security function for user identification or validation comprising:
(a) coupling, directly or indirectly, an artificial acoustic exciter into a user's vocal tract;
(b) having the user speak, articulate or mouth an utterance wherein said utterance, at least in part, comprises a portion of the artificial excitation as-modified or modulated by said user's vocal tract;
(c) applying speech recognition processing means to identify or validate said user, said means processing at least a portion of said artificially excited speech, utterance or signal-representation thereof; and
(d) storing information relating to at least one characteristic of said user's vocal tract, or of its function, being used in said user identification or validation process.
Dependent claims: 59
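Steps (c) and (d) of claim 58 can be pictured as an enroll-then-verify store. The sketch below is only one possible reading: it assumes the stored "characteristic" is a short vector of band energies measured from the artificially excited utterance, and it compares vectors by cosine similarity against a hypothetical 0.95 threshold. The class name `VocalTractStore` and the example vectors are invented for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class VocalTractStore:
    """Hypothetical store of per-user vocal-tract characteristics (step d)."""

    def __init__(self, threshold=0.95):
        self.threshold = threshold
        self._profiles = {}

    def enroll(self, user_id, band_energies):
        """Store the characteristic measured during enrollment."""
        self._profiles[user_id] = list(band_energies)

    def verify(self, user_id, band_energies):
        """Validate a claimed identity against the stored characteristic (step c)."""
        stored = self._profiles.get(user_id)
        if stored is None:
            return False
        return cosine_similarity(stored, band_energies) >= self.threshold


store = VocalTractStore()
store.enroll("alice", [1.0, 0.2, 0.1])
print(store.verify("alice", [0.9, 0.25, 0.1]))  # similar response profile
print(store.verify("alice", [0.1, 0.2, 1.0]))   # very different response profile
```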
Specification