Speech activated control system and related methods
First Claim
1. A speech activated control system for controlling aerial vehicle components, comprising:
- a host processor having memory and positioned in communication with a database for storing speech recognition models; and
speech actuated command program product at least partially stored in the memory of the host processor and including instructions that when executed by the host processor cause the processor to perform the operations of;
forming a digitized user-speech template representing a command annunciation,dividing the user-speech template into a plurality of time slices,subdividing each separate one of the plurality of time slices into a plurality of bins each associated with a corresponding different one of a plurality of frequency ranges,performing a noise reduction and speech enhancement on the digitized user-speech template to include;
estimating noise power for each separate set of bins having a same frequency range across the plurality of time slices to thereby provide a plurality of frequency range-specific noise power estimates,equalizing energy values of each set of bins having a same frequency range across the plurality of time slices responsive to the respective frequency range-specific noise power estimate, andthresholding each equalized bin by a predetermined value to remove noise from within and around speech formants of the user-speech template,developing a set of feature vectors representing energy of a frequency content of the digitized user-speech template to thereby determine a unique pattern identifying the command annunciation,applying a speech recognition engine to the set of feature vectors to form at least one speech recognition model associated with the command annunciation,associating an index with the at least one speech recognition model associated with the command annunciation, andstoring the at least one speech recognition model and the associated index.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech activated control system for controlling aerial vehicle components, program product, and associated methods are provided. The system can include a host processor adapted to develop speech recognition models and to provide speech command recognition. The host processor can be positioned in communication with a database for storing and retrieving speech recognition models. The system can include an avionic computer in communication with the host processor and adapted to provide command function management, a display and control processor in communication with the avionic computer adapted to provide a user interface between a user and the avionic computer, and a data interface positioned in communication with the avionic computer and the host processor provided to divorce speech command recognition functionality from vehicle or aircraft-related speech-command functionality. The system can also include speech actuated command program product at least partially stored in the memory of the host processor and adapted to provide the speech recognition model training and speech recognition model recognition functionality.
247 Citations
31 Claims
-
1. A speech activated control system for controlling aerial vehicle components, comprising:
-
a host processor having memory and positioned in communication with a database for storing speech recognition models; and speech actuated command program product at least partially stored in the memory of the host processor and including instructions that when executed by the host processor cause the processor to perform the operations of; forming a digitized user-speech template representing a command annunciation, dividing the user-speech template into a plurality of time slices, subdividing each separate one of the plurality of time slices into a plurality of bins each associated with a corresponding different one of a plurality of frequency ranges, performing a noise reduction and speech enhancement on the digitized user-speech template to include; estimating noise power for each separate set of bins having a same frequency range across the plurality of time slices to thereby provide a plurality of frequency range-specific noise power estimates, equalizing energy values of each set of bins having a same frequency range across the plurality of time slices responsive to the respective frequency range-specific noise power estimate, and thresholding each equalized bin by a predetermined value to remove noise from within and around speech formants of the user-speech template, developing a set of feature vectors representing energy of a frequency content of the digitized user-speech template to thereby determine a unique pattern identifying the command annunciation, applying a speech recognition engine to the set of feature vectors to form at least one speech recognition model associated with the command annunciation, associating an index with the at least one speech recognition model associated with the command annunciation, and storing the at least one speech recognition model and the associated index. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method to provide speech-activated control of aerial vehicle components, the method comprising the steps of:
-
(a) sampling a speech signal representing speech to define sampled data; (b) performing an integrity check on the sampled data to identify when the speech is below a preselected standard; (c) aligning the sampled data in an observation window to enhance analysis of the speech; (d) performing noise reduction processing to remove noise from within and around speech formants, to include; estimating noise power for each separate set of a plurality of bins having a same frequency range across a plurality of time slices of the sampled data to thereby provide a plurality of frequency range-specific noise power estimates, equalizing energy values of each set of bins having a same frequency range across the plurality of time slices responsive to the respective frequency range-specific noise power estimate, and thresholding each equalized bin by a parameterized threshold value to remove noise from within and around speech formants of the sampled data; (e) developing a set of feature vectors representing energy of a frequency content of the sampled data to thereby determine a unique pattern; and (f) applying a speech recognition engine to the set of feature vectors to perform one of the following functions;
forming at least one speech recognition model associated with the speech signal, and matching the speech signal to the at least one speech recognition model. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A method to provide speech-activated control of aerial vehicle components, the method comprising the steps of:
-
(a) performing noise reduction processing on sampled speech data representing a command annunciation to remove noise from within and around speech formants of the sampled speech, to include; estimating noise power for each separate set of a plurality of bins having a same frequency range across a plurality of time slices of the sampled speech data to thereby provide a plurality of frequency range-specific noise power estimates, equalizing energy values of each set of bins having a same frequency range across the plurality of time slices responsive to the respective frequency range-specific noise power estimate, and thresholding each equalized bin by a parameterized threshold value to remove noise from within and around speech formants of the sampled speech data; (b) developing a set of feature vectors representing energy of a frequency content of the sampled speech data to thereby determine a unique pattern identifying the command annunciation; (c) applying a speech recognition engine to the set of feature vectors to thereby form at least one speech recognition model; (d) associating an index with the at least one speech recognition model associated with the command annunciation; and (e) storing the at least one speech recognition model and the assigned index. - View Dependent Claims (28, 29, 30, 31)
-
Specification