Method for robust voice recognation by analyzing redundant features of source signal
First Claim
1. A method for processing speech signals, comprising operations of:
- applying a primary transformation to a digital input speech signal to extract primary features therefrom;
applying each of at least one secondary transformation to one of the input speech signal and the primary features to yield secondary features statistically dependant on the primary features;
applying at least one predetermined function to form a combined signal comprising a combination of the primary features with the secondary features;
generating a recognition answer by pattern matching the combined signal against predetermined voice recognition templates.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for processing digitized speech signals by analyzing redundant features to provide more robust voice recognition. A primary transformation is applied to a source speech signal to extract primary features therefrom. Each of at least one secondary transformation is applied to the source speech signal or extracted primary features to yield at least one set of secondary features statistically dependant on the primary features. At least one predetermined function is then applied to combine the primary features with the secondary features. A recognition answer is generated by pattern matching this combination against predetermined voice recognition templates.
145 Citations
28 Claims
-
1. A method for processing speech signals, comprising operations of:
-
applying a primary transformation to a digital input speech signal to extract primary features therefrom;
applying each of at least one secondary transformation to one of the input speech signal and the primary features to yield secondary features statistically dependant on the primary features;
applying at least one predetermined function to form a combined signal comprising a combination of the primary features with the secondary features;
generating a recognition answer by pattern matching the combined signal against predetermined voice recognition templates. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A signal-bearing medium tangibly embodying a program of machine-readable instructions executable by a digital processing apparatus to perform operations for processing speech signals, said operations comprising:
-
applying a primary transformation to a digital input speech signal to extract primary features therefrom;
applying each of at least one secondary transformation to one of the input speech signal and the primary features to yield secondary features statistically dependant on the primary features;
applying at least one predetermined function to form a combined signal comprising a combination of the primary features with the secondary features;
generating a recognition answer by pattern matching the combined signal against predetermined voice recognition templates. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. Circuitry of multiple interconnected electrically conductive elements configured to perform operations to process speech signals, the operations comprising:
-
applying a primary transformation to a digital input speech signal to extract primary features therefrom;
applying each of at least one secondary transformation to one of the input speech signal and the primary features to yield secondary features statistically dependant on the primary features;
applying at least one predetermined function to form a combined signal comprising a combination of the primary features with the secondary features;
generating a recognition answer by pattern matching the combined signal against predetermined voice recognition templates. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
-
25. A voice recognition system, comprising:
-
a primary feature extractor applying a primary function to extract primary features from a digital input speech signal;
at least one secondary transformation module each producing secondary features statistically dependent on the primary features by applying a secondary function to an input comprising one of the following;
the input speech signal, the primary features;
a feature combination module coupled to the primary feature extractor and each of the secondary transformation modules to apply one or more predetermined functions to combine the primary features with the secondary features forming a combined signal;
a statistical modeling engine, coupled to the feature combination module to generate a recognition answer by pattern matching the combined signal against predetermined voice recognition templates.
-
-
26. A voice recognition system, comprising:
-
primary feature extractor means for applying a primary function to extract primary features from a digital input speech signal;
secondary transformation means for producing secondary features statistically dependent on the primary features by applying at least one secondary function to an input comprising one of the following;
the input speech signal, the primary features;
feature combination means for applying one or more predetermined functions to combine the primary features with the secondary features forming a combined signal;
statistical modeling means for generating a recognition answer by pattern matching the combined features against predetermined voice recognition templates.
-
-
27. A wireless communications device, comprising:
-
a transceiver coupled to an antenna;
a speaker;
a microphone;
a user interface;
a manager coupled to components including the transceiver, speaker, microphone, and user interface to manage operation of the components, the manager including a voice recognition system configured to perform operations comprising;
applying a primary transformation to a digital input speech signal to extract primary features therefrom;
applying each of at least one secondary transformation to one of the input speech signal and the primary features to yield secondary features statistically dependant on the primary features;
applying at least one predetermined function to form a combined signal comprising a combination of the primary features with the secondary features;
generating a recognition answer by pattern matching the combined signal against predetermined voice recognition templates.
-
-
28. A wireless communications device, comprising:
-
a transceiver coupled to an antenna;
a speaker;
a microphone;
a user interface;
means for managing operation of the transceiver, speaker, microphone, and user interface;
the means for managing further including means for performing voice recognition by;
applying a primary transformation to a digital input speech signal to extract primary features therefrom;
applying each of at least one secondary transformation to one of the input speech signal and the primary features to yield secondary features statistically dependant on the primary features;
applying at least one predetermined function to form a combined signal comprising a combination of the primary features with the secondary features;
generating a recognition answer by pattern matching the combined signal against predetermined voice recognition templates.
-
Specification