Speech recognition faciliation method and apparatus
First Claim
1. A method comprising:
- providing information having varying amplitude in a spectral domain to be speech-recognized;
adding masking information to the information as a function, at least in part, of the amplitude of the information to provide modified information
4 Assignments
0 Petitions
Accused Products
Abstract
In a speech recognition platform, a masking unit 17 can be utilized to mask noisy content within an audio sample. By masking such noise in a dynamic but predictable manner, valid content can be preserved while largely overcoming the random and detrimental presence of noise. In one embodiment, speech recognition features are extracted pursuant to a hierarchical process that localizes, at least to some extent, some of the resultant features from other resultant features. As a result, noisy or otherwise unreliable information corresponding to the audio sample will not be leveraged unduly across the entire feature set. In another embodiment, an average energy value for processed samples is calculated with individual energy values that are downwardly weighted when such individual energy values are likely representative of noise.
10 Citations
26 Claims
-
1. A method comprising:
-
providing information having varying amplitude in a spectral domain to be speech-recognized;
adding masking information to the information as a function, at least in part, of the amplitude of the information to provide modified information - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A device comprising:
-
an information signal input;
a spectral transformation unit having an input operably coupled to the information signal input and having an output providing a spectrally transformed information signal; and
a masking unit having an input operably coupled to the output of the spectral transformation unit and having an output providing a modified spectrally transformed information signal wherein at least some amplitude valleys are at least partially masked. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A device comprising:
-
an information signal input;
a localized speech recognition feature extraction unit having an input operably coupled to the information signal input and an output providing localized speech recognition features. - View Dependent Claims (25, 26)
-
Specification