Multi-source phoneme classification for noise-robust automatic speech recognition
First Claim
Patent Images
1. A method of processing an audio signal comprising:
- computing 600 spectral values on a logarithmic frequency scale from the audio signal;
separating the 600 spectral values into a plurality of streams which group sounds from a same source prior to classification;
analyzing each separated stream to determine phoneme-level classification; and
outputting one or more words of the audio signal.
4 Assignments
0 Petitions
Accused Products
Abstract
A system and method are disclosed for processing an audio signal including separating the audio signal into a plurality of streams which group sounds from a same source prior to classification and analyzing each separate stream to determine phoneme-level classification. One or more words of the audio signal may then be outputted.
42 Citations
15 Claims
-
1. A method of processing an audio signal comprising:
-
computing 600 spectral values on a logarithmic frequency scale from the audio signal; separating the 600 spectral values into a plurality of streams which group sounds from a same source prior to classification; analyzing each separated stream to determine phoneme-level classification; and outputting one or more words of the audio signal. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
Specification