Perceptual phonetic feature speech recognition system and method
Abstract
A complete system and method for accurate and robust speech recognition based on the application of three perceptual processing techniques to the speech Fourier spectrum to achieve a robust perceptual spectrum and the accurate recognition of that perceptual spectrum by projecting the perceptual spectrum onto a set of reference vowel spectrum vectors for input to a speech recognizer. The invention comprises a perceptual speech processor for perceptually processing the input speech spectrum vector to generate a perceptual spectrum, a storage device for storing a plurality of reference spectrum vectors and a phonetic feature mapper, coupled to said perceptual speech processor and to said storage device, for mapping said perceptual spectrum onto said plurality of reference spectrum vectors.
211 Citations
16 Claims
1. A speech processing system for processing an input speech spectrum vector comprising:
- a perceptual speech processor for perceptually processing the input speech spectrum vector to generate a perceptual spectrum;
- a storage device for storing a plurality of reference spectrum vectors; and
- a phonetic feature mapper, coupled to said perceptual speech processor and to said storage device, for mapping said perceptual spectrum onto said plurality of reference spectrum vectors.
Dependent claims: 2, 3, 4, 5

6. A speech recognition system for recognizing a sampled speech spectrum vector comprising:
- a fast Fourier transform analyzer for generating Fourier transforms of the sampled speech spectrum vector;
- a perceptual speech processor, coupled to said fast Fourier transform analyzer, for processing said Fourier transforms to generate a perceptual spectrum;
- a storage device for storing a plurality of reference spectrum vectors;
- a phonetic feature mapper, coupled to said perceptual speech processor and to said storage device, for mapping said perceptual spectrum to said plurality of reference spectrum vectors, thereby selecting at least one reference vector of greatest similarity to said perceptual spectrum; and
- a continuous HMM recognizer, coupled to said phonetic feature mapper, for recognizing said at least one reference vector.
Dependent claims: 7, 8
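
The mapping and selection step recited in claim 6 can be illustrated with a minimal sketch. The claim does not name a similarity measure, so cosine similarity is used here purely as an assumption. The function names, the vowel labels and the four-element toy vectors are hypothetical and do not come from the patent.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an all-zero vector is treated as similar to nothing
    return dot / (norm_a * norm_b)

def map_to_references(perceptual_spectrum, references):
    """Score the perceptual spectrum against every stored reference vector."""
    return {label: cosine_similarity(perceptual_spectrum, ref)
            for label, ref in references.items()}

def select_best(scores):
    """Return the label of the reference vector with the greatest similarity."""
    return max(scores, key=scores.get)

# Hypothetical stored reference spectrum vectors (toy values).
references = {
    "a": [1.0, 0.0, 0.0, 0.0],
    "i": [0.0, 1.0, 0.0, 0.0],
    "u": [0.0, 0.0, 1.0, 1.0],
}

# Hypothetical perceptual spectrum for one frame.
spectrum = [0.1, 0.9, 0.2, 0.0]

scores = map_to_references(spectrum, references)
best = select_best(scores)
```

In this sketch the full `scores` dictionary plays the role of the mapper output, and `best` is the single reference label that would be handed on to a recognizer. A real system would presumably pass a sequence of such outputs, one per frame, to the continuous HMM recognizer.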

9. A method for speech processing an input speech spectrum vector comprising the steps of:
- perceptually processing the input speech spectrum vector to generate a perceptual spectrum;
- storing a plurality of reference spectrum vectors; and
- mapping said perceptual spectrum onto said plurality of reference spectrum vectors.
Dependent claims: 10, 11, 12, 13, 15, 16

14. A method for speech recognition of a sampled input speech spectrum vector, comprising the steps of:
- generating Fourier transforms of the sampled input speech spectrum vector utilizing a fast Fourier transform analyzer;
- generating a perceptual spectrum by processing said Fourier transforms;
- storing a plurality of reference spectrum vectors;
- mapping said perceptual spectrum onto said plurality of reference spectrum vectors;
- selecting at least one reference vector of greatest similarity to said perceptual spectrum; and
- recognizing said at least one reference vector utilizing a continuous HMM recognizer.
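
The first step of claim 14, generating Fourier transforms of the sampled input, can be sketched as follows. This is an illustration only: a naive O(n²) discrete Fourier transform stands in for the fast Fourier transform analyzer (it yields the same values as an FFT, only more slowly), and the frame length, test tone and function name are assumptions that do not appear in the patent.

```python
import cmath
import math

def magnitude_spectrum(frame):
    """Magnitudes of DFT bins 0..n/2 for a real-valued frame (naive DFT)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        # Correlate the frame with a complex exponential at bin k.
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n)
                  for t in range(n))
        spectrum.append(abs(acc))
    return spectrum

# Hypothetical 16-sample frame holding a sine with exactly 2 cycles.
frame = [math.sin(2 * math.pi * 2 * t / 16) for t in range(16)]

spectrum = magnitude_spectrum(frame)

# Index of the strongest bin; for this test tone it is bin 2.
peak_bin = max(range(len(spectrum)), key=spectrum.__getitem__)
```

The resulting magnitude vector is the kind of Fourier spectrum that the later steps would turn into a perceptual spectrum. The perceptual processing itself is not sketched here because the claims do not spell out its details.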
Specification