Variable-subframe-length speech-coding classes derived from wavelet-transform parameters
First Claim
1. A method for classifying speech signals comprising the steps of:
- segmenting the speech signal into frames;
calculating a wavelet transformation;
obtaining a set of parameters (P1 -P3) from the wavelet transformation;
dividing the frames into subframes using a finite-state model which is a function of the set of parameters;
classifying each of the subframes into one of a plurality of speech coding classes.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and a device are described for classifying speech on the basis of the wavelet transformation for low-bit-rate speech coding processes. The method and the device permit a more robust classifier of speech signals for signal-matched control of speech coding processes in order to reduce the bit rate without affecting the speech quality or to increase the quality at the same bit rate. The method provides that, after segmenting the speech signal, a wavelet transformation is calculated for each frame, from which a set of parameters is determined with the help of adaptive thresholds. The parameters control a finite-state model, which subdivides the frames into shorter subframes if required, and classifies each subframe into one of several classes typical for speech coding. The speech signal is classified on the basis of the wavelet transformation for each time frame. Thus both a high time resolution (location of pulses) and frequency resolution (good mean values) can be achieved. This method and the classifier are therefore especially well suited for the control and selection of code books in a low-bit-rate speech coder. They also have a low sensitivity to background noise and low complexity.
43 Citations
11 Claims
-
1. A method for classifying speech signals comprising the steps of:
-
segmenting the speech signal into frames; calculating a wavelet transformation; obtaining a set of parameters (P1 -P3) from the wavelet transformation; dividing the frames into subframes using a finite-state model which is a function of the set of parameters; classifying each of the subframes into one of a plurality of speech coding classes. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for classifying speech signals comprising the steps of:
-
segmenting the speech signal into frames; calculating a wavelet transformation; obtaining a set of parameters (P1 -P3) from the wavelet transformation; dividing the frames into subframes based on the set of parameters, so that the subframes are classified as either voiceless, voicing onsets, or voiced.
-
-
11. A speech classifier comprising:
-
a segmentator for segmenting input speech to produce frames; a wavelet processor for calculating a discrete wavelet transformation for each segment and determining a set of parameters (P1 -P3) with the help of adaptive thresholds; and a finite-state model processor, which receives the set of parameters as inputs and in turn divides the speech frames into subframes and classifies each of these subframes into one of a plurality of speech coding classes.
-
Specification