Method of encoding a speech signal
Abstract
A method of coding speech is disclosed in which the speech signal is sampled and divided into a plurality of frames upon which multi-band excitation analysis is performed to derive a fundamental pitch, a plurality of voiced/unvoiced decisions and amplitudes of harmonics within the bands. The harmonic amplitudes are split into a first group of a fixed number of harmonics and a second group of the remainder of harmonics and these are separately transformed using the Discrete Cosine Transform for the first group and Non-Square Transform for the second group, the resulting transform coefficients being vector quantized to form a plurality of output indices. A decoding method and apparatus for performing both encoding and decoding methods are also disclosed.
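The split-and-transform step described in the abstract can be sketched as follows. This is only an illustrative sketch, not the patented implementation: the group size `FIRST_GROUP_SIZE`, the output dimension `SECOND_GROUP_DIM`, the codebooks, and all function names are hypothetical, and the Non-Square Transform is approximated here by a truncated (or zero-padded) cosine matrix, which is an assumption rather than the transform defined in the specification. The sketch also assumes each frame has at least `FIRST_GROUP_SIZE` harmonics.

```python
import math

FIRST_GROUP_SIZE = 8   # hypothetical: the source only says "a fixed number"
SECOND_GROUP_DIM = 4   # hypothetical fixed coefficient count for the remainder


def cosine_matrix(rows, cols):
    """First `rows` orthonormal DCT-II basis vectors of length `cols`.

    Rows with index >= cols are all zero, so the matrix may be non-square.
    This is a stand-in for the Non-Square Transform, not its definition.
    """
    mat = []
    for k in range(rows):
        if k >= cols:
            mat.append([0.0] * cols)
            continue
        scale = math.sqrt((1.0 if k == 0 else 2.0) / cols)
        mat.append([scale * math.cos(math.pi * (i + 0.5) * k / cols)
                    for i in range(cols)])
    return mat


def mat_vec(mat, vec):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * v for a, v in zip(row, vec)) for row in mat]


def nearest_index(codebook, vec):
    """Vector quantization: index of the entry with least squared error."""
    return min(range(len(codebook)),
               key=lambda j: sum((c - v) ** 2
                                 for c, v in zip(codebook[j], vec)))


def encode_amplitudes(amps, codebook_1, codebook_2):
    """Split harmonic amplitudes, transform each group, quantize both."""
    first = amps[:FIRST_GROUP_SIZE]          # fixed number of harmonics
    rest = amps[FIRST_GROUP_SIZE:]           # remainder, variable length
    coeffs_1 = mat_vec(cosine_matrix(len(first), len(first)), first)
    coeffs_2 = mat_vec(cosine_matrix(SECOND_GROUP_DIM, len(rest)), rest)
    return nearest_index(codebook_1, coeffs_1), nearest_index(codebook_2, coeffs_2)
```

Because the second group's length varies from frame to frame while its coefficient vector has a fixed size, a single fixed-dimension codebook can serve every frame, which is the practical reason for treating the two groups differently in this sketch.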
22 Claims
1. A method of encoding a speech signal comprising the steps of:
sampling the speech signal;
dividing the sampled speech signal into a plurality of frames;
performing multi-band excitation analysis on the signal within each frame to derive a fundamental pitch, a plurality of voiced/unvoiced decisions for frequency bands in the signal and amplitudes of harmonics within said bands;
transforming the harmonic amplitudes to form a plurality of transform coefficients;
vector quantizing the coefficients to form a plurality of indices;
characterised by dividing the harmonic amplitudes into a first group of a fixed number of harmonics and a second group of the remainder of the harmonics, the first and second groups being subject to different transforms to form respective first and second sets of transform coefficients for quantization.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A method of decoding an input data signal for speech synthesis comprising the steps of:
vector dequantizing a plurality of indices of the data signal to form first and second sets of transform coefficients;
inverse-transforming the first and second sets of coefficients using different transforms to derive respective first and second groups of harmonic amplitudes;
deriving pitch and voiced/unvoiced decision information from the input data signal;
performing multi-band excitation synthesis on the information and the harmonic amplitudes to form a synthesized speech signal; and
constructing a speech signal from the synthesized signal.
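The dequantization and inverse-transform steps of claim 8 can be illustrated with a minimal sketch. It is not the patented decoder: the codebooks and function names are hypothetical, the second inverse transform is assumed to be the transpose of a truncated cosine matrix (a stand-in for the inverse Non-Square Transform), and the harmonic count `num_harmonics` is assumed to be derivable from the decoded pitch. Multi-band excitation synthesis itself is not shown.

```python
import math


def cosine_matrix(rows, cols):
    """First `rows` orthonormal DCT-II basis vectors of length `cols`.

    Rows with index >= cols are all zero, so the matrix may be non-square.
    """
    mat = []
    for k in range(rows):
        if k >= cols:
            mat.append([0.0] * cols)
            continue
        scale = math.sqrt((1.0 if k == 0 else 2.0) / cols)
        mat.append([scale * math.cos(math.pi * (i + 0.5) * k / cols)
                    for i in range(cols)])
    return mat


def transpose_apply(mat, coeffs, cols):
    """Multiply the transpose of `mat` by `coeffs` (inverse for orthonormal rows)."""
    return [sum(mat[k][i] * coeffs[k] for k in range(len(mat)))
            for i in range(cols)]


def decode_amplitudes(index_1, index_2, codebook_1, codebook_2, num_harmonics):
    """Dequantize both indices and inverse-transform each coefficient set."""
    coeffs_1 = codebook_1[index_1]           # vector dequantization
    coeffs_2 = codebook_2[index_2]
    n1 = len(coeffs_1)                       # fixed size of the first group
    n2 = num_harmonics - n1                  # remainder, varies with pitch
    first = transpose_apply(cosine_matrix(n1, n1), coeffs_1, n1)
    rest = transpose_apply(cosine_matrix(len(coeffs_2), n2), coeffs_2, n2)
    return first + rest
```

In this sketch the first group is recovered exactly from its coefficients, while the second group is only approximated whenever it contains more harmonics than there are coefficients.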
9. Speech coding apparatus comprising:
means for sampling a speech signal and dividing the sampled signal into a plurality of frames;
a multi-band excitation analyzer for deriving a fundamental pitch and a plurality of voiced/unvoiced decisions for frequency bands in each frame and amplitudes of harmonics within said bands;
transformation means for transforming the harmonic amplitudes to form a plurality of transform coefficients;
vector quantization means for quantizing the coefficients to form a plurality of indices;
characterized in that the transformation means comprises first transform means for transforming a first fixed number of harmonics into a first set of transform coefficients and second transform means for transforming the remainder of the harmonic amplitudes into a second set of transform coefficients, the first and second transform means performing different transforms.
Dependent claims: 10, 11, 12, 13, 14, 15, 16, 17, 18, 19
20. Decoding apparatus for decoding an input data signal for speech synthesis comprising:
vector dequantization means for dequantizing a plurality of indices to form at least two sets of transform coefficients;
first and second transform means for transforming respectively the first and second sets of coefficients to derive first and second groups of harmonic amplitudes, the first and second transform means performing different transforms;
a multi-band excitation synthesizer for combining the harmonics with pitch and voiced/unvoiced decision information from the input signal; and
means for constructing a speech signal from the output of the synthesizer.
Dependent claims: 21, 22
Specification