Scalable speech coding/decoding apparatus, method, and medium having mixed structure
First Claim
1. A scalable speech coding apparatus having a mixed structure, the apparatus comprising:
- a band divider dividing a speech input signal into a low-band signal and a high-band signal according to a specific frequency, and outputting the low-band signal and the high-band signal;
a low-band coder outputting a low-band first index by coding the low-band signal, transmitting information required for coding the high-band signal to a high-band coder, and transmitting an uncoded first error signal to a wide-band coder;
a high-band coder outputting a high-band second index obtained when the high-band signal is coded by using information received from the low-band coder, and transmitting an uncoded second error signal to the wide-band coder;
a wide-band coder quantizing coefficients of the first and second error signals using a modified discrete cosine transform (MDCT) method through time-frequency mapping, and outputting a wide-band third index; and
a bit-stream generator outputting a scalable bit-stream composed of the low-band first index received from the low-band coder, the high-band second index received from the high-band coder, and the wide-band third index received from the wide-band coder.
1 Assignment
0 Petitions
Accused Products
Abstract
Provided are a scalable wide-band speech coding/decoding apparatus, method, and medium. An input wide-band speech input signal is first divided into a low-band signal and a high-band signal. The divided low-band signal is then coded using a code excited linear prediction (CELP) method. The divided high-band signal is coded using a harmonic method. A signal representing a difference between a synthetic signal obtained from the low-band and the high band, and a signal input to the low-band and the high-band is then coded using a modified discrete cosine transform (MDCT) method. The coded signal is then multiplexed. The multiplexed signal is then output. Accordingly, high quality speech can be achieved for all layers.
55 Citations
32 Claims
-
1. A scalable speech coding apparatus having a mixed structure, the apparatus comprising:
-
a band divider dividing a speech input signal into a low-band signal and a high-band signal according to a specific frequency, and outputting the low-band signal and the high-band signal;
a low-band coder outputting a low-band first index by coding the low-band signal, transmitting information required for coding the high-band signal to a high-band coder, and transmitting an uncoded first error signal to a wide-band coder;
a high-band coder outputting a high-band second index obtained when the high-band signal is coded by using information received from the low-band coder, and transmitting an uncoded second error signal to the wide-band coder;
a wide-band coder quantizing coefficients of the first and second error signals using a modified discrete cosine transform (MDCT) method through time-frequency mapping, and outputting a wide-band third index; and
a bit-stream generator outputting a scalable bit-stream composed of the low-band first index received from the low-band coder, the high-band second index received from the high-band coder, and the wide-band third index received from the wide-band coder. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A scalable speech coding method having a mixed structure, the method comprising:
-
(a) dividing a speech input signal into a low-band signal and a high-band signal according to a specific frequency, and outputting the low-band signal and the high-band signal;
(b) generating and outputting a low-band first index by coding the output low-band signal, and outputting specific information required for coding the high-band signal and an uncoded first error signal;
(c) coding the output high-band signal by using the specific information, and outputting a high-band second index and an uncoded second error signal;
(d) quantizing coefficients of the first and second error signals using a modified discrete cosine transform (MDCT) through time-frequency mapping, and outputting a low-band third index; and
(e) outputting a scalable bit-stream composed of the low-band first index, the high-band second index, and the wide-band third index. - View Dependent Claims (8, 9, 10, 11, 12, 25, 26, 27, 28)
-
-
13. A scalable speech decoding apparatus having a mixed structure, the apparatus comprising:
-
a bit-stream divider receiving a scalable bit-stream transmitted at a specific transmission rate according to a network condition, and transmitting the scalable bit-stream to each decoder of a corresponding frequency band by dividing the scalable bit-stream according to a frequency band used in reproduction;
a low-band decoder receiving a low-band signal into which the scalable bitstream is divided by the bit-stream divider, decoding and outputting the received low-band signal, and transmitting specific information required for decoding a high-band signal among coefficients decoded in a low-band;
a high-band decoder decoding and outputting a high-band signal into which the scalable bit-stream is divided by the bitstream divider, using the specific information;
a wide-band decoder decoding a wide-band signal into which the scalable bitstream is divided by the bit-stream divider, and dividing and outputting the decoded wide-band signal into a low-band signal and a high-band signal according to a specific frequency; and
a band combiner outputting a wide-band synthetic signal of a combined band by receiving a first synthetic signal, which is generated when a signal output from the low-band decoder is combined with the low-band signal output from the wide-band decoder, and a second synthetic signal which is generated when a signal output from the high-band decoder is combined with the high-band signal output from the wide-band decoder. - View Dependent Claims (14, 15, 16)
-
-
17. A scalable speech decoding method having a mixed structure, the method comprising:
-
(a) receiving a scalable bit-stream transmitted at a specific transmission rate according to a network condition, and dividing and outputting the scalable bit-stream into a low-band signal, a high-band signal, and a wide-band signal according to a frequency band used for reproduction;
(b) receiving the low-band signal of the scalable bitstream, decoding and outputting the received low-band signal, and outputting information on a pitch signal among coefficients decoded in a low-band;
(c) receiving the high-band signal of the scalable bitstream and the pitch signal information, and decoding and outputting the high-band signal by using the pitch signal information;
(d) receiving and decoding the wide-band signal of the scalable bitstream, and dividing and outputting the decoded wide-band signal into a low-band signal and a high-band signal according to a specific frequency; and
(e) outputting a wide-band synthetic signal of a combined band by receiving a first synthetic signal, which is generated when a signal output in (b) is combined with a low-band signal output in (d), and a second synthetic signal which is generated when a signal output in (c) is combined with a high-band signal output in (d). - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
-
29. A scalable speech coding apparatus having a mixed structure, the apparatus comprising:
-
a band divider dividing a speech input signal into a low-band signal and a high-band signal according to a specific frequency, and outputting the low-band signal and the high-band signal;
a low-band coder outputting a low-band first index by coding a low-band signal, outputting information required for coding a high-band signal, and transmitting an uncoded first error signal to a wide-band coder;
a high-band coder outputting a high-band second index obtained when the high-band signal is coded by using outputted information received from the low-band coder, and transmitting an uncoded second error signal to the wide-band coder;
a wide-band coder quantizing coefficients of the first and second error signals using a modified discrete cosine transform (MDCT) method through time-frequency mapping, and outputting a wide-band third index; and
a bit-stream generator outputting a scalable bit-stream composed of the low-band first index received from the low-band coder, the high-band second index received from the high-band coder, and the wide-band third index received from the wide-band coder. - View Dependent Claims (30)
-
-
31. A scalable speech decoding method having a mixed structure for decoding a scalable bit-stream, the method comprising:
-
(a) receiving a low-band signal of the scalable bitstream, decoding and outputting the received low-band signal, and outputting information on a pitch signal among coefficients decoded in a low-band;
(b) receiving a high-band signal of the scalable bitstream and the pitch signal information, and decoding and outputting the high-band signal by using the pitch signal information;
(c) receiving and decoding a wide-band signal of the scalable bitstream, and dividing and outputting the decoded wide-band signal into a low-band signal and a high-band signal according to a specific frequency; and
(d) outputting a wide-band synthetic signal of a combined band by receiving a first synthetic signal, which is generated when a signal output in (a) is combined with a low-band signal output in (c), and a second synthetic signal which is generated when a signal output in (b) is combined with a high-band signal output in (c). - View Dependent Claims (32)
-
Specification