Rate control strategies for speech and music coding
Abstract
A method and a system are provided for controlling the coding rates of a multimode coding system with respect to a sequence of input audio signal frames. The method eliminates or minimizes the overflow and underflow of a bit-stream buffer maintained by the coding system for temporarily recording bit-stream data prior to transmission or storage.
25 Claims
1. A method for controlling the coding rate of a multimode coding system for coding a current audio signal frame in a sequence of audio signal frames, the method comprising the steps of:
determining a signal type corresponding to the audio signal of each frame in the sequence of frames;
determining an expected coding rate for the current frame according to the signal type of the audio signal of the current frame and an established average coding rate for the sequence of frames;
estimating a target coding rate for the current frame by adjusting the expected coding rate wherein the adjustment to the expected coding rate is based on the signal type of at least one other frame in the sequence of frames and the status of a bit-stream buffer maintained for buffering coded frames; and
determining a coding rate for use in coding the current frame according to the target coding rate.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
calculating an expected total number of bits to be used for coding all frames in the sequence according to the class distribution over all frames in the sequence and an expected coding rate assigned to each class;
observing the total number of residue bits in the current bit-stream buffer;
estimating an expected total number of bits that will have been sent out from the bit-stream buffer during coding all frames in the sequence;
obtaining an expected total number of residue bits in the bit-stream buffer after coding all frames in the sequence according to the expected total number of bits, total number of residue bits in the current bit-stream buffer, and expected total number of bits that will have been sent out from the bit-stream buffer;
obtaining a total number of available bits for adjusting the expected coding rate for the current frame by comparing the obtained total number of residue bits in the bit-stream buffer with a predefined total ideal number of residue bits in the bit-stream buffer after coding all frames in the sequence; and
adjusting the expected coding rate for the current frame in accordance with the total number of available bits for adjusting.
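The budgeting steps above can be read as a bit balance over the sequence. The following is a minimal sketch, not the patented implementation: it assumes rates are expressed in bits per frame, that the buffer drains at a constant number of bits per frame period, and that the available bits are spread evenly over the frames. None of these assumptions comes from the claim text, and every name below is hypothetical.

```python
from collections import Counter


def available_bits(frame_classes, expected_bits, residue_now,
                   drain_bits_per_frame, ideal_residue):
    """Return the bits available for adjusting the current frame's rate.

    frame_classes:        class label of every frame in the sequence
    expected_bits:        dict mapping class label to expected bits per frame
    residue_now:          bits currently held in the bit-stream buffer
    drain_bits_per_frame: bits leaving the buffer per frame period
                          (assumed constant; not stated in the claim)
    ideal_residue:        predefined ideal residue after the whole sequence
    """
    # Expected total bits produced, from the class distribution.
    counts = Counter(frame_classes)
    expected_total = sum(n * expected_bits[c] for c, n in counts.items())

    # Expected total bits sent out while the sequence is being coded.
    expected_sent = drain_bits_per_frame * len(frame_classes)

    # Expected residue once every frame has been coded.
    expected_residue = residue_now + expected_total - expected_sent

    # Positive: room to spend more bits. Negative: bits must be saved.
    return ideal_residue - expected_residue


def adjust_expected_rate(expected_rate, avail, n_frames):
    """Spread the available bits evenly over the frames (an assumption)."""
    return expected_rate + avail / n_frames
```

With four frames labelled music, voiced, silence, voiced at 400, 300, and 50 expected bits per class, a current residue of 200 bits, a drain of 250 bits per frame, and an ideal residue of 150 bits, the expected residue is 250 bits, so 100 bits have to be saved across the sequence.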
6. The method of claim 5, further comprising the step of comparing the total number of available bits deducted to a first adjustment limit and a second adjustment limit for avoiding an underflow and overflow of the bit-stream buffer respectively.
7. The method of claim 1, wherein the step of estimating a target coding rate for the current frame by adjusting the expected coding rate further comprises the step of comparing the total number of available bits for adjusting to a first adjustment limit and a second adjustment limit for avoiding an underflow or overflow of the bit-stream buffer respectively.
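One plausible reading of the two adjustment limits is a clamp on the bit adjustment. The sketch below assumes that a negative adjustment lowers the coding rate (risking buffer underflow) and a positive one raises it (risking overflow), so the first limit acts as a lower bound and the second as an upper bound. That sign convention and the function name are assumptions, not part of the claims.

```python
def clamp_adjustment(avail, first_limit, second_limit):
    """Keep the bit adjustment between the two limits.

    first_limit:  lower bound, assumed to guard against buffer underflow
    second_limit: upper bound, assumed to guard against buffer overflow
    """
    if first_limit > second_limit:
        raise ValueError("first_limit must not exceed second_limit")
    return max(first_limit, min(avail, second_limit))
```

An adjustment that already lies between the limits passes through unchanged; anything outside is pulled back to the nearer limit.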
8. The method of claim 1, wherein the step of determining a coding rate for use in coding the current frame according to the target coding rate further comprises the steps of:
determining whether the determined signal type is associated with a coding rate corresponding to the target coding rate; and
if the determined signal type is not associated with a coding rate corresponding to the target coding rate, selecting a coding rate associated with the determined signal type, wherein the selected coding rate most closely approximates the target coding rate.
9. The method of claim 8 further comprising the step of selecting a coding rate associated with the determined signal type, wherein the selected coding rate corresponds to the target coding rate, if the determined signal type is associated with a coding rate corresponding to the target coding rate.
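The selection described in claims 8 and 9 amounts to a nearest-match lookup within the set of rates that a signal type supports. A minimal sketch follows; the rate table, its values, and the function name are hypothetical, and ties are resolved in favor of the rate listed first, which the claims do not address.

```python
def select_coding_rate(signal_type, target_rate, rates_by_type):
    """Pick the coding rate for a frame from its signal type's rate set.

    rates_by_type maps a signal type to the rates it supports
    (a hypothetical table; the claims do not list concrete rates).
    """
    candidates = rates_by_type[signal_type]

    # Claim 9: the signal type supports the target rate exactly.
    if target_rate in candidates:
        return target_rate

    # Claim 8: otherwise take the supported rate closest to the target.
    return min(candidates, key=lambda rate: abs(rate - target_rate))
```

The exact-match branch is redundant with the `min` call, but it is kept separate here so the two claims can be read off the code directly.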
10. The method of claim 1 further comprising the steps of:
coding the current frame according to the determined coding rate;
buffering coded bits corresponding to the current frame in the bit-stream buffer;
determining whether the number of frames corresponding to the encoded bits buffered in the bit-stream buffer exceeds a predefined pre-buffering number of encoding frames; and
if the number of frames exceeds the pre-buffering number, starting to output a packet of bits from the bit-stream buffer.
11. The method of claim 10 further comprising the step of holding the bits in the bit-stream buffer without outputting a packet if the total number of encoding frames does not exceed the pre-buffering number.
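The pre-buffering behavior of claims 10 and 11 can be sketched as a small state machine. This sketch assumes a fixed packet size and that, once output has started, one packet is emitted per coded frame; both are illustrative choices, and the class and attribute names are hypothetical.

```python
class BitstreamBuffer:
    """Hold coded bits until enough frames are buffered, then emit packets."""

    def __init__(self, prebuffer_frames, packet_bits):
        self.prebuffer_frames = prebuffer_frames  # pre-buffering threshold
        self.packet_bits = packet_bits            # assumed fixed packet size
        self.bits = 0                             # bits currently buffered
        self.frames_buffered = 0                  # frames coded so far
        self.started = False                      # has output begun?

    def push_frame(self, coded_bits):
        """Buffer one coded frame; return the number of bits sent out."""
        self.bits += coded_bits
        self.frames_buffered += 1

        # Claim 10: start output once the frame count exceeds the threshold.
        if not self.started and self.frames_buffered > self.prebuffer_frames:
            self.started = True

        # Claim 11: otherwise hold the bits without outputting a packet.
        if not self.started:
            return 0

        out = min(self.packet_bits, self.bits)
        self.bits -= out
        return out
```

With a threshold of two frames, the first two calls to `push_frame` return 0 and the third call emits the first packet.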
12. A computer-readable medium having computer executable instructions for performing the method of claim 1.
13. A coding system for coding a sequence of audio frames corresponding to a digitized sampled input audio signal to generate a series of coded bits, the system comprising:
a look-ahead buffer for queuing the sequence of frames;
a multimode encoder for receiving frames corresponding to the frames in the look-ahead buffer and encoding the frames into coded bits;
a bit-stream buffer for storing the coded bits generated from the encoder and emitting coded bits; and
a rate controller in connection with the look-ahead buffer and the multimode encoder for controlling the coding mode and coding rate of the multimode encoder, while encoding each frame, according to a characteristic of a current frame, a classification of each other frame in the sequence, and the status of the bit-stream buffer.
Dependent claims: 14, 15, 16
a feature extractor for extracting a set of at least one predefined feature from the signal contained in each frame, wherein the at least one feature is usable to characterize a signal in a frame;
a classifier in connection with the feature extractor for classifying each frame according to the at least one extracted feature from that frame; and
a mode selector in connection with the classifier for selecting a proper coding mode for the encoder for each frame based on the classification of the frame, the classification of at least one other frame in the look-ahead buffer, and the status of the bit-stream buffer.
16. The system according to claim 15, wherein the classifier is adapted to classify each frame as one of a music frame, a speech voiced frame, a speech unvoiced frame, a speech silence frame, and a speech transition frame.
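To make the data flow concrete, the five frame classes of claim 16 and the look-ahead buffer of claim 13 could be modelled as below. This is only a structural sketch: the two-way split into "music" and "speech" coding modes, the fixed buffer depth, and all names are assumptions, and no feature extraction or classification logic is shown because the claims do not specify it.

```python
from collections import Counter, deque
from enum import Enum


class FrameClass(Enum):
    """The five frame classes listed in claim 16."""
    MUSIC = "music"
    SPEECH_VOICED = "speech voiced"
    SPEECH_UNVOICED = "speech unvoiced"
    SPEECH_SILENCE = "speech silence"
    SPEECH_TRANSITION = "speech transition"


def coding_mode(frame_class):
    """Map a class to a coarse coding mode (hypothetical two-mode split)."""
    return "music" if frame_class is FrameClass.MUSIC else "speech"


class LookAhead:
    """Fixed-depth queue of frame classifications."""

    def __init__(self, depth):
        # A bounded deque discards the oldest entry when full.
        self.frames = deque(maxlen=depth)

    def push(self, frame_class):
        self.frames.append(frame_class)

    def class_distribution(self):
        """Count how many queued frames fall into each class."""
        return Counter(self.frames)
```

The class distribution returned by `class_distribution` is the kind of summary a rate controller would need in order to weigh the current frame against the other frames in the queue.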
17. A method for controlling the coding rate for each frame in a sequence of speech data frames in a multimode encoder, the method comprising:
classifying each frame in the sequence of frames into one of a plurality of predefined classes according to a feature of the frame data, wherein each class is associated with an expected coding rate based on a required average coding rate and the relative importance of data of the class to the perceived quality of a reproduced speech signal;
deriving an adjustment for adjusting the expected coding rate for each frame according to the class of each frame in the sequence of frames and the status of a bit-stream buffer provided for storing encoded bits corresponding to the frames;
adjusting the expected coding rate based on the derived adjustment; and
determining a coding rate for encoding each frame according to the adjusted expected coding rate.
Dependent claims: 18, 19
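Claim 17 ties each class's expected rate to a required average rate and to the class's relative importance, without giving a formula. One way to satisfy both constraints, shown purely as an illustration, is to scale per-class importance weights so that the rate averaged over the class shares equals the required average. The weights, the class shares, and the function name are hypothetical.

```python
def expected_rates(avg_rate, importance, class_share):
    """Assign an expected rate to each class.

    avg_rate:    required average coding rate
    importance:  dict mapping class to a relative importance weight
                 (hypothetical values)
    class_share: dict mapping class to its fraction of frames (sums to 1)

    Rates are proportional to importance and scaled so that
    sum(class_share[c] * rate[c]) equals avg_rate.
    """
    norm = sum(class_share[c] * importance[c] for c in importance)
    if norm <= 0:
        raise ValueError("importance-weighted share must be positive")
    return {c: avg_rate * importance[c] / norm for c in importance}
```

For example, with weights of 1.5, 1.0, and 0.5 for voiced, unvoiced, and silence frames that make up 40%, 20%, and 40% of the sequence, a required average of 8000 yields rates of roughly 12000, 8000, and 4000.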
20. A method for controlling the coding rate of a multimode coding system for coding a current audio signal frame in a sequence of audio signal frames, the method comprising the steps of:
determining a signal type corresponding to the audio signal of each frame in the sequence of frames;
estimating a target coding rate for the current frame based on the signal type of at least one other frame in the sequence of frames, the status of a bit-stream buffer maintained for buffering coded frames, and an established average coding rate for the sequence of frames; and
determining a coding rate for use in coding the current frame according to the target coding rate.
Dependent claims: 21, 22, 23, 24, 25
calculating an expected total number of bits to be used for coding all frames in the sequence according to the class distribution over all frames in the sequence and an expected coding rate assigned to each class;
observing the total number of residue bits in the current bit-stream buffer;
estimating an expected total number of bits that will have been sent out from the bit-stream buffer during the coding of all frames in the sequence;
obtaining an expected total number of residue bits in the bit-stream buffer after coding all frames in the sequence according to the expected total number of bits to be used for coding all frames, the total number of residue bits in the current bit-stream buffer, and the expected total number of bits that will have been sent out from the bit-stream buffer during coding all frames in the sequence;
obtaining a total number of available bits for adjustment by comparing the obtained total number of residue bits in the bit-stream buffer with a predefined total ideal number of residue bits in the bit-stream buffer after coding all frames in the sequence; and
estimating the target coding rate for the current frame according to the signal type of the audio signals of the current frame, the established average coding rate and the total number of available bits for adjustment.
22. The method of claim 21, further comprising the step of comparing the total number of available bits for adjustment to a first adjustment limit and a second adjustment limit for avoiding an underflow or overflow of the bit-stream buffer respectively.
23. The method of claim 20, wherein the step of determining a coding rate for use in coding the current frame according to the target coding rate further comprises the steps of:
determining whether the determined signal type is associated with a coding rate corresponding to the target coding rate; and
if the determined signal type is not associated with a coding rate corresponding to the target coding rate, selecting a coding rate associated with the determined signal type, wherein the selected coding rate most closely approximates the target coding rate.
24. The method of claim 23 further comprising the step of selecting a coding rate associated with the determined signal type, wherein the selected coding rate corresponds to the target coding rate, if the determined signal type is associated with a coding rate corresponding to the target coding rate.
25. A computer-readable medium having computer executable instructions for performing the method of claim 20.
Specification