METHOD AND APPARATUS TO ENCODE AND DECODE AN AUDIO/SPEECH SIGNAL
First Claim
Patent Images
1. An apparatus to encode an audio/speech signal, the apparatus comprising:
- a signal transforming unit to transform an inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal;
a psychoacoustic modeling unit to control the signal transforming unit;
a time domain encoding unit to encode the signal, transformed by the signal transforming unit, based on a speech modeling; and
a quantizing unit to quantize the signal outputted from at least one of the signal transforming unit and the time domain encoding unit.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus to encode and decode an audio/speech signal is provided. An inputted audio signal or speech signal may be transformed into at least one of a high frequency resolution signal and a high temporal resolution signal. The signal may be encoded by determining an appropriate resolution, the encoded signal may be decoded, and thus the audio signal, the speech signal, and a mixed signal of the audio signal and the speech signal may be processed.
-
Citations
21 Claims
-
1. An apparatus to encode an audio/speech signal, the apparatus comprising:
-
a signal transforming unit to transform an inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal; a psychoacoustic modeling unit to control the signal transforming unit; a time domain encoding unit to encode the signal, transformed by the signal transforming unit, based on a speech modeling; and a quantizing unit to quantize the signal outputted from at least one of the signal transforming unit and the time domain encoding unit. - View Dependent Claims (2)
-
-
3. An apparatus to encode an audio/speech signal, the apparatus comprising:
-
a parametric stereo processing unit to process stereo information of an inputted audio signal or speech signal; a high frequency signal processing unit to process a high frequency signal of the inputted audio signal or speech signal; a signal transforming unit to transform the inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal; a psychoacoustic modeling unit to control the signal transforming unit; a time domain encoding unit to encode the signal, transformed by the signal transforming unit, based on a speech modeling; and a quantizing unit to quantize the signal outputted from at least one of the signal transforming unit and the time domain encoding unit. - View Dependent Claims (4, 5, 6, 7, 8, 9)
-
-
10. An apparatus to decode audio/speech signal, the apparatus comprising:
-
a resolution decision unit to determine whether a current frame signal is a high frequency resolution signal or a high temporal resolution signal, based on information about time domain encoding or frequency domain encoding, the information being included in a bitstream; a dequantizing unit to dequantize the bitstream when the resolution decision unit determines the signal is the high frequency resolution signal; a time domain decoding unit to decode additional information for inverse linear prediction from the bitstream, and to restore the high temporal resolution signal using the additional information; and an inverse signal transforming unit to inverse-transform at least one of an output signal from the time domain decoding unit and an output signal from the dequantizing unit into an audio signal or speech signal of a time domain. - View Dependent Claims (11)
-
-
12. An apparatus to encoding an audio/speech signal, the apparatus comprising:
-
a signal transforming unit to transform an inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal; a psychoacoustic modeling unit to control the signal transforming unit; a temporal noise shaping unit to shape at least one of the transformed high frequency resolution signal and the transformed high temporal resolution signal; a high rate stereo unit to encode stereo information of the transformed signal; and a quantizing unit to quantize the signal outputted from at least one of the temporal noise shaping unit and the high rate stereo unit. - View Dependent Claims (13)
-
-
14. An apparatus of decoding an audio/speech signal, the apparatus comprising:
-
a dequantizing unit to dequantize a bitstream; a high rate stereo/decoder to decode the dequantized signal; a temporal noise shaper/decoder to process the signal decoded by the high rate stereo/decoder; and an inverse signal transforming unit to inverse-transform the processed signal into an audio signal or speech signal of a time domain, wherein the bitstream is generated by a transformation of the inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal. - View Dependent Claims (15)
-
-
16. An apparatus to encode an audio/speech signal, the apparatus comprising:
-
a signal transforming unit to transform an inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal; a psychoacoustic modeling unit to control the signal transforming unit; a low rate determination unit to determine whether the transformed signal has a low rate; a time domain encoding unit to encode the transformed signal based on a speech modeling when the transformed signal has the low rate; a temporal noise shaping unit to shape the transformed signal; a high rate stereo unit to encode stereo information of the shaped signal; and a quantizing unit to quantize at least one of an output signal from the high rate stereo unit and an output signal from the time domain encoding unit. - View Dependent Claims (17)
-
-
18. A method of encoding an audio/speech signal, the method comprising:
-
transforming an inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal, and controlling the transformed signal based on a psychoacoustic modeling; time-encoding the transformed signal based at least in part on a speech modeling; and quantizing at least one of the transformed signal and the time-encoded signal.
-
-
19. A method of decoding an audio/speech signal, the method comprising:
-
determining whether a current frame signal is a high frequency resolution signal or a high temporal resolution signal, based at least in part on information included in the bitstream about time domain encoding or frequency domain encoding; dequantizing the bitstream when the signal is determined as the high frequency resolution signal; decoding additional information for inverse linear prediction from the bitstream, and restoring the high temporal resolution signal using the additional information; and inverse-transforming at least one of the restored signal and the dequantized signal into an audio signal or speech signal of a time domain.
-
-
20. A method of encoding audio and speech signals, the method comprising:
-
receiving at least one audio signal and at least one speech signal; transforming the at least one of the received audio signal and the received speech signal into at least one of a frequency resolution signal and a temporal resolution signal; encoding the transformed signal; and quantizing at least one of the transformed signal and the encoded signal.
-
-
21. A method of decoding audio and speech signals, the method comprising:
-
determining whether a current frame signal is a frequency resolution signal or a temporal resolution signal with information in the bitstream of a received signal about time domain encoding or frequency domain encoding; dequantizing the bitstream when the received signal is the frequency resolution signal; inverse linear predicting from the information in the bitstream and restoring the temporal resolution signal using the information; and inverse-transforming at least one of the dequantized signal and the restored temporal resolution signal into an audio signal or speech signal of a time domain.
-
Specification