Coding mode determination method and apparatus, audio encoding method and apparatus, and audio decoding method and apparatus
First Claim
1. A method of encoding an audio signal, the method comprising:
receiving the audio signal;
obtaining, performed by at least one processor, first parameters of a current frame of the audio signal;
selecting, performed by the at least one processor, a class of the current frame in the audio signal from among a plurality of classes including a music class and a speech class, based on first parameters of the current frame by using a Gaussian mixture model (GMM);
obtaining second parameters including first tonality, second tonality and third tonality;
generating a plurality of conditions, where each of the plurality of conditions is generated based on a combination of the obtained second parameters;
determining, performed by the at least one processor, whether an error occurs in the selected class of the current frame based on whether at least one of the plurality of conditions is met;
when the error occurs in the selected class of the current frame, correcting, performed by the at least one processor, the selected class of the current frame;
encoding, performed by the at least one processor, the current frame, based on either the corrected class or the selected class of the current frame; and
generating a bitstream based on the encoded current frame,
wherein the first tonality is obtained from a subband of 0 to 1 kHz, the second tonality is obtained from a subband of 1 to 2 kHz and the third tonality is obtained from a subband of 2 to 4 kHz, and
wherein the correcting comprises:
when the error occurs in the selected class of the current frame and the selected class of the current frame is the speech class, correcting the selected class of the current frame from the speech class to the music class; and
when the error occurs in the selected class of the current frame and the selected class of the current frame is the music class, correcting the selected class of the current frame from the music class to the speech class.
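The steps recited in claim 1 can be sketched in Python as follows. This is only an illustration under stated assumptions, not the patented implementation: the claim does not define how tonality is computed or what the conditions are, so the tonality measure here (one minus spectral flatness), the `threshold` value, the condition combinations, and every function name are hypothetical. Only the subband edges, the GMM-based speech/music selection, and the two-way correction come from the claim text.

```python
import math

SPEECH, MUSIC = "speech", "music"
# Subband edges in Hz, as recited in the claim.
SUBBANDS_HZ = [(0, 1000), (1000, 2000), (2000, 4000)]


def gmm_log_likelihood(x, components):
    """Log-likelihood of feature vector x under a diagonal-covariance GMM.

    components is a list of (weight, means, variances) tuples.
    """
    logs = []
    for weight, means, variances in components:
        log_p = math.log(weight)
        for xi, mu, var in zip(x, means, variances):
            log_p += -0.5 * (math.log(2 * math.pi * var) + (xi - mu) ** 2 / var)
        logs.append(log_p)
    peak = max(logs)  # log-sum-exp for numerical stability
    return peak + math.log(sum(math.exp(v - peak) for v in logs))


def select_class(first_params, speech_gmm, music_gmm):
    """Select the class whose GMM gives the higher likelihood."""
    music_ll = gmm_log_likelihood(first_params, music_gmm)
    speech_ll = gmm_log_likelihood(first_params, speech_gmm)
    return MUSIC if music_ll > speech_ll else SPEECH


def subband_tonality(power_spectrum, bin_hz, lo_hz, hi_hz):
    """Assumed measure: 1 - spectral flatness of the bins in [lo_hz, hi_hz)."""
    band = [max(p, 1e-12) for k, p in enumerate(power_spectrum)
            if lo_hz <= k * bin_hz < hi_hz]
    geo_mean = math.exp(sum(math.log(p) for p in band) / len(band))
    arith_mean = sum(band) / len(band)
    return 1.0 - geo_mean / arith_mean


def second_parameters(power_spectrum, bin_hz):
    """First, second and third tonality for the three claimed subbands."""
    return [subband_tonality(power_spectrum, bin_hz, lo, hi)
            for lo, hi in SUBBANDS_HZ]


def error_detected(selected, tonalities, threshold=0.5):
    """Hypothetical conditions, each combining the obtained tonalities."""
    t1, t2, t3 = tonalities
    if selected == SPEECH:
        conditions = [t1 > threshold and t2 > threshold,
                      t2 > threshold and t3 > threshold]
    else:
        conditions = [t1 < threshold and t2 < threshold and t3 < threshold]
    return any(conditions)  # an error occurs if at least one condition is met


def correct_class(selected, error):
    """Swap speech and music when an error was detected; otherwise keep."""
    if not error:
        return selected
    return MUSIC if selected == SPEECH else SPEECH
```

In this sketch a frame would then be encoded according to the value returned by `correct_class`, which is either the corrected class or the originally selected one.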
Abstract
Provided are a method and an apparatus for determining an encoding mode so as to improve the quality of a reconstructed audio signal. A method of determining an encoding mode includes determining, according to characteristics of an audio signal, one of a plurality of encoding modes including a first encoding mode and a second encoding mode as an initial encoding mode, and, if there is an error in the determination of the initial encoding mode, generating a modified encoding mode by modifying the initial encoding mode to a third encoding mode.
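The two-stage decision described in the abstract can be illustrated with a minimal Python sketch. The names `EncodingMode`, `initial_mode` and `modified_mode`, the single scalar characteristic, and the `threshold` are all hypothetical; the abstract does not say which signal characteristics are used or how an error in the determination is detected.

```python
from enum import Enum


class EncodingMode(Enum):
    FIRST = "first"
    SECOND = "second"
    THIRD = "third"


def initial_mode(signal_characteristic, threshold=0.5):
    """Hypothetical rule: map one scalar characteristic to an initial mode."""
    if signal_characteristic >= threshold:
        return EncodingMode.FIRST
    return EncodingMode.SECOND


def modified_mode(initial, determination_error):
    """Switch to the third mode only when the determination is flagged as an error."""
    return EncodingMode.THIRD if determination_error else initial
```

Keeping the error check separate from the initial decision mirrors the abstract: the initial mode is always determined first and is only overridden afterwards.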
6 Claims
Claims 2 to 6 depend on claim 1.
Specification