Audio encoding method and apparatus
First Claim
1. An audio encoding method, comprising:
- dividing an energy spectrum of each of N audio frames into P fast Fourier transform (FFT) energy spectrum coefficients, wherein P and N are positive integers, and the N audio frames comprise a current audio frame;
determining a general sparseness parameter according to energy of the P FFT energy spectrum coefficients of each of the N audio frames by determining an average value of minimum bandwidths of distribution on spectrums of a first preset proportion of energy of the N audio frames according to the energy of the P FFT energy spectrum coefficients of each of the N audio frames, wherein the general sparseness parameter comprises a first minimum bandwidth, wherein the average value of the minimum bandwidths of the distribution on spectrums of the first preset proportion of the energy of the N audio frames is used as the first minimum bandwidth, and wherein the general sparseness parameter indicates sparseness of distribution in energy spectrums of the N audio frames; and
determining, based on the sparseness of distribution, whether to use a first encoding method or a second encoding method to encode the current audio frame, wherein the first encoding method is based on time-frequency transform and transform coefficient quantization, and the second encoding method is a linear-predication-based encoding method, andwherein the first encoding method is determined to be used to encode the current audio frame based on a condition that the first minimum bandwidth is less than a first preset value, or the second encoding method is determined to be used to encode the current audio frame based on a condition that the first minimum bandwidth is greater than the first preset value.
1 Assignment
0 Petitions
Accused Products
Abstract
An audio encoding method and an apparatus are provided. The method includes: determining sparseness of distribution, on spectrums, of energy of N input audio frames (101), where the N audio frames include a current audio frame, and N is a positive integer; and determining, according to the sparseness of distribution, on the spectrums, of the energy of the N audio frames, whether to use a first encoding method or a second encoding method to encode the current audio frame (102), where the first encoding method is an encoding method that is based on time-frequency transform and transform coefficient quantization and that is not based on linear prediction, and the second encoding method is a linear-predication-based encoding method. The method can reduce encoding complexity and ensure that encoding is of relatively high accuracy.
-
Citations
20 Claims
-
1. An audio encoding method, comprising:
-
dividing an energy spectrum of each of N audio frames into P fast Fourier transform (FFT) energy spectrum coefficients, wherein P and N are positive integers, and the N audio frames comprise a current audio frame; determining a general sparseness parameter according to energy of the P FFT energy spectrum coefficients of each of the N audio frames by determining an average value of minimum bandwidths of distribution on spectrums of a first preset proportion of energy of the N audio frames according to the energy of the P FFT energy spectrum coefficients of each of the N audio frames, wherein the general sparseness parameter comprises a first minimum bandwidth, wherein the average value of the minimum bandwidths of the distribution on spectrums of the first preset proportion of the energy of the N audio frames is used as the first minimum bandwidth, and wherein the general sparseness parameter indicates sparseness of distribution in energy spectrums of the N audio frames; and determining, based on the sparseness of distribution, whether to use a first encoding method or a second encoding method to encode the current audio frame, wherein the first encoding method is based on time-frequency transform and transform coefficient quantization, and the second encoding method is a linear-predication-based encoding method, and wherein the first encoding method is determined to be used to encode the current audio frame based on a condition that the first minimum bandwidth is less than a first preset value, or the second encoding method is determined to be used to encode the current audio frame based on a condition that the first minimum bandwidth is greater than the first preset value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 20)
-
-
11. An audio encoder, comprising:
-
a memory comprising instructions; and one or more processors in communication with the memory, wherein the one or more processors execute the instructions to; divide an energy spectrum of each of N audio frames into P fast Fourier transform (FFT) energy spectrum coefficients, wherein P and N are positive integers, and the N audio frames comprise a current audio frame; determine a general sparseness parameter according to energy of the P FFT energy spectrum coefficients of each of the N audio frames by determining an average value of minimum bandwidths of distribution on the spectrums of a first preset proportion energy of the N audio frames according to the energy of the P FFT energy spectrum coefficients of each of the N audio frames, wherein the general sparseness parameter comprises a first minimum bandwidth, wherein the average value of the minimum bandwidths of the distribution on the spectrums of the first preset proportion of the energy of the N audio frames is used as first minimum bandwidth, and wherein the general sparseness parameter indicates sparseness of distribution in energy spectrums of the N audio frames; and determine, based on the sparseness of distribution, whether to use a first encoding method or a second encoding method to encode the current audio frame, wherein the first encoding method is based on time-frequency transform and transform coefficient quantization, and the second encoding method is a linear-predication-based encoding method, and wherein the first encoding method is determined to be used to encode the current audio frame based on a condition that the first minimum bandwidth is less than a first preset value, or the second encoding method is determined to be used to encode the current audio frame based on a condition that the first minimum bandwidth is greater than the first preset value. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
Specification