Speech post-processing using MDCT coefficients
First Claim
1. A method of post-processing a speech signal having a high-band frequency range and a low-band frequency range to generate a post-processed speech signal, the method comprising:
- applying a time-domain post-processing to the speech signal, using LPC (Linear Prediction Coding) coefficients, for the low-band frequency range of the speech signal;
applying a frequency-domain post-processing to the speech signal, using MDCT (Modified Discrete Cosine Transform) coefficients, for the high-band frequency range of the speech signal;
wherein applying the frequency-domain post-processing includes;
decoding an encoded speech signal to obtain MDCT coefficients representative of the speech signal divided into a plurality of sub-bands;
generating an envelope for each sub-band of the plurality of sub-bands as an average magnitude of the MDCT coefficients of the sub-band;
generating an envelope modification factor for each sub-band of the plurality of sub-bands using the MDCT coefficients of the sub-band;
determining a gain based on the envelope and the envelope modification factor of the sub-bands;
generating a fine structure modification factor for each MDCT coefficient in each sub-band of the plurality of sub-band using the MDCT coefficients of the sub-band;
modifying the MDCT coefficients in each sub-band by multiplying by the gain, the envelope modification factor of the sub-band and the fine structure modification factor of the MDCT coefficient of the sub-band to provide post-processed MDCT coefficients;
generating the post-processed speech signal using the post-processed MDCT coefficients; and
converting the post-processed speech signal from a digital form into an analog form using an digital-to-analog converter.
3 Assignments
0 Petitions
Accused Products
Abstract
There is provided a method of post-processing a speech signal. The method comprises applying a time-domain post-processing to the speech signal, using LPC coefficients, for a low-band frequency range and applying a frequency-domain post-processing to the speech signal, using MDCT coefficients, for the high-band frequency range. Applying the frequency-domain post-processing includes decoding an encoded speech signal to obtain MDCT coefficients representative of the speech signal divided into a plurality of sub-bands, generating an envelope for each sub-band of the plurality of sub-bands as an average magnitude of the MDCT coefficients of the sub-band, generating an envelope modification factor for each sub-band of the plurality of sub-band using the MDCT coefficients of the sub-band, modifying the envelope by the envelope modification factor for each sub-band of the plurality of sub-bands to provide a modified envelope, and generating the post-processed speech signal using the modified envelope.
61 Citations
10 Claims
-
1. A method of post-processing a speech signal having a high-band frequency range and a low-band frequency range to generate a post-processed speech signal, the method comprising:
-
applying a time-domain post-processing to the speech signal, using LPC (Linear Prediction Coding) coefficients, for the low-band frequency range of the speech signal; applying a frequency-domain post-processing to the speech signal, using MDCT (Modified Discrete Cosine Transform) coefficients, for the high-band frequency range of the speech signal; wherein applying the frequency-domain post-processing includes; decoding an encoded speech signal to obtain MDCT coefficients representative of the speech signal divided into a plurality of sub-bands; generating an envelope for each sub-band of the plurality of sub-bands as an average magnitude of the MDCT coefficients of the sub-band; generating an envelope modification factor for each sub-band of the plurality of sub-bands using the MDCT coefficients of the sub-band; determining a gain based on the envelope and the envelope modification factor of the sub-bands; generating a fine structure modification factor for each MDCT coefficient in each sub-band of the plurality of sub-band using the MDCT coefficients of the sub-band; modifying the MDCT coefficients in each sub-band by multiplying by the gain, the envelope modification factor of the sub-band and the fine structure modification factor of the MDCT coefficient of the sub-band to provide post-processed MDCT coefficients; generating the post-processed speech signal using the post-processed MDCT coefficients; and converting the post-processed speech signal from a digital form into an analog form using an digital-to-analog converter. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A speech post-processor for post-processing a speech signal having a high-band frequency range and a low-band frequency range to generate a post-processed speech signal, the speech post-processor comprising:
software and circuitry for; applying a time-domain post-processing to the speech signal, using LPC (Linear Prediction Coding) coefficients, for the low-band frequency range of the speech signal; applying a frequency-domain post-processing to the speech signal, using MDCT (Modified Discrete Cosine Transform) coefficients, for the high-band frequency range of the speech signal; wherein applying the frequency-domain post-processing includes; decoding an encoded speech signal to obtain MDCT coefficients representative of the speech signal divided into a plurality of sub-bands; generating an envelope for each sub-band of the plurality of sub-bands as an average magnitude of the MDCT coefficients of the sub-band; generating an envelope modification factor for each sub-band of the plurality of sub-bands using the MDCT coefficients of the sub-band; determining a gain based on the envelope and the envelope modification factor of the sub-bands; generating a fine structure modification factor for each MDCT coefficient in each sub-band of the plurality of sub-band using the MDCT coefficients of the sub-band; modifying the MDCT coefficients in each sub-band by multiplying by the gain, the envelope modification factor of the sub-band and the fine structure modification factor of the MDCT coefficient of the sub-band to provide post-processed MDCT coefficients; generating the post-processed speech signal using the post-processed MDCT coefficients; and converting the post-processed speech signal from a digital form into an analog form using an digital-to-analog converter. - View Dependent Claims (7, 8, 9, 10)
Specification