Reconstructing an audio signal by spectral component regeneration and noise blending
First Claim
1. A method, performed by one or more processors, for generating a time-domain representation of a reconstructed signal that comprises:
- receiving a formatted signal comprising;
baseband transform coefficients of an audio signal, wherein the baseband transform coefficients are Modified Discrete Cosine Transform coefficients,a noise blending parameter, andinformation regarding a spectral envelope of the audio signal, wherein the information regarding the spectral envelope of the audio signal is indicative of an estimate of the spectral envelope of the audio signal;
extracting, from the formatted signal;
the baseband transform coefficients,the noise blending parameter, andthe information regarding the spectral envelope of the audio signal;
generating noise-signal transform coefficients, wherein the noise-signal transform coefficients are weighted in amplitude by a noise blending function that is a function of frequency and the noise blending parameter and that gives greater weight to transform coefficients corresponding to higher frequencies;
generating regenerated-signal transform coefficients that;
correspond to frequencies above the baseband transform coefficients,are copied from a subset of the baseband transform coefficients, andare weighted in amplitude by an inverse of the noise blending function;
generating noisy regenerated transform coefficients by combining;
the regenerated-signal transform coefficients, andthe noise-signal transform coefficients,wherein amplitudes of the noisy regenerated transform coefficients are adjusted in response to the information regarding the spectral envelope of the audio signal;
generating a frequency-domain representation of the reconstructed signal by combining;
the baseband transform coefficients, andthe noisy regenerated transform coefficients;
generating the time-domain representation of the reconstructed signal by applying an inverse Modified Discrete Cosine Transform to the frequency-domain representation of the reconstructed signal; and
generating, from the time-domain representation of the reconstructed signal, an acoustical representation of the reconstructed signal.
1 Assignment
0 Petitions
Accused Products
Abstract
An audio signal is conveyed more efficiently by transmitting or recording a baseband of the signal with an estimated spectral envelope and a noise-blending parameter derived from a measure of the signal'"'"'s noise-like quality. The signal is reconstructed by translating spectral components of the baseband signal to frequencies outside the baseband, adjusting phase of the regenerated components to maintain phase coherency, adjusting spectral shape according to the estimated spectral envelope, and adding noise according to the noise-blending parameter. Preferably, the transmitted or recorded signal also includes an estimated temporal envelope that is used to adjust the temporal shape of the reconstructed signal.
-
Citations
18 Claims
-
1. A method, performed by one or more processors, for generating a time-domain representation of a reconstructed signal that comprises:
-
receiving a formatted signal comprising; baseband transform coefficients of an audio signal, wherein the baseband transform coefficients are Modified Discrete Cosine Transform coefficients, a noise blending parameter, and information regarding a spectral envelope of the audio signal, wherein the information regarding the spectral envelope of the audio signal is indicative of an estimate of the spectral envelope of the audio signal; extracting, from the formatted signal; the baseband transform coefficients, the noise blending parameter, and the information regarding the spectral envelope of the audio signal; generating noise-signal transform coefficients, wherein the noise-signal transform coefficients are weighted in amplitude by a noise blending function that is a function of frequency and the noise blending parameter and that gives greater weight to transform coefficients corresponding to higher frequencies; generating regenerated-signal transform coefficients that; correspond to frequencies above the baseband transform coefficients, are copied from a subset of the baseband transform coefficients, and are weighted in amplitude by an inverse of the noise blending function; generating noisy regenerated transform coefficients by combining; the regenerated-signal transform coefficients, and the noise-signal transform coefficients, wherein amplitudes of the noisy regenerated transform coefficients are adjusted in response to the information regarding the spectral envelope of the audio signal; generating a frequency-domain representation of the reconstructed signal by combining; the baseband transform coefficients, and the noisy regenerated transform coefficients; generating the time-domain representation of the reconstructed signal by applying an inverse Modified Discrete Cosine Transform to the frequency-domain representation of the reconstructed signal; and generating, from the time-domain representation of the reconstructed signal, an acoustical representation of the reconstructed signal. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. An apparatus for generating a time-domain representation of a reconstructed signal, wherein the apparatus comprises one or more processors configured to:
-
receive a formatted signal comprising; baseband transform coefficients of an audio signal, wherein the baseband transform coefficients are Modified Discrete Cosine Transform coefficients, a noise blending parameter, and information regarding a spectral envelope of the audio signal, wherein the information regarding the spectral envelope of the audio signal is indicative of an estimate of the spectral envelope of the audio signal; extract, from the formatted signal; the baseband transform coefficients, the noise blending parameter, and the information regarding the spectral envelope of the audio signal; generate noise-signal transform coefficients, wherein the noise-signal transform coefficients are weighted in amplitude by a noise blending function that is a function of frequency and the noise blending parameter and that gives greater weight to transform coefficients corresponding to higher frequencies; generate regenerated-signal transform coefficients that; correspond to frequencies above the baseband transform coefficients, are copied from a subset of the baseband transform coefficients, and are weighted in amplitude by an inverse of the noise blending function; generate noisy regenerated transform coefficients by combining; the regenerated-signal transform coefficients, and the noise-signal transform coefficients, wherein amplitudes of the noisy regenerated transform coefficients are adjusted in response to the information regarding the spectral envelope of the audio signal; generate a frequency-domain representation of the reconstructed signal by combining; the baseband transform coefficients, and the noisy regenerated transform coefficients; generate the time-domain representation of the reconstructed signal by applying an inverse Modified Discrete Cosine Transform to the frequency-domain representation of the reconstructed signal; and wherein the apparatus is further configured to generate, from the time-domain representation of the reconstructed signal, an acoustical representation of the reconstructed signal. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A non-transitory medium that is readable by a device and that records a program of instructions executable by the device to perform a method for generating a time-domain representation of a reconstructed signal, wherein the method comprises:
-
receiving a formatted signal comprising; baseband transform coefficients of an audio signal, wherein the baseband transform coefficients are Modified Discrete Cosine Transform coefficients, a noise blending parameter, and information regarding a spectral envelope of the audio signal, wherein the information regarding the spectral envelope of the audio signal is indicative of an estimate of the spectral envelope of the audio signal; extracting, from the formatted signal; the baseband transform coefficients, the noise blending parameter, and the information regarding the spectral envelope of the audio signal; generating noise-signal transform coefficients, wherein the noise-signal transform coefficients are weighted in amplitude by a noise blending function that is a function of frequency and the noise blending parameter and that gives greater weight to transform coefficients corresponding to higher frequencies; generating regenerated-signal transform coefficients that; correspond to frequencies above the baseband transform coefficients, are copied from a subset of the baseband transform coefficients, and are weighted in amplitude by an inverse of the noise blending function; generating noisy regenerated transform coefficients by combining; the regenerated-signal transform coefficients, and the noise-signal transform coefficients, wherein amplitudes of the noisy regenerated transform coefficients are adjusted in response to the information regarding the spectral envelope of the audio signal; generating a frequency-domain representation of the reconstructed signal by combining; the baseband transform coefficients, and the noisy regenerated transform coefficients; generating the time-domain representation of the reconstructed signal by applying an inverse Modified Discrete Cosine Transform to the frequency-domain representation of the reconstructed signal; and generating, from the time-domain representation of the reconstructed signal, an acoustical representation of the reconstructed signal. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification