Method and apparatus for coding audio signals based on perceptual model
First Claim
1. A method for coding an input set of stereophonic audio signals comprising respective left channel and right channel sets of signals, said method comprisingforming for each of said left channel and right channel sets of signals, a set of first signals representing the frequency content of said input set, said set of first signals comprising signals representing amplitude and phase information for each of a plurality of frequency bands,forming sets of sum and difference channel signals corresponding, respectively, to the sum of, and difference between, corresponding ones of said set of first signals for said left channel and said set of first signals for said right channel,determining a randomness metric for each of said plurality of frequency bands for each of said sets of first signals,based on the distribution of power with frequency for each of said sets of first signals and on said randomness metrics for each of said sets of first signals, forming a tonality function as a function of frequency, andbased on said randomness metric determining a first perceptual threshold for each of said left channel and right channel sets of signals and selecting between (i) said sum and difference channel signals, or (ii) said left channel and right channel signals, based on the determined threshold values for each of said plurality of frequency bands.
3 Assignments
0 Petitions
Accused Products
Abstract
Coding of high quality stereophonic audio signals is accomplished in a perceptual filterbank coder which exploits the interchannel redundancies and psychoacoustic. Using perceptual principles, switching between a normal and short window of input samples improve output signal quality for certain input signals, particularly those having a rapid attack. Switching is also accomplished between coding of left and right channels and so- called sum and difference channels in response to particular signal conditions. A number of new perceptually based techniques, including improved threshold determinations, result in high quality.
372 Citations
4 Claims
-
1. A method for coding an input set of stereophonic audio signals comprising respective left channel and right channel sets of signals, said method comprising
forming for each of said left channel and right channel sets of signals, a set of first signals representing the frequency content of said input set, said set of first signals comprising signals representing amplitude and phase information for each of a plurality of frequency bands, forming sets of sum and difference channel signals corresponding, respectively, to the sum of, and difference between, corresponding ones of said set of first signals for said left channel and said set of first signals for said right channel, determining a randomness metric for each of said plurality of frequency bands for each of said sets of first signals, based on the distribution of power with frequency for each of said sets of first signals and on said randomness metrics for each of said sets of first signals, forming a tonality function as a function of frequency, and based on said randomness metric determining a first perceptual threshold for each of said left channel and right channel sets of signals and selecting between (i) said sum and difference channel signals, or (ii) said left channel and right channel signals, based on the determined threshold values for each of said plurality of frequency bands.
Specification