Audio coding with gain profile extraction and transmission for speech enhancement at the decoder
First Claim
1. An audio encoding system for producing, based on an audio signal, a gain profile to be distributed with said audio signal, the gain profile comprising a time-variable voice activity gain and a time-variable and frequency-variable cleaning gain, wherein the audio encoding system comprises:
- a voice activity detector adapted to determine the voice activity gain by at least determining voice activity in the audio signal; and
a noise estimator adapted to determine the cleaning gain by at least estimating noise in said audio signal,wherein the cleaning gain is separable from the voice activity gain in the gain profile.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention provides a layered audio coding format with a monophonic layer and at least one sound field layer. A plurality of audio signals is decomposed, in accordance with decomposition parameters controlling the quantitative properties of an orthogonal energy-compacting transform, into rotated audio signals. Further, a time-variable gain profile specifying constructively how the rotated audio signals may be processed to attenuate undesired audio content is derived. The monophonic layer may comprise one of the rotated signals and the gain profile. The sound field layer may comprise the rotated signals and the decomposition parameters. In one embodiment, the gain profile comprises a cleaning gain profile with the main purpose of eliminating non-speech components and/or noise. The gain profile may also comprise mutually independent broadband gains. Because signals in the audio coding format can be mixed with a limited computational effort, the invention may advantageously be applied in a tele-conferencing application.
-
Citations
20 Claims
-
1. An audio encoding system for producing, based on an audio signal, a gain profile to be distributed with said audio signal, the gain profile comprising a time-variable voice activity gain and a time-variable and frequency-variable cleaning gain, wherein the audio encoding system comprises:
-
a voice activity detector adapted to determine the voice activity gain by at least determining voice activity in the audio signal; and a noise estimator adapted to determine the cleaning gain by at least estimating noise in said audio signal, wherein the cleaning gain is separable from the voice activity gain in the gain profile. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An audio encoding method for producing, based on an audio signal, a gain profile to be distributed with said audio signal, the gain profile comprising a time-variable voice activity gain and a time-variable and frequency-variable cleaning gain, wherein the audio encoding method comprises:
-
determining voice activity in said audio signal; assigning a value to the voice activity gain based the determined voice activity; estimating noise in said audio signal; and assigning a value to the cleaning gain based on the estimated noise, wherein the cleaning gain is separable from the voice activity gain in the gain profile. - View Dependent Claims (10)
-
-
11. A mixing system for combining a plurality of received pairs of an audio signal and an associated gain profile, each of said gain profiles comprising a time-variable voice activity gain and a time-variable and frequency-variable cleaning gain, wherein the mixing system comprises:
-
a decoder adapted to derive, from each of the gain profiles, a representation of the audio signal, the voice activity gain and the cleaning gain, wherein the voice activity gain is separable from the cleaning gain in the gain profile; a gain combining stage adapted to; determine a combined voice activity gain by combining the derived voice activity gains using a first combining rule, and determine a combined cleaning gain by combining the derived cleaning gains by a second combining rule different from the first combining rule; and a mixing stage adapted to combine one or more of the audio signals into a combined audio signal to be distributed with the combined voice activity gain and the combined cleaning gain. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification