Enhancement of multichannel audio
First Claim
Patent Images
1. A method for enhancing an audio signal, wherein the audio signal comprises two or more channels of audio content, the method comprising:
- dividing the audio signal into segments;
examining the segments to determine whether the segments contain one or more indicia of speech, and if the one or more indicia are present in a segment, classifying the segment as a speech segment;
estimating a loudness of a speech component associated with the speech segment;
calculating a gain for the speech segment based at least in part on the estimated loudness, a reference loudness level, and an estimated loudness associated with a previous segment;
smoothing the calculated gain to control the rate at which the calculated gain changes from the speech segment to a second segment of the audio signal; and
applying the smoothed gain to the audio signal.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention relates to audio signal processing. More specifically, the invention relates to enhancing multichannel audio, such as television audio, by applying a gain to the audio that has been smoothed between segments of the audio. The invention relates to methods, apparatus for performing such methods, and to software stored on a computer-readable medium for causing a computer to perform such methods.
-
Citations
19 Claims
-
1. A method for enhancing an audio signal, wherein the audio signal comprises two or more channels of audio content, the method comprising:
-
dividing the audio signal into segments; examining the segments to determine whether the segments contain one or more indicia of speech, and if the one or more indicia are present in a segment, classifying the segment as a speech segment; estimating a loudness of a speech component associated with the speech segment; calculating a gain for the speech segment based at least in part on the estimated loudness, a reference loudness level, and an estimated loudness associated with a previous segment; smoothing the calculated gain to control the rate at which the calculated gain changes from the speech segment to a second segment of the audio signal; and applying the smoothed gain to the audio signal. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for enhancing an audio signal, wherein the audio signal comprises two or more channels of audio content, the system comprising:
-
a controller that receives the audio signal, wherein the controller comprises a 30 buffer that temporarily stores segments of the audio signal as the segments are received; a detection module that determines whether one or more of the stored segments contains characteristics of dialog, and if a segment is determined to contain characteristics of dialog, identifies the segment as a dialog segment; an analysis module that estimates a power level of a speech component associated with the dialog segment; and an enhancement processor that calculates a gain for the dialog segment and smooths the calculated gain to control the rate at which the gain changes from the dialog segment to a second segment of the audio signal, the gain being calculated based at least in part on, the estimated power level of the speech component and an estimated loudness associated with a previous segment. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A method for signal processing comprising:
-
receiving an audio signal, wherein the audio signal comprises two or more channels of audio content; analyzing features of the audio signal; classifying a segment of the audio signal as a speech segment if the segment contains one or more features of speech; analyzing the speech segment to obtain an estimated loudness of a speech component of the speech segment; calculating a gain for the speech segment based at least in part on the estimated loudness, a reference loudness, and an estimated loudness associated with a previous segment; and smoothing the calculated gain to control the rate at which the calculated gain changes from the speech segment to a second segment of the audio signal.
-
Specification