Method and system for dialog enhancement
First Claim
1. A method for enhancing dialog determined by an audio input signal, said method including the steps of:
- (a) analyzing the input signal to generate filter control values without use of feedback; and
(b) providing at least one of the control values to a peaking filter, filtering a speech channel determined by the input signal in the peaking filter in a manner steered by said at least one of the control values to generate a dialog-enhanced speech channel, and attenuating non-speech channels determined by the input signal in ducking circuitry steered by at least a subset of the control values to generate attenuated non-speech channels, where the control values are distinct from the speech channel, the control values are distinct from the non-speech channels, the peaking filter is distinct from the ducking circuitry, the peaking filter is coupled and configured to filter the speech channel but not the non-speech channels, the ducking circuitry is coupled and configured to attenuate the non-speech channels but not the speech channel, the peaking filter is configured to emphasize frequency components of the speech channel in a frequency range critical to intelligibility of speech, relative to frequency components of the speech channel outside the frequency range, and said frequency range has a center frequency,wherein the step of attenuating the non-speech channels includes reducing gain application to the non-speech channels in response to a change in said at least a subset of the control values indicative of increase of power of the speech channel relative to combined power of the non-speech channels, and the step of filtering the speech channel includes applying more gain to the frequency components of the speech channel at the center frequency in response to a change in said at least one of the control values indicative of an increase in power of the speech channel relative to power of at least one of the non-speech channels.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for enhancing dialog determined by an audio input signal. In some embodiments the input signal is a stereo signal, and the system includes an analysis subsystem configured to analyze the stereo signal to generate filter control values, and a filtering subsystem including upmixing circuitry configured to upmix the input signal to generate a speech channel and non-speech channels and a peaking filter configured to filter the speech channel to enhance dialog while being steered by at least one of the control values. The filtering subsystem also includes ducking circuitry for attenuating the non-speech channels while being steered by at least some of the control values, and downmixing circuitry configured to combine outputs of the peaking filter and ducking circuitry to generate a filtered stereo output. In some embodiments, the system is configured to downmix a multichannel input signal to generate a downmixed stereo signal, an analysis subsystem is configured to analyze the downmixed stereo signal to generate filter control values, and a filtering subsystem is configured to generate a dialog-enhanced audio signal in response to the input signal while being steered by at least some of the filter control values. Preferably, the filter control values are generated without use of feedback including by generating power ratios (for pairs of speech and non-speech channels) and preferably also shaping in nonlinear fashion and scaling at least one of the power ratios.
56 Citations
31 Claims
-
1. A method for enhancing dialog determined by an audio input signal, said method including the steps of:
-
(a) analyzing the input signal to generate filter control values without use of feedback; and (b) providing at least one of the control values to a peaking filter, filtering a speech channel determined by the input signal in the peaking filter in a manner steered by said at least one of the control values to generate a dialog-enhanced speech channel, and attenuating non-speech channels determined by the input signal in ducking circuitry steered by at least a subset of the control values to generate attenuated non-speech channels, where the control values are distinct from the speech channel, the control values are distinct from the non-speech channels, the peaking filter is distinct from the ducking circuitry, the peaking filter is coupled and configured to filter the speech channel but not the non-speech channels, the ducking circuitry is coupled and configured to attenuate the non-speech channels but not the speech channel, the peaking filter is configured to emphasize frequency components of the speech channel in a frequency range critical to intelligibility of speech, relative to frequency components of the speech channel outside the frequency range, and said frequency range has a center frequency, wherein the step of attenuating the non-speech channels includes reducing gain application to the non-speech channels in response to a change in said at least a subset of the control values indicative of increase of power of the speech channel relative to combined power of the non-speech channels, and the step of filtering the speech channel includes applying more gain to the frequency components of the speech channel at the center frequency in response to a change in said at least one of the control values indicative of an increase in power of the speech channel relative to power of at least one of the non-speech channels. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system for enhancing dialog determined by an audio input signal, including:
-
an analysis subsystem coupled and configured to analyze the input signal to generate filter control values without use of feedback; and a filtering subsystem coupled to the analysis subsystem and including a peaking filter and ducking circuitry, wherein the peaking filter is coupled to receive at least one of the control values and configured to filter a speech channel determined by the input signal, while being steered by said at least one of the control values, to generate a dialog-enhanced speech channel, and the ducking circuitry is configured to attenuate non-speech channels determined by the input signal, while being steered by at least a subset of the control values, to generate attenuated non-speech channels, where the control values are distinct from the speech channel, the control values are distinct from the non-speech channels, the peaking filter is distinct from the ducking circuitry, the peaking filter is coupled and configured to filter the speech channel but not the non-speech channels, including by emphasizing frequency components of the speech channel in a frequency range critical to intelligibility of speech relative to frequency components of the speech channel outside the frequency range, where said frequency range has a center frequency, the ducking circuitry is coupled and configured to attenuate the non-speech channels but not the speech channel, including by reducing gain application to the non-speech channels in response to a change in said at least a subset of the control values indicative of an increase, within limits, in power of the speech channel relative to combined power of the non-speech channels, and the peaking filter is configured to apply increased gain to frequency components of the speech channel having the center frequency in response to a change in said at least one of the control values indicating an increase in power of the speech channel relative to power of at least one of the non-speech channels. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31)
-
Specification