Method and apparatus for environmental noise compensation by determining a presence or an absence of an audio event
First Claim
1. A method automatically performed by a system for environmental noise compensation of a first speech captured by a first audio capture device in a speech audio signal outside of an audio environment, the method comprising:
- estimating a fast audio energy level and a slow audio energy level from an environment audio signal captured by a second audio capture device in the audio environment, wherein;
the fast audio energy level corresponds to a second speech captured in the audio environment, andthe slow audio energy level corresponds to an ambient noise captured in the audio environment;
applying a gain to the speech audio signal to generate an environment compensated speech audio signal;
determining either a presence or an absence of an audio event by comparing the fast audio energy level with the slow audio energy level against a predetermined energy threshold;
updating the gain based on the estimated slow audio energy level during the absence of the audio event; and
freezing the gain at a current level during the presence of the audio event.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of environmental noise compensation a speech audio signal is provided that includes estimating a fast audio energy level and a slow audio energy level in an audio environment, wherein the speech audio signal is not part of the audio environment, and applying a gain to the speech audio signal to generate an environment compensated speech audio signal, wherein the gain is updated based on the estimated slow audio energy level when the estimated fast audio energy level is not indicative of an audio event in the audio environment and the estimated gain is not updated when the estimated fast audio energy level is indicative an audio event in the audio environment.
-
Citations
15 Claims
-
1. A method automatically performed by a system for environmental noise compensation of a first speech captured by a first audio capture device in a speech audio signal outside of an audio environment, the method comprising:
-
estimating a fast audio energy level and a slow audio energy level from an environment audio signal captured by a second audio capture device in the audio environment, wherein; the fast audio energy level corresponds to a second speech captured in the audio environment, and the slow audio energy level corresponds to an ambient noise captured in the audio environment; applying a gain to the speech audio signal to generate an environment compensated speech audio signal; determining either a presence or an absence of an audio event by comparing the fast audio energy level with the slow audio energy level against a predetermined energy threshold; updating the gain based on the estimated slow audio energy level during the absence of the audio event; and freezing the gain at a current level during the presence of the audio event. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method performed by a system for environmental noise compensation of a first speech captured by a first audio capture device in a speech audio signal outside an audio environment, the method comprising:
-
tracking a noise level and a speech level from samples of an environment audio signal captured by a second audio capture device in the audio environment, wherein; the speech level corresponds to a second speech in the audio environment, and the noise level corresponds to an ambient noise in the audio environment; determining either a presence or an absence of the second speech in the audio environment by comparing the noise level and the speech level against a predetermined threshold; updating a gain based on the noise level during the absence of the second speech; freezing the gain at a current level during the presence of the second speech; and applying the gain to the speech audio signal to generate an environment compensated speech audio signal. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A digital system comprising:
-
a processor; means for receiving an audio signal captured by a first audio capture device in an audio environment; means for receiving a speech audio signal carrying a first speech captured by a second audio capture device outside of the audio environment; and a non-transitory memory configured to store instructions that, when executed by the processor, cause the digital system to perform a method comprising; estimating a fast audio energy level corresponding to a second speech captured in the audio environment; estimating a slow audio energy level corresponding to an ambient noise captured in the audio environment; applying a gain to the speech audio signal to generate an environment compensated speech audio signal; determining either a presence or an absence of an audio event by comparing the fast audio energy level with the slow audio energy level against a predetermined energy threshold; updating the gain based on the estimated slow audio energy level during the absence of the audio event; and freezing the gain at a current level during the presence of the audio event. - View Dependent Claims (12, 13, 14, 15)
-
Specification