Method for enhancing audio signal using phase information
First Claim
Patent Images
1. A method for transforming a noisy audio signal to an enhanced audio signal, comprising steps:
- acquiring the noisy audio signal from an environment;
inputting the noisy audio signal to a deep neural network having network parameters to produce a magnitude mask and a phase estimate, wherein the deep neural network is a deep recurrent neural network (DRNN), a bidirectional long short-term memory (BLSTM) deep recurrent neural network (DRNN) or a long short-term memory (LSTM) network, wherein the deep neural network uses a phase-sensitive objective function based on an error in a complex spectrum that includes an error in amplitude and a phase of the noisy audio signal;
using the magnitude mask and the phase estimate to obtain the enhanced audio signal, wherein the steps are performed in a processor.
0 Assignments
0 Petitions
Accused Products
Abstract
A method transforms a noisy audio signal to an enhanced audio signal, by first acquiring the noisy audio signal from an environment. The noisy audio signal is processed by an enhancement network having network parameters to jointly produce a magnitude mask and a phase estimate. Then, the magnitude mask and the phase estimate are used to obtain the enhanced audio signal.
-
Citations
12 Claims
-
1. A method for transforming a noisy audio signal to an enhanced audio signal, comprising steps:
-
acquiring the noisy audio signal from an environment; inputting the noisy audio signal to a deep neural network having network parameters to produce a magnitude mask and a phase estimate, wherein the deep neural network is a deep recurrent neural network (DRNN), a bidirectional long short-term memory (BLSTM) deep recurrent neural network (DRNN) or a long short-term memory (LSTM) network, wherein the deep neural network uses a phase-sensitive objective function based on an error in a complex spectrum that includes an error in amplitude and a phase of the noisy audio signal; using the magnitude mask and the phase estimate to obtain the enhanced audio signal, wherein the steps are performed in a processor. - View Dependent Claims (2, 3, 4, 11, 12)
-
-
5. An audio signal transformation system comprising:
-
a sound detecting device configured to acquire a noisy audio signal from an environment; a signal input interface device configured to receive and transmit the noisy audio signal; an audio signal processing device configured to process the noisy audio signal, wherein the audio signal processing device comprises; a processor configured to connected to a memory, the memory being configured to input/output data, wherein the processor executes the steps of; inputting the noisy audio signal to a deep neural network having network parameters to produce a magnitude mask and a phase estimate, wherein the deep neural network is a bidirectional long short-term memory (BLSTM) deep recurrent neural network (DRNN) or a long short-term memory (LSTM) network, wherein the deep neural network uses a phase-sensitive objective function based on an error in a complex spectrum that includes an error in amplitude and a phase of the noisy audio signal; using the magnitude mask and the phase estimate to obtain an enhanced audio signal, and a signal output device configured to output the enhanced audio signal. - View Dependent Claims (6, 7, 8, 9, 10)
-
Specification