Selective Audio Source Enhancement
First Claim
1. A selective audio source enhancement system comprising:
- a system processor and a system memory, the system memory including;
a pre-processing unit controlled by the system processor to receive audio data including a target audio signal, and to perform sub-band domain decomposition of the audio data to generate a plurality of buffered outputs;
a target source detection unit controlled by the system processor to receive the plurality of buffered outputs, and to generate a target presence probability corresponding to the target audio signal;
a spatial filter estimation unit controlled by the system processor to receive the target presence probability, transform frames buffered in each sub-band into a higher resolution frequency-domain, and update the spatial filters in the higher resolution frequency-domaina spectral filtering unit controlled by the system processor to retrieve a multichannel image of the target audio signal and noise signals associated with the target audio signal;
an audio synthesis unit controlled by the system processor to extract an enhanced mono signal corresponding to the target audio signal from the multichannel image.
4 Assignments
0 Petitions
Accused Products
Abstract
A selective audio source enhancement system includes a processor and a memory, and a pre-processing unit configured to receive audio data including a target audio signal, and to perform sub-band domain decomposition of the audio data to generate buffered outputs. In addition, the system includes a target source detection unit configured to receive the buffered outputs, and to generate a target presence probability corresponding to the target audio signal, as well as a spatial filter estimation unit configured to receive the target presence probability, and to transform frames buffered in each sub-band into a higher resolution frequency-domain. The system also includes a spectral filtering unit configured to retrieve a multichannel image of the target audio signal and noise signals associated with the target audio signal, and an audio synthesis unit configured to extract an enhanced mono signal corresponding to the target audio signal from the multichannel image.
16 Citations
20 Claims
-
1. A selective audio source enhancement system comprising:
a system processor and a system memory, the system memory including; a pre-processing unit controlled by the system processor to receive audio data including a target audio signal, and to perform sub-band domain decomposition of the audio data to generate a plurality of buffered outputs; a target source detection unit controlled by the system processor to receive the plurality of buffered outputs, and to generate a target presence probability corresponding to the target audio signal; a spatial filter estimation unit controlled by the system processor to receive the target presence probability, transform frames buffered in each sub-band into a higher resolution frequency-domain, and update the spatial filters in the higher resolution frequency-domain a spectral filtering unit controlled by the system processor to retrieve a multichannel image of the target audio signal and noise signals associated with the target audio signal; an audio synthesis unit controlled by the system processor to extract an enhanced mono signal corresponding to the target audio signal from the multichannel image. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
11. A method for use by a selective audio source enhancement system including a system processor and a system memory, the method comprising:
-
pre-processing, by a pre-processing unit stored in the system memory and controlled by the system processor, received audio data including a target audio signal by performing sub-band domain decomposition of the audio data to generate a plurality of buffered outputs; generating, by a target source detection unit stored in the system memory and controlled by the system processor, a target presence probability corresponding to the target audio signal based on the plurality of buffered outputs; receiving, by a spatial filter estimation unit stored in the system memory and controlled by the system processor, the target presence probability, and transforming frames buffered in each sub-band into a higher resolution frequency-domain; retrieving, by a spectral filtering unit stored in the system memory and controlled by the system processor, a multichannel image of the target audio signal and noise signals associated with the target audio signal; extracting, by an audio synthesis unit stored in the system memory and controlled by the system processor, an enhanced mono signal corresponding to the target audio signal from the multichannel image. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification