Blind source separation systems
First Claim
1. A method of processing acoustic data representing audio from a plurality of different acoustic sources mixed together to extract the audio from an individual one of the acoustic sources so that it can be listened to separately, the method comprising performing blind source separation by:
- inputting acoustic data from a plurality of acoustic sensors, said acoustic data comprising acoustic signals combined from said plurality of acoustic sources;
converting said input acoustic data to combined source time-frequency domain data representing said acoustic signals combined from said plurality of acoustic sources, wherein said time-frequency domain data is represented by an observation matrix Xƒ
for each of a plurality of frequencies ƒ
;
performing an independent component analysis (ICA) on said observation matrix Xƒ
to determine a demixing matrix Wƒ
for each said frequency such that an estimate Yƒ
of the acoustic signals from said plurality of acoustic sources at said frequencies ƒ
is determined by Xƒ
Wƒ
;
wherein said ICA is performed based on an estimation of an individual source spectrogram of each individual said acoustic source; and
wherein said estimation of said individual source spectrogram of each individual said acoustic source is determined from a model of said individual acoustic source, the model representing individual source time-frequency variations in a signal output of said individual acoustic source;
using said demixing matrix Wƒ
to process said acoustic data comprising acoustic signals combined from said plurality of acoustic sources and demix individual acoustic data for an individual one of said plurality of acoustic sources; and
providing the acoustic data for the individual one of said plurality of acoustic sources to an output device for transmission to a user.
2 Assignments
0 Petitions
Accused Products
Abstract
We describe a method of blind source separation for use, for example, in a listening or hearing aid. The method processes input data from multiple microphones each receiving a mixed signal from multiple audio sources, performing independent component analysis (ICA) on the data in the time-frequency domain based on an estimation of a spectrogram of each acoustic source. The spectrograms of the sources are determined from non-negative matrix factorization (NMF) models of each source, the NMF model representing time-frequency variations in the output of an acoustic source in the time-frequency domain. The NMF and ICA models are jointly optimized, thus automatically resolving an inter-frequency permutation ambiguity.
-
Citations
20 Claims
-
1. A method of processing acoustic data representing audio from a plurality of different acoustic sources mixed together to extract the audio from an individual one of the acoustic sources so that it can be listened to separately, the method comprising performing blind source separation by:
-
inputting acoustic data from a plurality of acoustic sensors, said acoustic data comprising acoustic signals combined from said plurality of acoustic sources; converting said input acoustic data to combined source time-frequency domain data representing said acoustic signals combined from said plurality of acoustic sources, wherein said time-frequency domain data is represented by an observation matrix Xƒ
for each of a plurality of frequencies ƒ
;performing an independent component analysis (ICA) on said observation matrix Xƒ
to determine a demixing matrix Wƒ
for each said frequency such that an estimate Yƒ
of the acoustic signals from said plurality of acoustic sources at said frequencies ƒ
is determined by Xƒ
Wƒ
;wherein said ICA is performed based on an estimation of an individual source spectrogram of each individual said acoustic source; and wherein said estimation of said individual source spectrogram of each individual said acoustic source is determined from a model of said individual acoustic source, the model representing individual source time-frequency variations in a signal output of said individual acoustic source; using said demixing matrix Wƒ
to process said acoustic data comprising acoustic signals combined from said plurality of acoustic sources and demix individual acoustic data for an individual one of said plurality of acoustic sources; andproviding the acoustic data for the individual one of said plurality of acoustic sources to an output device for transmission to a user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 11, 12, 13, 14)
-
-
10. A method as claimed in 1 further comprising pre-processing said acoustic data to reduce a number of said acoustic signals from said plurality of acoustic sensors to a reduced number of acoustic signals which is less than a number of said acoustic sensors, wherein said reduced number of acoustic signals is equal to a number of said plurality of said acoustic sources.
-
15. A method of processing acoustic data representing audio from a plurality of different acoustic sources mixed together to extract the audio from an individual one of the acoustic sources so that it can be listened to separately, the method comprising performing blind source separation by:
-
capturing the acoustic data representing audio from the plurality of acoustic sources at a plurality of microphones; processing the captured acoustic data to provide a set of observation matrices, said set of observation matrices representing observations of acoustic signals combined from said plurality of acoustic sources, wherein said set of observation matrices comprises a plurality of observation matrices, wherein each observation matrix is denoted Xƒ
and comprises data in a time-frequency domain for one of a plurality of frequencies ƒ
;wherein acoustic data for one of said plurality of acoustic sources and at one of said plurality of frequencies, demixed from said acoustic signals combined from said plurality of acoustic sources, is denoted Yƒ
, where Yƒ
comprises data in said time-frequency domain, andprocessing said set of observation matrices using a demixing matrix Wƒ
for each of said plurality of frequencies to determine an estimate of said acoustic data, denoted Yƒ
, demixed from said acoustic signals combined from said plurality of acoustic sources;wherein said processing comprises iteratively updating Yƒ
from Xƒ
Wƒ
; andwherein said processing is performed based on a probability distribution p(Ytkf;
σ
tkf) for Y dependent upon - View Dependent Claims (16, 17)
-
-
18. Apparatus to improve audibility of an audio signal by blind source separation, the apparatus comprising:
-
a set of microphones, each of the set of microphones having a known geometry, to receive signals from a plurality of audio sources disposed around the microphones; and an audio signal processor coupled to said microphones, and configured to providing a demixed audio signal output; the audio signal processor comprising; at least one analog-to-digital converter to digitise said signals received by said microphones to provide digital time-domain signals; and a digital filter to filter said digital time-domain signals in the time domain in accordance with a set of filter coefficients to provide said demixed audio signal output; the audio signal processor further comprising; a time-to-frequency domain converter to divide said digital time-domain signals into time segments and to convert said digital time-domain signals in said time segments into the frequency domain to generate time-frequency domain data; a blind source separation module, to perform audio signal demixing on said time-frequency domain data to determine a demixing matrix for at least one of said audio sources, wherein said set of filter coefficients is determined by said demixing matrix and is determined asynchronously in said time-frequency domain; and
wherein said audio signal processor is further configured to;process said demixing matrix, in view of a frequency and phase response of each microphone, determined from the known geometry of the microphone, to select one or more said audio sources responsive to a phase correlation determined from said demixing matrix. - View Dependent Claims (19, 20)
-
Specification