Speech separation with microphone arrays
First Claim
1. A computer-implemented audio blind source separation system, comprising:
- a frequency transform component for transforming a plurality of sensor signals to a corresponding plurality of frequency domain sensor signals, the plurality of sensor signals received from a plurality of input sensors; and
,a frequency domain blind source separation component for estimating a plurality of source signals for each of a plurality of frequency bands based on the plurality of frequency domain sensor signals and processing matrices computed independently for each of the plurality of frequency bands; and
a maximum attenuation based de-permutation component for obtaining modified permutations of the processing matrices based upon a maximum-magnitude based de-permutation scheme,wherein the system provides estimates of the plurality of source signals based on the plurality of frequency domain sensor signals and the modified permutations of the processing matrices.
2 Assignments
0 Petitions
Accused Products
Abstract
A system that facilitates blind source separation in a distributed microphone meeting environment for improved teleconferencing. Input sensor (e.g., microphone) signals are transformed to the frequency-domain and independent component analysis is applied to compute estimates of frequency-domain processing matrices. Modified permutations of the processing matrices are obtained based upon a maximum magnitude based de-permutation scheme. Estimates of the plurality of source signals are provided based upon the modified frequency-domain processing matrices and input sensor signals.
Optionally, segments during which the set of active sources is a subset of the set of all sources can be exploited to compute more accurate estimates of frequency-domain mixing matrices. Source activity detection can be applied to determine which speaker(s), if any, are active. Thereafter, a least squares post-processing of the frequency-domain independent components analysis outputs can be employed to adjust the estimates of the source signals based on source inactivity.
27 Citations
20 Claims
-
1. A computer-implemented audio blind source separation system, comprising:
-
a frequency transform component for transforming a plurality of sensor signals to a corresponding plurality of frequency domain sensor signals, the plurality of sensor signals received from a plurality of input sensors; and
,a frequency domain blind source separation component for estimating a plurality of source signals for each of a plurality of frequency bands based on the plurality of frequency domain sensor signals and processing matrices computed independently for each of the plurality of frequency bands; and a maximum attenuation based de-permutation component for obtaining modified permutations of the processing matrices based upon a maximum-magnitude based de-permutation scheme, wherein the system provides estimates of the plurality of source signals based on the plurality of frequency domain sensor signals and the modified permutations of the processing matrices. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-implemented method of blindly separating a plurality of source signals, comprising:
-
receiving a plurality of input sensor signals; transforming the input sensor signals to a corresponding plurality of frequency-domain sensor signals using a short-time Fourier transform; and computing estimates of the plurality of source signals for each of a plurality of frequency bands based upon the plurality of frequency-domain sensor signals and processing matrices computed independently for each of the plurality of frequency bands; and obtaining modified permutations of the processing matrices based upon a maximum magnitude based de-permutation scheme. - View Dependent Claims (12, 13, 14)
-
-
15. A computer-implemented method of blindly separating a plurality of source signals, comprising:
-
determining source activity information specifying which two or more sources are active at a plurality of times; and
,modifying processing matrices based upon a least squares estimation of the processing matrices and the source activity information. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification