Sound source localization method and sound source localization apparatus based coherence-to-diffuseness ratio mask
First Claim
1. A sound source localization method implemented by execution of a processor of a sound source localization apparatus, comprising steps of:
- (a) receiving a mixed signal of a target sound source signal and a noise signal through multiple microphones including at least two microphones;
(b) generating a mask based on a diffuseness reflecting information on a target sound source and a noise source by using the mixed signal;
(c) pre-processing the mixed signal received to the multiple microphones by using the generated mask; and
(d) estimating a direction for the target sound source by performing a predetermined algorithm on the pre-processed mixed signal.
1 Assignment
0 Petitions
Accused Products
Abstract
Provided is a sound source localization method including steps of: (a) receiving a mixed signal of a target sound source signal and noise and echo signals through multiple microphones including at least two microphones; (b) generating a binarized mask based on a diffuseness by using a coherence-to-diffuseness ratio CDR, which is information on the target sound source and the noise source, by using the input signal; (c) pre-processing an input signal to multiple microphones by using the generated binarized mask; and (d) performing a predetermined algorithm such as the GCC-PHAT or the SRP-PHAT on the pre-processed input signal to estimate a direction of the target sound source.
4 Citations
11 Claims
-
1. A sound source localization method implemented by execution of a processor of a sound source localization apparatus, comprising steps of:
-
(a) receiving a mixed signal of a target sound source signal and a noise signal through multiple microphones including at least two microphones; (b) generating a mask based on a diffuseness reflecting information on a target sound source and a noise source by using the mixed signal; (c) pre-processing the mixed signal received to the multiple microphones by using the generated mask; and (d) estimating a direction for the target sound source by performing a predetermined algorithm on the pre-processed mixed signal. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A sound source localization apparatus having a processor and being operable to estimate a direction of a target sound source by using signals input from multiple microphones by execution of the processor, comprising:
-
a mixed signal input module which is connected to the multiple microphones and receives mixed signals of a target sound source signal and a noise signal from the multiple microphones; a mask generation module which generates and outputs a binarized mask based on a diffuseness by using the mixed signal provided from the mixed signal input module; an input signal pre-processing module which receives the binarized mask from the mask generation module, pre-processes the mixed signal by applying the binarized mask to the mixed signal provided from the mixed signal input module, and outputs the pre-processed mixed signal; and a target direction estimation module which receives the pre-processed mixed signal from the input signal pre-processing module, estimates a direction of the target sound source by performing a predetermined localization algorithm on the pre-processed mixed signal, and outputs the estimated direction. - View Dependent Claims (9, 10, 11)
-
Specification