Sound source separation device, speech recognition device, mobile telephone, sound source separation method, and program
First Claim
1. A sound source separation device for separating a sound source signal of a target sound source from a mixed sound which includes sound source signals emitted from a plurality of sound sources using at least two microphones arranged separately from each other comprising:
- beamforming means forperforming a first beamforming processing to attenuate a sound source signal arriving from a predetermined direction by performing computations using first coefficients on an output signal of said microphones, andperforming a second beamforming processing to attenuate a sound source signal arriving from a direction symmetrical to said predetermined direction with respect to a perpendicular line to a straight line connecting the two microphones by performing computations using second coefficients which are complex conjugate of said first coefficients in a frequency domain on the output signal of said microphone;
power computation means for computing power spectrum information with respect to each of sound source signals obtained by said beamforming means; and
target sound spectrum extraction means for extracting spectrum information of a target sound source based on a difference between the power spectrum information calculated by said power computation means.
1 Assignment
0 Petitions
Accused Products
Abstract
A sound source signal from a target sound source is allowed to be separated from a mixed sound which consists of sound source signals emitted from a plurality of sound sources without being affected by uneven sensitivity of microphone elements. A beamformer section 3 of a source separation device 1 performs beamforming processing for attenuating sound source signals arriving from directions symmetrical with respect to a perpendicular line to a straight line connecting two microphones 10 and 11 respectively by multiplying output signals from the microphones 10 and 11 after spectrum analysis by weighted coefficients which are complex conjugate to each other. Power computation sections 40 and 41 compute power spectrum information, and target sound spectrum extraction sections 50 and 51 extract spectrum information of a target sound source based on a difference between the power spectrum information.
58 Citations
13 Claims
-
1. A sound source separation device for separating a sound source signal of a target sound source from a mixed sound which includes sound source signals emitted from a plurality of sound sources using at least two microphones arranged separately from each other comprising:
-
beamforming means for performing a first beamforming processing to attenuate a sound source signal arriving from a predetermined direction by performing computations using first coefficients on an output signal of said microphones, and performing a second beamforming processing to attenuate a sound source signal arriving from a direction symmetrical to said predetermined direction with respect to a perpendicular line to a straight line connecting the two microphones by performing computations using second coefficients which are complex conjugate of said first coefficients in a frequency domain on the output signal of said microphone; power computation means for computing power spectrum information with respect to each of sound source signals obtained by said beamforming means; and target sound spectrum extraction means for extracting spectrum information of a target sound source based on a difference between the power spectrum information calculated by said power computation means. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A sound source separation method comprising:
-
a sound source signal receiving step of inputting sound source signals emitted from a plurality of sound sources to at least two microphones arranged separately from each other; a beamforming processing step of performing a first beamforming processing and a second beamforming processing to attenuate sound source signals arriving from predetermined directions symmetrical with respect to a perpendicular line to a straight line connecting two microphones respectively by performing computations using two weighted coefficients which are complex conjugate to each other in a frequency domain on an output signal of said microphone respectively; a power computation step of computing power spectrum information with respect to each of sound source signals obtained in said beamforming processing step; and a target sound spectrum extracting step of extracting spectrum information of a target sound source based on a difference between the power spectrum information calculated in said power computation step.
-
-
13. A tangible, non-transitory, computer-readable storage medium storing instructions that, when executed by a processor, cause the processor to perform a sound source separation method, comprising:
-
acquiring an output signal which includes sound source signals emitted from a plurality of sound sources are mixed from at least two microphones arranged separately from each other; performing a first beamforming processing and a second beamforming processing to attenuate the sound source signals arriving from predetermined directions symmetrical with respect to a perpendicular line to a straight line connecting the two microphones respectively by performing computations using two weighted coefficients which are complex conjugate to each other in a frequency domain on the acquired output signal; computing power spectrum information with respect to each of the sound source signals; and extracting spectrum information of a target sound source based on a difference between the computed power spectrum information.
-
Specification