Sound source separation apparatus and sound source separation method
First Claim
1. A sound source separation apparatus, comprising:
- a plurality of sound input means, into which a plurality of mixed sound signals in which sound source signals from a plurality of sound sources are superimposed are inputted;
an SIMO-ICA process means, separating and generating SIMO signals each of which corresponds to at least one of the sound source signals from the plurality of mixed sound signals by a sound source separation process of a blind source separation method based on an independent component analysis method;
a sound source direction estimation means, estimating sound source directions which are directions in which the sound sources are present, respectively, based on a separating matrix calculated by a learning calculation executed in the sound source separation process of the blind source separation method based on the independent component analysis method in the SIMO-ICA process means;
a beamformer process means,applying, to each of the SIMO signals separated and generated in the SIMO-ICA process means, a beamformer process of enhancing, according to each of plurally sectioned frequency components, a sound component from each of the sound source directions estimated by the sound source estimation means, andoutputting beamformer processed sound signals;
an intermediate process execution means,performing a predetermined intermediate process including a selection process or a synthesis process, according to each of the plurally sectioned frequency components, on the beamformer processed sound signals other than a specific beamformer processed sound signal with which a sound component from a specific sound source direction which is one of the sound source directions is enhanced for a specific SIMO-signal which is one of the SIMO signals, andoutputting an intermediate processed signal obtained thereby; and
an untargeted signal component elimination means,performing, on one signal in the specific SIMO signal, a process of comparing volumes of the specific beamformer processed sound signal and the intermediate processed signal according to each of the plurally sectioned frequency components and, when a comparison result meets a predetermined condition, of eliminating a signal of the corresponding frequency component, andgenerating a signal obtained thereby as a separated signal corresponding to one of the sound source signals.
1 Assignment
0 Petitions
Accused Products
Abstract
A sound source separation apparatus includes: an SIMO-ICA process unit, separating and generating an SIMO signal by the BSS method based on the ICA method; a sound source direction estimation unit, estimating a sound source direction based on a separating matrix, computed by a learning calculation of the BSS method based on the ICA method; a beamformer process unit, performing, on each SIMO signal, a beamformer process of enhancing, according to each frequency bin, a sound component from each sound source direction; an intermediate process unit, performing an intermediate process that includes performing a selection process, etc., according to each frequency bin on signals other than a specific signal among the beamformer processed sound signals; and an untargeted signal component elimination unit, eliminating noise signal components by comparing for one signal in the specific SIMO signal, volumes of the specific beam former processed sound signal and the intermediate processed signal according to each frequency bin.
42 Citations
14 Claims
-
1. A sound source separation apparatus, comprising:
-
a plurality of sound input means, into which a plurality of mixed sound signals in which sound source signals from a plurality of sound sources are superimposed are inputted; an SIMO-ICA process means, separating and generating SIMO signals each of which corresponds to at least one of the sound source signals from the plurality of mixed sound signals by a sound source separation process of a blind source separation method based on an independent component analysis method; a sound source direction estimation means, estimating sound source directions which are directions in which the sound sources are present, respectively, based on a separating matrix calculated by a learning calculation executed in the sound source separation process of the blind source separation method based on the independent component analysis method in the SIMO-ICA process means; a beamformer process means, applying, to each of the SIMO signals separated and generated in the SIMO-ICA process means, a beamformer process of enhancing, according to each of plurally sectioned frequency components, a sound component from each of the sound source directions estimated by the sound source estimation means, and outputting beamformer processed sound signals;
an intermediate process execution means,performing a predetermined intermediate process including a selection process or a synthesis process, according to each of the plurally sectioned frequency components, on the beamformer processed sound signals other than a specific beamformer processed sound signal with which a sound component from a specific sound source direction which is one of the sound source directions is enhanced for a specific SIMO-signal which is one of the SIMO signals, and outputting an intermediate processed signal obtained thereby; and an untargeted signal component elimination means, performing, on one signal in the specific SIMO signal, a process of comparing volumes of the specific beamformer processed sound signal and the intermediate processed signal according to each of the plurally sectioned frequency components and, when a comparison result meets a predetermined condition, of eliminating a signal of the corresponding frequency component, and generating a signal obtained thereby as a separated signal corresponding to one of the sound source signals. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A sound source separation method comprising:
-
a plurality of sound input steps of inputting a plurality of mixed sound signals in which sound source signals from a plurality of sound sources are superimposed; an SIMO-ICA process step of separating and generating SIMO signals each of which corresponds to at least one of the sound source signals from the plurality of mixed sound signals by a sound source separation process of a blind source separation method based on an independent component analysis method; a sound source direction estimating step of estimating sound source directions which are directions in which the sound sources are present, respectively, based on a separating matrix calculated by a learning calculation executed in the sound source separation process of the blind source separation method based on the independent component analysis method in the SIMO-ICA process step; a beamformer process step of applying, to each of the SIMO signals separated and generated in the SIMO-ICA process step, a beamformer process of enhancing, according to each of plurally sectioned frequency components, a sound component from each of the sound source directions estimated by the sound source estimation step, and outputting beamformer processed sound signals; an intermediate process execution step of performing a predetermined intermediate process including a selection process or a synthesis process, according to each of the plurally sectioned frequency components, on the beamformer processed sound signals other than a specific beamformer processed sound signal with which a sound component from a specific sound source direction, which is one of the sound source directions is enhanced for a specific SIMO signal which is one of the SIMO signals, and outputting an intermediate processed signal obtained thereby; and an untargeted signal component elimination step of performing, on one signal in the specific SIMO signal, a process of comparing volumes of the specific beamformer processed sound signal and the intermediate processed signal according to each of the plurally sectioned frequency components and, when a comparison result meets a predetermined condition, of eliminating a signal of the corresponding frequency component, and generating a signal obtained thereby as a separated signal corresponding to one of the sound source signals. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
Specification