Object sound extraction apparatus and object sound extraction method
First Claim
1. An object sound extraction apparatus comprising:
- a main sound input section for mainly inputting an object sound generated by a predetermined object sound source and outputting a main acoustic signal;
sub voice input sections for mainly inputting one or more reference sounds generated by one or more sound sources other than the object sound source and outputting sub acoustic signals;
sound source separation sections for performing a sound source separation processing for separating and generating an object sound separation signal corresponding to the object sound and reference sound separation signals corresponding to the one or more reference sounds other than the object sound based on each combination of the main acoustic signal and the sub acoustic signals;
an object sound separation signal synthesis section for synthesizing the object sound separation signals and outputting a synthesis signal; and
a spectrum subtraction processing section for extracting an acoustic signal corresponding to the object sound from the synthesis signal by performing a spectrum subtraction processing between the synthesis signal and the reference sound separation signals, and outputting an extracted signal corresponding to the acoustic signal.
1 Assignment
0 Petitions
Accused Products
Abstract
An object sound extraction apparatus includes sound source separation sections for separating and generating an object sound separation signal corresponding to an object sound and reference sound separation signals corresponding to the other reference sound based on each combination of a main acoustic signal and sub acoustic signals, an object sound separation signal synthesis section for synthesizing the object sound separation signals, and a spectrum subtraction processing section for extracting an acoustic signal corresponding to the object sound from the synthesis signal by performing a spectrum subtraction processing between the synthesis signal and the reference sound separation signals. Accordingly, in acoustic environments where the object sound and the noises are mixed in the acoustic signals obtained via the microphones, and the mixed conditions can vary, a high object sound extraction performance can be ensured by a small object sound extraction apparatus.
11 Citations
33 Claims
-
1. An object sound extraction apparatus comprising:
-
a main sound input section for mainly inputting an object sound generated by a predetermined object sound source and outputting a main acoustic signal; sub voice input sections for mainly inputting one or more reference sounds generated by one or more sound sources other than the object sound source and outputting sub acoustic signals; sound source separation sections for performing a sound source separation processing for separating and generating an object sound separation signal corresponding to the object sound and reference sound separation signals corresponding to the one or more reference sounds other than the object sound based on each combination of the main acoustic signal and the sub acoustic signals; an object sound separation signal synthesis section for synthesizing the object sound separation signals and outputting a synthesis signal; and a spectrum subtraction processing section for extracting an acoustic signal corresponding to the object sound from the synthesis signal by performing a spectrum subtraction processing between the synthesis signal and the reference sound separation signals, and outputting an extracted signal corresponding to the acoustic signal. - View Dependent Claims (4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
2. An object sound extraction apparatus comprising:
-
a main sound input section for mainly inputting an object sound generated by a predetermined object sound source and outputting a main acoustic signal; sub voice input sections for mainly inputting one or more reference sounds generated by one or more sound sources other than the object sound source and outputting sub acoustic signals; sound source separation sections for performing a sound source separation processing for separating and generating an object sound separation signal corresponding to the object sound based on each combination of the main acoustic signal and the sub acoustic signals; and a spectrum approximate signal extraction section for extracting an acoustic signal corresponding to the object sound from the object sound extraction signals and outputting an extracted signal corresponding to the acoustic signal by dividing the object sound separation signals into signal components of each of a plurality of frequency bands, and extracting signal components that satisfy a predetermined approximation condition between the object sound separation signals. - View Dependent Claims (16, 18, 20, 22, 24, 26, 28, 30, 32)
-
-
3. An object sound extraction apparatus comprising:
-
a main sound input section for mainly inputting an object sound generated by a predetermined object sound source and outputting a main acoustic signal; sub voice input sections for mainly inputting one or more reference sounds generated by one or more sound sources other than the object sound source and outputting sub acoustic signals; sound source separation sections for performing a sound source separation processing for separating and generating a reference sound separation signal corresponding to the one or more reference sounds other than the object sound based on each combination of the main acoustic signal and the sub acoustic signals; and a spectrum subtraction processing section for extracting an acoustic signal corresponding to the object sound from the main acoustic signal and outputting an extracted signal corresponding to the acoustic signal by performing a spectrum subtraction processing between the reference sound separation signals separated and generated by the main acoustic signal and the sound source separation sections. - View Dependent Claims (17, 19, 21, 23, 25, 27, 29, 31, 33)
-
-
13. An object sound extraction method comprising:
-
a main sound input processing for mainly inputting an object sound generated by a predetermined object sound source and outputting a main acoustic signal; a sub voice input processing for mainly inputting one or more reference sounds generated by one or more sound sources other than the object sound source and outputting sub acoustic signals; a sound source separation processing for performing a sound source separation processing for separating and generating an object sound separation signal corresponding to the object sound and reference sound separation signals corresponding to the one or more reference sounds other than the object sound based on each combination of the main acoustic signal and the sub acoustic signals; an object sound separation signal synthesis processing for synthesizing the object sound separation signals and outputting a synthesis signal; and a spectrum subtraction processing for extracting an acoustic signal corresponding to the object sound from the synthesis signal by performing a spectrum subtraction processing between the synthesis signal and the reference sound separation signals, and outputting an extracted signal corresponding to the acoustic signal.
-
-
14. An object sound extraction method comprising:
-
a main sound input processing for mainly inputting an object sound generated by a predetermined object sound source and outputting a main acoustic signal; a sub voice input processing for mainly inputting one or more reference sounds generated by one or more sound sources other than the object sound source and outputting sub acoustic signals; a sound source separation processing for performing a sound source separation processing for separating and generating an object sound separation signal corresponding to the object sound based on each combination of the main acoustic signal and the sub acoustic signals; and a spectrum approximate signal extraction processing for extracting an acoustic signal corresponding to the object sound from the object sound extraction signals and outputting an extracted signal corresponding to the acoustic signal by dividing the object sound separation signals into signal components of each of a plurality of frequency bands, and extracting signal components that satisfy a predetermined approximation condition between the object sound separation signals.
-
-
15. An object sound extraction method comprising:
-
a main sound input processing for mainly inputting an object sound generated by a predetermined object sound source and outputting a main acoustic signal; a sub voice input processing for mainly inputting one or more reference sounds generated by one or more sound sources other than the object sound source and outputting sub acoustic signals; a sound source separation processing for performing a sound source separation processing for separating and generating a reference sound separation signal corresponding to the one or more reference sounds other than the object sound based on each combination of the main acoustic signal and the sub acoustic signals; and a spectrum subtraction processing for extracting an acoustic signal corresponding to the object sound from the main acoustic signal and outputting an extracted signal corresponding to the acoustic signal by performing a spectrum subtraction processing between the reference sound separation signals separated and generated by the main acoustic signal and the sound source separation processing.
-
Specification