Speech processing system, speech processing method, speech processing program, vehicle including speech processing system on board, and microphone placing method
First Claim
1. A speech processing system comprising:
- at least one hardware processor configured to implement;
a linear microphone array comprising a plurality of microphones, arranged on a straight line, each of which inputs speech of a speaker of interest and noise from a noise source region comprising a plurality of noise sources, and outputs a mixture signal comprising said speech and said noise; and
a noise suppressor that suppresses said noise based on said mixture signals,wherein a direction of arranging the plurality of microphones in said linear microphone array is determined such that said straight line is perpendicular to a longitudinal axis of said noise source region, anda distance, between said linear microphone array and said noise source region, and a tilt of a placement plane of said microphone array are adjusted so as to decrease differences between respective distances from each of noise sources in said noise source region to the plurality of microphones in said linear microphone array.
1 Assignment
0 Petitions
Accused Products
Abstract
A system of this invention is directed to a speech processing system that efficiently performs noise suppression processing for a plurality of noise sources spreading in a lateral direction with respect to a speaker of interest. The speech processing system includes a microphone array including a plurality of microphones, each of which inputs a sound mixture including speech of a speaker of interest and noise from a noise source region including a plurality of noise sources placed in a lateral direction with respect to the speaker of interest, and outputs a mixture signal including a speech signal and a noise signal, the plurality of microphones being arranged such that a difference between respective distances from the plurality of microphones to the speaker of interest becomes different from a difference between respective distances from the plurality of microphones to the noise source region, and a noise suppressor that suppresses the noise based on the mixture signals output from the plurality of microphones.
29 Citations
15 Claims
-
1. A speech processing system comprising:
-
at least one hardware processor configured to implement; a linear microphone array comprising a plurality of microphones, arranged on a straight line, each of which inputs speech of a speaker of interest and noise from a noise source region comprising a plurality of noise sources, and outputs a mixture signal comprising said speech and said noise; and a noise suppressor that suppresses said noise based on said mixture signals, wherein a direction of arranging the plurality of microphones in said linear microphone array is determined such that said straight line is perpendicular to a longitudinal axis of said noise source region, and a distance, between said linear microphone array and said noise source region, and a tilt of a placement plane of said microphone array are adjusted so as to decrease differences between respective distances from each of noise sources in said noise source region to the plurality of microphones in said linear microphone array. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A speech processing system comprising:
-
at least one hardware processor configured to implement; a first microphone that is placed on a ceiling in a vehicle, inputs a sound mixture comprising noise from a noise source region comprising a plurality of noise sources and a voice of a passenger of the vehicle, and outputs a first mixture signal; a second microphone that is placed on the ceiling in the vehicle such that a straight line, on which the first microphone and the second microphone are placed, of a linear microphone array, comprising the first microphone and the second microphone, is perpendicular to a longitudinal axis of said noise source region, inputs a sound mixture comprising the noise from the noise source region and the voice of the passenger of the vehicle, and outputs a second mixture signal; and a noise suppressor that outputs an enhanced speech signal based on the first mixture signal and the second mixture signal, wherein a distance, between said linear microphone array and said noise source region, and a tilt of a placement plane of said microphone array are adjusted so as to decrease differences between respective distances from each of noise sources in said noise source region to the first microphone and the second microphone in said linear microphone array. - View Dependent Claims (12)
-
-
13. A microphone placing method comprising:
-
arranging on a straight line a plurality of microphones, each of which inputs a sound mixture, comprising speech of a speaker of interest and noise from a noise source region comprising a plurality of noise sources, and outputs a mixture signal comprising a speech signal and a noise signal, wherein the plurality of microphones are of a linear microphone array and are arranged on the straight line perpendicular to a longitudinal axis of said noise source region, and a distance, between said linear microphone array and said noise source region, and a tilt of a placement plane of said microphone array are adjusted so as to decrease differences between respective distances from each of noise sources in said noise source region to the plurality of microphones in said linear microphone array.
-
-
14. A speech processing method comprising:
at least one hardware processor configured to implement; selecting microphones, to output a plurality of mixture signals comprising a speech signal and a noise signal, out of a plurality of microphones, each of which inputs a sound mixture, comprising speech of the speaker of interest and noise from a noise source region comprising a plurality of noise sources, and outputs the mixture signal, the selected microphones are of a linear microphone array and are arranged on a straight line perpendicular to a longitudinal axis of said noise source region; and suppressing the noise based on the mixture signals output from the selected microphones, and wherein a distance, between said linear microphone array and said noise source region, and a tilt of a placement plane of said microphone array are adjusted so as to decrease differences between respective distances from each of noise sources in said noise source region to the plurality of microphones in said linear microphone array.
-
15. A non-transitory computer readable storage medium storing a speech processing program for causing a computer to execute a method, comprising:
implementing by at least one hardware processor; selecting microphones, to output a plurality of mixture signals comprising a speech signal and a noise signal, out of a plurality of microphones, each of which inputs a sound mixture, comprising speech of the speaker of interest and noise from a noise source region comprising a plurality of noise sources, and outputs the mixture signal, the selected microphones are of a linear microphone array and are arranged on a straight line perpendicular to a longitudinal axis of said noise source region; and suppressing the noise based on the mixture signals output from the selected microphones, wherein a distance, between said linear microphone array and said noise source region, and a tilt of a placement plane of said microphone array are adjusted so as to decrease differences between respective distances from each of noise sources in said noise source region to the plurality of microphones in said linear microphone array.
Specification