×

Speech recognition apparatus, speech recognition apparatus and program thereof

  • US 7,478,041 B2
  • Filed: 03/12/2003
  • Issued: 01/13/2009
  • Est. Priority Date: 03/14/2002
  • Status: Active Grant
First Claim
Patent Images

1. A speech recognition apparatus comprising:

  • a microphone array, comprising at least three microphones, each microphone measuring a delay and a sum of peak power for each of a plurality of angles from a horizontal axis and from a vertical axis in response to a white noise source located at a plurality of locations about said microphone array;

    a first directional sound source profile database for storing a plurality of first directional sound source profiles, each of said plurality of first directional sound source profiles determining a first direction sound source profile for each of said plurality of locations based on said measuring;

    a target location for said microphone array, where a voice and noise are recorded;

    a noise suppressor, receiving a voice signal and a noise signal recorded at said target location by said microphone array, said noise suppressor comprising;

    an array of delay and sum units, each delay and sum unit introducing a different delay from a range of negative and positive delays into said recording of said voice and said noise signal and producing a sum of peak power for said voice signal associated with each of said plurality of angles from said horizontal axis and with each of said plurality of angles from said vertical axis;

    wherein said voice signal associated with an angle of said horizontal axis and an angle of said vertical axis, corresponding to said target location, produces a maximal in-phase sum of peak power signal associated with said target location;

    an array of Fourier transform units, each Fourier transform unit corresponding to one of said array of delay and sum units and converting said voice signal from said one of said array of delay and sum units to a voice power distribution for each of a plurality of frequency bands correspondingly associated with each of said plurality of angles from said horizontal axis and from said vertical axis;

    an array of second profile fitting units, each said second profile fitting unit approximately decomposing said voice power distribution for each of said plurality of frequency bands, received from each Fourier transform units, providing a number of second profiles corresponding to said plurality of frequency bands, and selecting one of said second profiles based on correlating each of said voice power distributions that are approximately decomposed to each of said plurality of first directional sound source profiles, stored in said first directional sound source profile database, to one direction corresponding to said voice recorded at said target location;

    wherein said approximately decomposing comprises evaluating a directional target voice profile that equals a weighted sum of a first directional sound source profile for said white noise source in said one direction of said target location and a non-directional noise profile;

    wherein a weight coefficient of said first directional sound source profile and a weight coefficient for said non-directional noise profile are obtained by minimizing an evaluative function; and

    wherein a power of only a voice signal, without noise components, is determined for each of said plurality of frequency bands, based on said weight coefficient of said first directional sound source profile and said weight coefficient for said non-directional noise profile;

    a spectrum reconstruction unit that receives said power of only a voice signal for each of said plurality of frequency bands for reconstructing said power of only a voice recorded at said target location; and

    an output device that outputs said reconstruction of said power of only a voice recorded at said target location as a voice recording, without noise, from said target location.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×