×

Method for detecting voice section from time-space by using audio and video information and apparatus thereof

  • US 9,431,029 B2
  • Filed: 02/10/2010
  • Issued: 08/30/2016
  • Est. Priority Date: 02/27/2009
  • Status: Active Grant
First Claim
Patent Images

1. A method for detecting a time-space voice section using audio and video information, comprising:

  • detecting a voice section from an audio signal input to a microphone array;

    performing speaker verification by comparing the detected voice section to a voice model constructed in advance;

    detecting a speaker'"'"'s face by using a video signal input to a camera in response to the verification of the voice section;

    estimating a speaker'"'"'s face direction based on the direction of the speaker'"'"'s face in the video signal; and

    determining the detected voice section as a speaker'"'"'s voice section for voice recognition when the estimated face direction matches a previously stored reference direction so that voices generated from speakers that do not match the previously stored reference direction are not recognized,wherein the detecting the voice section includes;

    estimating a position of a sound source by using the audio signal input to the microphone array;

    distinguishing noise by comparing the estimated position of the sound source and the previously stored reference direction with each otherremoving the distinguished noise; and

    detecting a voice section on the basis of a single microphone in the signal of which the noise is removed.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×