
Audio-visual feature fusion and support vector machine useful for continuous speech recognition

  • US 7,472,063 B2
  • Filed: 12/19/2002
  • Issued: 12/30/2008
  • Est. Priority Date: 12/19/2002
  • Status: Expired due to Fees
First Claim

1. A method for recognizing speech by fusing audio and visual features, comprising
    generating an audio vector representing detected audio data of a speech utterance,
    detecting a face in a video data stream linked to the audio data of the speech utterance,
    applying a cascade of linear support vector machine classifiers to the detected face to locate a mouth region,
    generating vector data for the mouth region,
    training a hidden Markov model (HMM) by fusing audio and visual vector data with the HMM, and
    recognizing an input speech by extracting audio and visual features and by comparing the extracted audio and visual features with HMMs obtained at least in part through audio and visual fusion.
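
The mouth-location step of the claim can be illustrated with a minimal sketch of a classifier cascade. This is not the patented implementation: the names `LinearStage`, `cascade_accepts`, `locate_mouth`, and `fuse` are hypothetical, the weights are hand-picked rather than learned by training linear support vector machines, and plain concatenation stands in for whatever fusion the patent actually performs with the HMM. The sketch only shows the general idea that each stage computes a linear score and rejects a candidate window as soon as one stage fails.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class LinearStage:
    """One cascade stage: score = w . x + b (hypothetical container).

    In practice the weights and bias would come from a trained linear SVM;
    here they are supplied by hand for illustration.
    """
    weights: Sequence[float]
    bias: float
    threshold: float = 0.0

    def score(self, x: Sequence[float]) -> float:
        return sum(w * v for w, v in zip(self.weights, x)) + self.bias


def cascade_accepts(stages: List[LinearStage], x: Sequence[float]) -> bool:
    """Return True only if every stage scores x at or above its threshold."""
    for stage in stages:
        if stage.score(x) < stage.threshold:
            return False  # early rejection: later stages are not evaluated
    return True


def locate_mouth(stages: List[LinearStage],
                 candidates: Sequence[Sequence[float]]) -> Optional[int]:
    """Return the index of the first candidate window the cascade accepts."""
    for i, features in enumerate(candidates):
        if cascade_accepts(stages, features):
            return i
    return None


def fuse(audio_vec: Sequence[float], visual_vec: Sequence[float]) -> List[float]:
    """Concatenate audio and visual vectors (an assumed, simplest-case fusion)."""
    return list(audio_vec) + list(visual_vec)


# Toy data: two stages over 2-dimensional window features.
stages = [LinearStage([1.0, 0.0], 0.0), LinearStage([0.0, 1.0], -0.5)]
candidates = [[-1.0, 2.0], [1.0, 0.2], [1.0, 1.0]]
mouth_index = locate_mouth(stages, candidates)
fused = fuse([0.1, 0.2], [0.3])
```

With the toy data, the first window fails the first stage, the second window fails the second stage, and only the third window passes both. The fused vector would then be the kind of observation passed to HMM training and recognition.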
