Enhancing speech recognition using visual information

  • US 8,660,842 B2
  • Filed: 03/09/2010
  • Issued: 02/25/2014
  • Est. Priority Date: 03/09/2010
  • Status: Active Grant
First Claim

1. A method of performing speech recognition, comprising:

    capturing one or more images;

    extracting environmental features affecting reverberation of an audio signal or noise in the audio signal from the captured one or more images, the environmental features including at least a configuration of an enclosed area in which a speaker is located, the audio signal including the speaker's utterance;

    determining an environment adaptation parameter based on the extracted environment features;

    performing dereverberation or noise cancellation processing on the audio signal including the speaker's utterance based on the environment adaptation parameter; and

    producing speech elements by processing the processed audio signal.
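
The parameter-determination step of the claim can be illustrated with a minimal sketch. It assumes the image-analysis step has already yielded the dimensions of the enclosed area and an average surface absorption coefficient. The claim does not say how the environment adaptation parameter is computed. Sabine's reverberation-time formula is substituted here purely as an example, and the names `RoomConfig`, `sabine_rt60`, and `frame_energy_decay` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class RoomConfig:
    """Hypothetical result of image analysis: room geometry in meters."""
    length_m: float
    width_m: float
    height_m: float
    mean_absorption: float  # assumed average absorption coefficient, 0 < a <= 1


def sabine_rt60(room: RoomConfig) -> float:
    """Estimate reverberation time (seconds) with Sabine's formula.

    RT60 = 0.161 * V / (S * a), with V the room volume in m^3,
    S the total surface area in m^2, and a the mean absorption.
    This is an illustrative stand-in for the claimed
    'environment adaptation parameter', not the patent's method.
    """
    if not 0.0 < room.mean_absorption <= 1.0:
        raise ValueError("mean_absorption must be in (0, 1]")
    l, w, h = room.length_m, room.width_m, room.height_m
    volume = l * w * h
    surface = 2.0 * (l * w + l * h + w * h)
    return 0.161 * volume / (surface * room.mean_absorption)


def frame_energy_decay(rt60_s: float, frame_shift_s: float) -> float:
    """Per-frame energy decay factor implied by an RT60 value.

    Energy falls by 60 dB (a factor of 1e-6) over rt60_s seconds, so
    over one frame shift it falls by 10 ** (-6 * frame_shift_s / rt60_s).
    A dereverberation stage could use this factor to predict how much
    of an earlier frame's energy lingers in the current frame.
    """
    if rt60_s <= 0.0 or frame_shift_s <= 0.0:
        raise ValueError("rt60_s and frame_shift_s must be positive")
    return 10.0 ** (-6.0 * frame_shift_s / rt60_s)


# Example: a 5 m x 4 m x 3 m room with assumed mean absorption 0.2.
room = RoomConfig(length_m=5.0, width_m=4.0, height_m=3.0, mean_absorption=0.2)
rt60 = sabine_rt60(room)
decay = frame_energy_decay(rt60, frame_shift_s=0.010)
```

In this sketch a larger or more reflective room gives a longer `rt60` and a decay factor closer to 1, which a downstream dereverberation stage could read as a cue to suppress late reverberation more aggressively.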
