×

Speaker detection and tracking using audiovisual data

  • US 7,692,685 B2
  • Filed: 03/31/2005
  • Issued: 04/06/2010
  • Est. Priority Date: 06/27/2002
  • Status: Expired due to Fees
First Claim
Patent Images

1. An object tracker system, comprising:

  • a processor that executes the following computer executable components stored on a computer readable medium;

    an audio model component that models an original audio signal of an object, a time delay between at least two audio input signals and a variability component of the original audio signal, the audio model employing a probabilistic generative model;

    a video model component that models a location of the object, an original image of the object and a variability component of the original image, the video model employing a probabilistic generative model, the video model receiving a video input; and

    an audio video tracker component that models the location of the object based, at least in part, upon the audio model and the video model, wherein the audio video tracker provides an output associated with the location of the object based on, at least in past, a linear mapping that approximates the location of the object, wherein the linear mapping is computed as a function of the time delay between the at least two audio input signals, wherein error in approximating the location of the object is modeled by a zero mean Gaussian distribution associated with a precision matrix, and wherein the zero mean Gaussian distribution associated with the precision matrix is based on, at least in part;

    a product of a horizontal position of the object and a difference in horizontal position of a first audio input device and a second audio input device;

    a product of a vertical position of the object and a difference in vertical position of the first audio input device and the second audio input device; and

    a precision matrix of an approximation error modeled by a zero mean Gaussian.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×