
Associating Audio with Three-Dimensional Objects in Videos

  • US 20170366896A1
  • Filed: 06/20/2016
  • Published: 12/21/2017
  • Est. Priority Date: 06/20/2016
  • Status: Active Grant
First Claim

1. A method for locating and tracking one or more audio sources recorded by a plurality of microphones, the method comprising:

  • receiving positions and orientations for each of at least one camera;

    receiving positions for each of the plurality of microphones;

    receiving at least one video recorded by a camera;

    receiving a plurality of audio signals, each audio signal recorded by a microphone of the plurality of microphones;

    applying source separation to the plurality of audio signals to generate one or more audio source signals, each audio source signal having originated from a respective audio source of the one or more audio sources;

    estimating, for each audio source, a position associated with the audio source;

    estimating a position with computer vision for each of one or more visual objects based on a visual analysis of the at least one video and the at least one position of the at least one camera;

    matching each of the one or more audio sources to a corresponding visual object of the one or more visual objects based on the estimated position of the audio source and the estimated position of the visual object;

    tracking movement of the one or more visual objects to generate visual object position data associated with movement of the one or more visual objects; and

    storing audio source position data for each of the one or more audio sources based on the visual object position data associated with the visual object to which the audio source was matched.
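The matching and storing steps of the claim can be illustrated with a short sketch. The claim does not say how the matching is computed, so the code below assumes one plausible rule: a greedy one-to-one pairing by smallest Euclidean distance, with a cutoff beyond which a source stays unmatched. The function names, the `max_distance` parameter, and the sample identifiers are all hypothetical and do not come from the patent.

```python
import math
from typing import Dict, List, Optional, Tuple

Point = Tuple[float, float, float]


def match_sources_to_objects(
    audio_positions: Dict[str, Point],
    visual_positions: Dict[str, Point],
    max_distance: float = 1.0,  # assumed cutoff, not specified in the claim
) -> Dict[str, Optional[str]]:
    """Pair each audio source with the nearest unused visual object.

    Assumption: greedy nearest-neighbour matching, closest pairs first.
    A source with no object within max_distance maps to None.
    """
    # Every candidate (distance, audio id, visual id) pair, closest first.
    candidates = sorted(
        (math.dist(a_pos, v_pos), a_id, v_id)
        for a_id, a_pos in audio_positions.items()
        for v_id, v_pos in visual_positions.items()
    )
    matches: Dict[str, Optional[str]] = {a_id: None for a_id in audio_positions}
    used_objects = set()
    for distance, a_id, v_id in candidates:
        if distance > max_distance:
            break  # remaining candidates are even farther apart
        if matches[a_id] is None and v_id not in used_objects:
            matches[a_id] = v_id
            used_objects.add(v_id)
    return matches


def audio_position_data(
    matches: Dict[str, Optional[str]],
    visual_tracks: Dict[str, List[Point]],
) -> Dict[str, List[Point]]:
    """Copy each matched visual object's tracked positions to its audio source."""
    return {
        a_id: list(visual_tracks[v_id])
        for a_id, v_id in matches.items()
        if v_id is not None
    }


# Hypothetical sample data: estimated positions in a shared coordinate frame.
audio = {"speaker": (1.0, 0.1, 0.0), "guitar": (4.0, 2.0, 0.0)}
visual = {
    "person": (1.1, 0.0, 0.0),
    "instrument": (3.9, 2.1, 0.0),
    "chair": (8.0, 8.0, 0.0),
}
tracks = {
    "person": [(1.1, 0.0, 0.0), (1.5, 0.0, 0.0)],
    "instrument": [(3.9, 2.1, 0.0)],
    "chair": [(8.0, 8.0, 0.0)],
}

matches = match_sources_to_objects(audio, visual)
stored = audio_position_data(matches, tracks)
```

With this sample data, `speaker` pairs with `person` and `guitar` with `instrument`, and each audio source then inherits the tracked positions of its matched object. A globally optimal assignment (for example, the Hungarian algorithm) would be a reasonable alternative to the greedy rule when several sources and objects are close together.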
