Microphone array steering with image-based source location
First Claim
Patent Images
1. A method of focusing a microphone array, comprising:
- receiving an audio signal from the microphone array, the microphone array disposed a distance away from a first component of an audio beam forming system;
receiving a visual image from a plurality of images forming a video segment from an optical image sensor having a known position relative to the microphone array, the visual image including the first component within a field of view (FOV);
determining positional information for the first component, relative to the microphone array by analyzing the visual image, and by determining a distance between the first component and the image sensor based on a predetermined size of the first component;
tracking a change in the position of the first component relative to the image sensor based on changes in the visual image within the video segment; and
filtering the audio signal to emphasize or de-emphasize audio sources proximate to the first component based on a beam forming algorithm that is a function of the positional information and on the change in the position of the first component.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems for beam forming an audio signal based on a location of an object relative to the listening device, the location being determined from positional data deduced from an optical image including the object. In an embodiment, an object'"'"'s position is tracked based on video images of the object and the audio signal received from a microphone array located at a fixed position is filtered based on the tracked object position. Beam forming techniques may be applied to emphasize portions of an audio signal associated with sources near the object.
42 Citations
17 Claims
-
1. A method of focusing a microphone array, comprising:
-
receiving an audio signal from the microphone array, the microphone array disposed a distance away from a first component of an audio beam forming system; receiving a visual image from a plurality of images forming a video segment from an optical image sensor having a known position relative to the microphone array, the visual image including the first component within a field of view (FOV); determining positional information for the first component, relative to the microphone array by analyzing the visual image, and by determining a distance between the first component and the image sensor based on a predetermined size of the first component; tracking a change in the position of the first component relative to the image sensor based on changes in the visual image within the video segment; and filtering the audio signal to emphasize or de-emphasize audio sources proximate to the first component based on a beam forming algorithm that is a function of the positional information and on the change in the position of the first component. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system, comprising:
-
a computing platform; a microphone array coupled to the computing platform to receive an audio signal; and an optical image sensor coupled to the computing platform, wherein the computing platform further comprises; an object tracking module to determine positional information for a first component and for a second component of an audio beam steering system relative to the microphone array by analyzing an image signal from the optical image sensor, the image signal including the first component and the second component within a field of view (FOV) of the image sensor; and an audio signal processor to emphasize or de-emphasize audio sources proximate to the first component based on a beam forming algorithm that is a function of the positional information for the first component, and to steer the received audio signal from the first component to the second component based on the positional information for the second component. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
Specification