Listen to people you recognize
Abstract
Systems, devices, and methods are described for recognizing and focusing on at least one source of an audio communication as part of a communication including a video image and an audio communication derived from two or more microphones when a relative position between the microphones is known. In certain embodiments, linked audio and video focus areas providing location information for one or more sound sources may each be associated with different user inputs, and an input to adjust a focus in either the audio or video domain may automatically adjust the focus in the other domain.
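One way to picture the linked audio and video focus areas is as a two-way mapping between a horizontal position in the video image and an angle from the device. The sketch below is only an illustration under stated assumptions: it uses a simple pinhole camera model with a known horizontal field of view, and the function names `pixel_to_angle` and `angle_to_pixel` are hypothetical. The patent text quoted here does not specify this or any particular mapping.

```python
import math


def _focal_length_px(image_width: int, horizontal_fov_deg: float) -> float:
    """Focal length in pixels for an assumed pinhole camera."""
    return (image_width / 2.0) / math.tan(math.radians(horizontal_fov_deg) / 2.0)


def pixel_to_angle(x_pixel: float, image_width: int, horizontal_fov_deg: float) -> float:
    """Angle in degrees from the optical axis to an image column.

    Positive angles are to the right of the image centre.
    """
    focal_px = _focal_length_px(image_width, horizontal_fov_deg)
    offset_px = x_pixel - image_width / 2.0
    return math.degrees(math.atan(offset_px / focal_px))


def angle_to_pixel(angle_deg: float, image_width: int, horizontal_fov_deg: float) -> float:
    """Inverse mapping: image column that corresponds to a given angle."""
    focal_px = _focal_length_px(image_width, horizontal_fov_deg)
    return image_width / 2.0 + focal_px * math.tan(math.radians(angle_deg))


if __name__ == "__main__":
    # Hypothetical values: a 1920-pixel-wide frame and a 60-degree field of view.
    face_centre_x = 1400.0
    angle = pixel_to_angle(face_centre_x, 1920, 60.0)
    print(f"video focus at x={face_centre_x} maps to an audio angle of {angle:.1f} degrees")
    print(f"and back to x={angle_to_pixel(angle, 1920, 60.0):.1f}")
```

With a pair of functions like these, moving the video focus can update the audio angle and moving the audio angle can update the video focus, which is the kind of coupling the abstract describes.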
30 Claims
1. A method comprising:
processing, at a first mobile computing device, a video image and an audio communication associated with the video image, wherein the audio communication comprises at least two raw electronic audio signals created from at least two separate microphones, and wherein a relative position of the at least two separate microphones is known;
identifying at least one source of the audio communication from the processing of the video image as part of a visual identification of at least one source of the audio communication;
determining, based on the identifying of the at least one source of the audio communication, an angle from the first mobile computing device to the at least one source of the audio communication; and
contemporaneously displaying, on a display output of the first mobile computing device, (1) first location information associated with the visual identification of the at least one source of the audio communication overlaid on the video image and (2) second location information comprising the angle from the first mobile computing device to the at least one source of the audio communication.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15

16. A mobile computing device comprising:
a processor;
a display output for outputting video image, wherein the display output is coupled to the processor;
at least two separate microphones, wherein the at least two separate microphones are coupled to the processor; and
a memory coupled to the processor, wherein the memory comprises instructions that, when executed by the processor, cause the processor to:
process the video image and an audio communication associated with the video image, wherein the audio communication comprises at least two raw electronic audio signals created from the at least two separate microphones, and wherein a relative position of the at least two separate microphones is known;
identify at least one source of the audio communication from the processing of the video image as part of a visual identification of the at least one source of the audio communication;
determine, based on the identifying of the at least one source of the audio communication, an angle from the mobile computing device to the at least one source of the audio communication; and
contemporaneously display, on the display output (1) first location information associated with the visual identification of the at least one source of the audio communication overlaid on the video image and (2) second location information comprising the angle from the mobile computing device to the at least one source of the audio communication.
Dependent claims: 17, 18

19. A mobile computing device comprising:
means for processing video image and an audio communication associated with the video image, wherein the audio communication comprises at least two raw electronic audio signals created from at least two separate microphones, and wherein a relative position of the at least two separate microphones is known;
means for identifying at least one source of the audio communication from the processing of the video image as part of a visual identification of the at least one source of the audio communication;
means for determining, based on the identifying of the at least one source of the audio communication, an angle from the mobile computing device to the at least one source of the audio communication; and
means for contemporaneously displaying, on a display output of the mobile computing device (1) first location information associated with the visual identification of the at least one source of the audio communication overlaid on the video image and (2) second location information comprising the angle from the first mobile computing device to the at least one source of the audio communication.
Dependent claims: 20

21. A method of visual and audio identification of a sound source comprising:
capturing, by a far-side mobile device, a far-side video image and a far-side audio communication, wherein the far-side audio communication comprises at least two raw electronic audio signals created from at least two separate microphones integrated as part of the far-side mobile device, and wherein a relative position of the at least two separate microphones is known;
communicating the far-side video image and the far-side audio communication from the far-side mobile device to a near-side mobile device via a network;
processing the far-side video image and the far-side audio communication to identify at least one source of the far-side audio communication as part of a visual identification of the at least one source of the far-side audio communication;
determining, based on the identifying of the at least one source of the far-side audio communication, at least one angle from the far-side mobile device to the at least one source of the far-side audio communication;
processing the at least two raw electronic audio signals to (a) filter sounds received from outside the at least one angle from the far-side mobile device to the at least one source of the far-side audio communication and/or (b) to emphasize sounds received from the at least one angle from the far-side mobile device to the at least one source of the far-side audio communication; and
creating an output comprising (1) first far-side location information associated with the visual identification of the at least one source of the far-side audio communication overlaid on the far-side video image and (2) second far-side location information comprising the at least one angle from the far-side mobile device to the at least one source of the far-side audio communication.
Dependent claims: 22, 23, 24, 25, 26, 27, 28, 29, 30

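The step in claim 21 of emphasizing sounds received from a given angle can be illustrated with a delay-and-sum beamformer, a common textbook technique for two microphones at a known spacing. This is a minimal sketch and not the method the patent discloses: the claims do not name an algorithm, and the far-field model, the whole-sample delays, the 343 m/s speed of sound, and the names `steering_delay_samples` and `delay_and_sum` are all assumptions made for illustration.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # assumed value for air at room temperature


def steering_delay_samples(angle_deg: float, mic_spacing_m: float, sample_rate_hz: int) -> int:
    """Whole-sample lag of the left microphone relative to the right one.

    Assumes a far-field source; a positive angle means the source is on
    the right-microphone side, so the sound reaches the left microphone later.
    """
    delay_s = mic_spacing_m * math.sin(math.radians(angle_deg)) / SPEED_OF_SOUND_M_S
    return round(delay_s * sample_rate_hz)


def delay_and_sum(left, right, angle_deg, mic_spacing_m, sample_rate_hz):
    """Align the two raw signals for the chosen angle and average them.

    Sound from the steered angle adds coherently; sound from other
    angles is misaligned and is attenuated in the average.
    """
    shift = steering_delay_samples(angle_deg, mic_spacing_m, sample_rate_hz)
    n = min(len(left), len(right))
    output = []
    for i in range(n):
        j = i + shift  # left-channel sample that corresponds to right[i]
        left_sample = left[j] if 0 <= j < n else 0.0
        output.append(0.5 * (left_sample + right[i]))
    return output


if __name__ == "__main__":
    # Hypothetical setup: microphones 15 cm apart, 48 kHz sampling, and an
    # impulse arriving from +90 degrees (21 samples later at the left mic).
    right = [0.0] * 100
    left = [0.0] * 100
    right[30] = 1.0
    left[51] = 1.0
    on_target = delay_and_sum(left, right, 90.0, 0.15, 48000)
    off_target = delay_and_sum(left, right, -90.0, 0.15, 48000)
    print(max(on_target), max(off_target))
```

Steering toward the source keeps the impulse at full amplitude, while steering away leaves two half-amplitude copies. A practical system would likely use fractional delays or frequency-domain weights, which this sketch omits.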
Specification