METHOD AND APPARATUS FOR USING IMAGE DATA TO AID VOICE RECOGNITION
First Claim
1. A method performed by a device for using image data to aid in voice recognition, the method comprising:
- capturing image data of a vicinity of the device; and
adjusting, based on the image data, a set of parameters for voice recognition performed by the device.
2 Assignments
0 Petitions
Accused Products
Abstract
A device performs a method for using image data to aid voice recognition. The method includes the device capturing image data of a vicinity of the device and adjusting, based on the image data, a set of parameters for voice recognition performed by the device. The set of parameters for the device performing voice recognition include, but are not limited to: a trigger threshold of a trigger for voice recognition; a set of beamforming parameters; a database for voice recognition; and/or an algorithm for voice recognition, wherein the algorithm can include using noise suppression or using acoustic beamforming.
246 Citations
23 Claims
-
1. A method performed by a device for using image data to aid in voice recognition, the method comprising:
-
capturing image data of a vicinity of the device; and adjusting, based on the image data, a set of parameters for voice recognition performed by the device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method performed by a device for using image data to aid in voice recognition, the method comprising:
-
capturing image data; receiving first voice data spoken into the device from a first individual and second voice data spoken into the device from a second individual; associating the first voice data to the first individual and the second voice data to the second individual using the image data; translating, using a voice recognition process, the first voice data into a first written passage within a document and the second voice data into a second written passage within the document; associating the first written passage with the first individual using a first annotation within the document that identifies the first individual; and associating the second written passage with the second individual using a second annotation within the document that identifies the second individual. - View Dependent Claims (16, 17, 18)
-
-
15. The method of 14, wherein the first annotation comprises a first name, and the second annotation comprises a second name.
-
19. A device configured for using image data to aid in voice recognition, the device comprising:
-
a set of cameras configured for capturing image data; at least one acoustic transducer configured for receiving voice data; a voice recognition module for processing the received voice data; and a processor configured for; detecting a set of individuals within the image data; determining from the image data whether at least one person within the set of individuals is gazing at the device; and adapting processing by the voice recognition module of the voice data based on whether the at least one individual is gazing at the device. - View Dependent Claims (20, 21, 22, 23)
-
Specification