Predictive Determination
First Claim
1. A method for predicting a gesture made by a user to a first application, comprising:
- receiving image data captured by a camera and sound data captured by a microphone, wherein the image data is representative of a gesture performed by the user and the sound data is representative of a sound made by the user;
applying a filter to the image data to interpret the gesture, wherein the sound data at least one of;
augments, distinguishes or clarifies the gesture;
determining, from the applied filter, an output corresponding to the gesture being performed; and
sending the first application the output.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems, methods and computer readable media are disclosed for a gesture recognizer system architecture. A recognizer engine is provided, which receives user motion data and provides that data to a plurality of filters. A filter corresponds to a gesture, that may then be tuned by an application receiving information from the gesture recognizer so that the specific parameters of the gesture—such as an arm acceleration for a throwing gesture—may be set on a per-application level, or multiple times within a single application. Each filter may output to an application using it a confidence level that the corresponding gesture occurred, as well as further details about the user motion data.
48 Citations
20 Claims
-
1. A method for predicting a gesture made by a user to a first application, comprising:
-
receiving image data captured by a camera and sound data captured by a microphone, wherein the image data is representative of a gesture performed by the user and the sound data is representative of a sound made by the user; applying a filter to the image data to interpret the gesture, wherein the sound data at least one of;
augments, distinguishes or clarifies the gesture;determining, from the applied filter, an output corresponding to the gesture being performed; and sending the first application the output. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system for predicting a gesture made by a user to a first application, comprising:
-
a processor; a gesture library comprising at least one filter to interpret the gesture; and a gesture recognizer engine that; receives image data captured by a camera and sound data captured by a microphone, wherein the image data is representative of a gesture performed by the user and the sound data is representative of a sound made by the user; determines an output from the filter based on the image data, wherein the sound data at least one of;
augments, distinguishes or clarifies the gesture; andsends the application the output before receiving data corresponding to the gesture being completed. - View Dependent Claims (15, 16)
-
-
17. A computer readable storage medium, comprising computer readable instructions that when executed on a processor, cause the processor to perform the operations of:
-
receiving from a first application of the a plurality of applications a value for at least one parameter; receiving image data captured by a depth camera and sound data captured by a microphone, wherein the image data is representative of a gesture performed by the user and the sound data is representative of a sound made by the user; applying the filter to the image data to interpret the gesture, wherein the sound data at least one of;
augments, distinguishes or clarifies the gesture;determining a confidence level that the image and sound data is indicative of at least one gesture; and sending the first application an indication of at least one gesture with its associated confidence level. - View Dependent Claims (18, 19, 20)
-
Specification