ARCHITECTURE FOR CONTROLLING A COMPUTER USING HAND GESTURES
First Claim
1. A method of determining a command, comprising:
capturing an image of an object with a camera;
determining a gesture based at least partly upon the image;
detecting an audio input; and
determining, at one or more processors, the command based at least partly upon the gesture and the audio input.
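As an illustration of the final step of claim 1, the following minimal sketch shows one way a command could be derived from both a gesture and an audio input. It is not taken from the patent: the lookup table, the gesture labels, the spoken words, and the function name `determine_command` are all hypothetical, and the sketch assumes that gesture recognition and speech recognition have already produced simple string labels.

```python
from typing import Optional

# Hypothetical mapping from (gesture label, spoken word) pairs to commands.
# None of these labels come from the patent; they are illustrative only.
COMMAND_TABLE = {
    ("point", "close"): "close_window",
    ("point", "move"): "move_window",
    ("swipe_left", "next"): "next_page",
}


def determine_command(gesture: Optional[str], audio: Optional[str]) -> Optional[str]:
    """Return a command only when both modalities are present and form a known pair."""
    if gesture is None or audio is None:
        # Assumption for this sketch: the command depends on both inputs.
        return None
    # Normalize the recognized word so that " Close " and "close" match.
    return COMMAND_TABLE.get((gesture, audio.strip().lower()))
```

In this sketch a pointing gesture alone yields no command; the spoken word disambiguates what the gesture should do to the object it selects.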
Abstract
Architecture for implementing a perceptual user interface. The architecture comprises alternative modalities for controlling computer application programs and manipulating on-screen objects through hand gestures or a combination of hand gestures and verbal commands. The perceptual user interface system includes a tracking component that detects object characteristics of at least one of a plurality of objects within a scene, and tracks the respective object. Detection of object characteristics is based at least in part upon image comparison of a plurality of images relative to a coarse mapping of the images. A seeding component iteratively seeds the tracking component with object hypotheses based upon the presence of the object characteristics and the image comparison. A filtering component selectively removes the tracked object from the object hypotheses and/or at least one object hypothesis from the set of object hypotheses based upon predetermined removal criteria.
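The interplay of the seeding and filtering components described in the abstract can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the patented implementation: frames are plain lists of grayscale rows, "image comparison relative to a coarse mapping" is approximated by frame differencing averaged over fixed-size blocks, the removal criterion is a hypothetical count of consecutive frames without motion, and hypotheses stay in their grid cell rather than following an object. All names (`Hypothesis`, `coarse_motion`, `step`, `max_misses`) are invented for the example.

```python
from dataclasses import dataclass


@dataclass
class Hypothesis:
    cell: tuple      # (row, col) index in the coarse grid
    misses: int = 0  # consecutive frames with no motion in this cell


def coarse_motion(prev, curr, block=2, threshold=10):
    """Return the coarse-grid cells whose mean absolute frame difference exceeds threshold."""
    moving = set()
    rows, cols = len(curr), len(curr[0])
    for r in range(0, rows, block):
        for c in range(0, cols, block):
            diffs = [abs(curr[i][j] - prev[i][j])
                     for i in range(r, min(r + block, rows))
                     for j in range(c, min(c + block, cols))]
            if sum(diffs) / len(diffs) > threshold:
                moving.add((r // block, c // block))
    return moving


def step(hypotheses, prev, curr, max_misses=2):
    """Process one frame: seed new hypotheses, update existing ones, filter stale ones."""
    moving = coarse_motion(prev, curr)
    tracked = {h.cell for h in hypotheses}
    # Seeding: propose a hypothesis for every moving cell not yet tracked.
    for cell in sorted(moving - tracked):
        hypotheses.append(Hypothesis(cell))
    # Update: reset the miss counter on motion, otherwise increment it.
    for h in hypotheses:
        h.misses = 0 if h.cell in moving else h.misses + 1
    # Filtering: drop hypotheses that meet the (assumed) removal criterion.
    return [h for h in hypotheses if h.misses <= max_misses]
```

Calling `step` once per captured frame keeps the hypothesis set small: motion seeds a hypothesis, and a hypothesis that sees no motion for more than `max_misses` frames is removed.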
176 Citations
18 Claims
1. A method of determining a command, comprising:
capturing an image of an object with a camera;
determining a gesture based at least partly upon the image;
detecting an audio input; and
determining, at one or more processors, the command based at least partly upon the gesture and the audio input.
Dependent claims: 2, 3, 4, 5, 6
7. A computer-readable medium having instructions that cause a processor to execute steps, the steps comprising:
capturing an image of an object with a camera;
determining a gesture based at least partly upon the image;
detecting an audio input; and
determining, at one or more processors, a command based at least partly upon the gesture and the audio input.
Dependent claims: 8, 9, 10, 11, 12
13. A command determining system, comprising:
a camera configured to capture an image of an object;
a first determiner configured to determine a gesture based at least partly upon the image;
an audio detection unit configured to detect an audio input; and
a second determiner configured to determine the command based at least partly upon the gesture and the audio input.
Dependent claims: 14, 15, 16, 17, 18
Specification