HAND GESTURE RECOGNITION FOR VIRTUAL REALITY AND AUGMENTED REALITY DEVICES
First Claim
1. A system for hand gesture recognition, comprising:
- a display;
a camera;
a memory that is to store instructions and that is communicatively coupled to the camera and the display; and
a processor communicatively coupled to the camera, the display, and the memory, wherein when the processor is to execute the instructions, the processor is to;
estimate one or more motion vectors of an object using a pair of consecutive frames;
estimate an average motion vector of the object;
determine a first histogram of optical flow (HOOF) based on the one or more motion vectors and the average motion vector;
determine depth values based on the first HOOF;
determine a second histogram of optical flow (HOOF) based on the depth values;
calculate a descriptor using the first HOOF, the second HOOF, a shape descriptor, and an average motion vector; and
classify the descriptor as a gesture.
2 Assignments
0 Petitions
Accused Products
Abstract
A system for hand gesture recognition includes a display, camera, memory, and processor. When the processor is to execute instructions, the processor is to estimate one or more motion vectors of an object using a pair of consecutive frames and estimate an average motion vector of the object. The processor may also estimate an average motion vector of the object, determine a first histogram of optical flow (HOOF) based on the one or more motion vectors and the average motion vector, determine depth values based on motion vectors from the first HOOF, and determine a second histogram of optical flow (HOOF) based on the depth values. The processor is also to obtain a descriptor based on histogram values from a histogram of optical flow (HOOF) of the one or more motion vectors, a shape descriptor, and the average motion vector and classify the descriptor as a gesture.
30 Citations
25 Claims
-
1. A system for hand gesture recognition, comprising:
-
a display; a camera; a memory that is to store instructions and that is communicatively coupled to the camera and the display; and a processor communicatively coupled to the camera, the display, and the memory, wherein when the processor is to execute the instructions, the processor is to; estimate one or more motion vectors of an object using a pair of consecutive frames; estimate an average motion vector of the object; determine a first histogram of optical flow (HOOF) based on the one or more motion vectors and the average motion vector; determine depth values based on the first HOOF; determine a second histogram of optical flow (HOOF) based on the depth values; calculate a descriptor using the first HOOF, the second HOOF, a shape descriptor, and an average motion vector; and classify the descriptor as a gesture. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method, comprising:
-
extracting a hand mask using depth data; estimating a first plurality of motion vectors based via an optical flow applied to the hand mask; estimating an average motion vector from the optical flow; estimating a second plurality of motion vectors via depth values derived from the first plurality of motion vectors; generating a descriptor based on a histogram of optical flow applied to the first plurality of motion vectors, the second plurality of motion vectors, the hand mask, and the average motion vector; and classifying the descriptor as a gesture. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. An apparatus for hand gesture recognition, comprising:
-
an image capture mechanism to obtain a sequence of frames; an estimator to estimate a first plurality of motion vectors, a second plurality of motion vectors, and an average motion vector for each frame of the sequence of frames; a generator to generate a descriptor based on a first histogram of the first plurality of motion vectors, a second histogram based on the second plurality of motion vectors, a shape descriptor, and the average motion vector for the sequence of frames; and a classifier to classify the descriptor as a gesture. - View Dependent Claims (18, 19, 20, 21)
-
-
22. A tangible, non-transitory, computer-readable medium comprising instructions that, when executed by a processor, direct the processor to:
-
extract a hand mask using depth data; estimate a first plurality of motion vectors based via an optical flow applied to the hand mask; estimate an average motion vector from the optical flow; estimate a second plurality of motion vectors via depth values derived from the first plurality of motion vectors; generate a descriptor based on a histogram of optical flow applied to the hand mask and the average motion vector; and classify the descriptor as a gesture. - View Dependent Claims (23, 24, 25)
-
Specification