Dynamic gesture recognition from stereo sequences
First Claim
1. A method comprising:
- capturing a sequence of stereo images, the stereo images including at least a portion of a subject performing a dynamic gesture;
obtaining depth disparities relating to the stereo images;
automatically initializing parameters of a statistical model of the subject based upon matching an image of the subject to the statistical model;
tracking the subject using the statistical model of the subject;
extracting three-dimensional features from the stereo images; and
interpreting the dynamic gesture performed by the subject.
1 Assignment
0 Petitions
Accused Products
Abstract
According to an embodiment, an apparatus and method are disclosed for dynamic gesture recognition from stereo sequences. In an embodiment, a stereo sequence of images of a subject is obtained and a depth disparity map is generated from the stereo sequence. The system is initiated automatically based upon a statistical model of the upper body of the subject. The upper body of the subject is modeled as three planes, representing the torso and arms of the subject, and three Gaussian components, representing the head and hands of the subject. The system tracks the upper body of the subject using the statistical upper body model and extracts three-dimensional features of the gestures performed. The system recognizes the gestures using recognition units, which, under a particular embodiment, utilizes hidden Markov models for the three-dimensional gestures.
-
Citations
32 Claims
-
1. A method comprising:
-
capturing a sequence of stereo images, the stereo images including at least a portion of a subject performing a dynamic gesture; obtaining depth disparities relating to the stereo images; automatically initializing parameters of a statistical model of the subject based upon matching an image of the subject to the statistical model; tracking the subject using the statistical model of the subject; extracting three-dimensional features from the stereo images; and interpreting the dynamic gesture performed by the subject. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A gesture recognition system comprising:
-
an imaging device to capture a sequence of three-dimensional images of a least a portion of a subject and a background, the subject performing a dynamic gesture; a processor to perform operations comprising; processing a set of depth disparities relating to the stereo images; automatically initializing parameters of a statistical model of the subject based upon matching an image of the subject to the statistical model; tracking the subject using the statistical model of the subject; extracting three-dimensional features from the subject; and interpreting the dynamic gesture performed by the subject. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. A machine-readable medium having stored thereon data representing sequences of instruction that, when executed by a machine, cause the machine to perform operations comprising:
-
capturing a sequence of stereo images, the stereo images including at least a portion of a subject performing a dynamic gesture; obtaining depth disparities relating to the stereo images; automatically initializing parameters of a statistical model of the subject based upon matching an image of the subject to the statistical model; tracking the subject using the statistical model of the subject; extracting three-dimensional features from the stereo images; and interpreting the dynamic gesture performed by the subject. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31, 32)
-
Specification