Visual Language for Human Computer Interfaces
First Claim
1. A computer-implemented method for recognizing hand gestures, the method comprising:
- presenting a region of interest (ROI) on a display;
receiving a digital color image of a user hand against a background, the digital color image captured using a digital image capturing device, the digital color image being represented by pixels in a first color space;
selecting a set of pixels of the digital color image in the first color space from the pixels of the digital color image within the ROI, the selected set of pixels of the digital color image in the first color space describing a general parametric model associated with the digital color image in the first color space;
obtaining specific parametric templates in additional color spaces, each specific parametric template in a color space comprising a selected set of pixels within the ROI representing the user hand in a corresponding color space of the additional color spaces, the additional color spaces emphasizing chrominance over luminance information;
combining the specific parametric templates in the additional color spaces to generate an improved specific parametric template;
obtaining a contour of the user hand by applying the improved specific parametric template to subsequent digital images of the user hand; and
detecting a hand gesture based on the contour of the user hand.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments of the invention recognize human visual gestures, as captured by image and video sensors, to develop a visual language for a variety of human computer interfaces. One embodiment provides a method for recognizing a hand gesture positioned by a user hand. The method includes steps of capturing a digital color image of a user hand against a background, applying a general parametric model to the digital color image of the user hand to generate a specific parametric template of the user hand, receiving a second digital image of the user hand positioned to represent a hand gesture, detecting a hand contour of the hand gesture based at least in part on the specific parametric template of the user hand, and recognizing the hand gesture based at least in part on the detected hand contour. Other embodiments include recognizing hand gestures, facial gestures or body gestures captured in a video.
34 Citations
18 Claims
-
1. A computer-implemented method for recognizing hand gestures, the method comprising:
-
presenting a region of interest (ROI) on a display; receiving a digital color image of a user hand against a background, the digital color image captured using a digital image capturing device, the digital color image being represented by pixels in a first color space; selecting a set of pixels of the digital color image in the first color space from the pixels of the digital color image within the ROI, the selected set of pixels of the digital color image in the first color space describing a general parametric model associated with the digital color image in the first color space; obtaining specific parametric templates in additional color spaces, each specific parametric template in a color space comprising a selected set of pixels within the ROI representing the user hand in a corresponding color space of the additional color spaces, the additional color spaces emphasizing chrominance over luminance information; combining the specific parametric templates in the additional color spaces to generate an improved specific parametric template; obtaining a contour of the user hand by applying the improved specific parametric template to subsequent digital images of the user hand; and detecting a hand gesture based on the contour of the user hand. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A non-transitory computer readable medium storing instructions for recognizing hand gestures, the instruction when executed by one or more processors cause the one or more processors to:
-
present a region of interest (ROI) on a display; receive a digital color image of a user hand against a background, the digital color image captured using a digital image capturing device, the digital color image being represented by pixels in a first color space; select a set of pixels of the digital color image in the first color space from the pixels of the digital color image within the ROI, the selected set of pixels of the digital color image in the first color space describing a general parametric model associated with the digital color image in the first color space; obtain specific parametric templates in additional color spaces, each specific parametric template in a color space comprising a selected set of pixels within the ROI representing the user hand in a corresponding color space of the additional color spaces, the additional color spaces emphasizing chrominance over luminance information; combine the specific parametric templates in the additional color spaces to generate an improved specific parametric template; obtain a contour of the user hand by applying the improved specific parametric template to subsequent digital images of the user hand; and detect a hand gesture based on the contour of the user hand.
-
-
10. A computer implemented method for recognizing a visual gesture, the method comprising:
-
receiving a first set of images including a part of a human body within a region of interest (ROI), the part of a human body oriented in a first configuration; registering a flesh tone of the part of the human body within the ROI of the first set of images in a first color space and a second color space; receiving a second set of images including the part of the human body, the part of the human body oriented in different configurations than the first configuration, the part of the human body oriented in the different configurations representing a visual gesture; identifying one or more objects in the second set of images corresponding to the part of the human body based on the registered flesh tone in the first color space and the second color space; obtaining motion vectors of the one or more objects in the second set of images by tracking the one or more objects in the second set of images; and determining the visual gesture represented by the part of the human body oriented in the different configurations according to the identified one or more objects and the motion vectors. - View Dependent Claims (11)
-
-
12. A computer-implemented method for recognizing hand gestures in video imagery, wherein the hand gesture has a static element, a motion element, or a mixture of the two, the method comprising:
-
pre-registering a user hand, using one or multiple frames of the video; performing an adaptive hand-detection on subsequent video frames, performing an adaptive hand-detection comprising; applying at least one of skin color/tone analysis in color spaces and motion estimation, to segment regions or objects, especially a hand, within the video frames, wherein said skin color/tone analysis in color spaces comprises; using skin tone analysis in one or more color spaces, to obtain skin map(s), merging skin maps from the one or more color spaces, and using an adaptive threshold to segment a region or object by detecting skin pixels and grouping them together, and wherein said motion estimation comprises; obtaining motion vector fields between two successive frames, typically for blocks within video frames, tracking motion vectors on subsequent frames, applying a combination of cluster analysis of motion vectors, and tracking the evolution of motion vectors on spatial regions and their features; and recognizing a hand gesture in part by applying at least one of the following four tool groups;
(i) skin color/tone analysis in one or more color spaces, (ii) motion estimation, (iii) morphological operations, and (iv) other image processing tools, on regions or objects, especially a segmented hand, to detect hands, hand contours and their features, as well as any motion of hand parts;wherein said skin color/tone analysis in one or more color spaces comprises; using skin tone analysis in one or more color spaces, to obtain skin map(s), merging skin maps from the one or more color spaces, and using an adaptive threshold to segment a region or object by detecting skin pixels and grouping them together, wherein said motion estimation comprises; obtaining motion vectors between two successive frames, typically for blocks within video frames, tracking motion vectors on subsequent frames, and applying any combination of cluster analysis of motion vectors, and tracking the evolution of motion vectors, defined spatial regions and their features, to detect regions or objects within video frames, wherein said morphological operations include dilations, erosions, opening, and closing operations, and wherein said other image processing tools include measuring distances, angles, extrema points, convexity, and shape on hand contours. - View Dependent Claims (13, 14, 15, 16, 17, 18)
-
Specification