Hand and indicating-point positioning method and hand gesture determining method used in human-computer interaction system
First Claim
1. A hand-positioning method used in a human-computer interaction system comprises:
- continuously capturing a current image of a monitored display area so as to obtain a sequence of video images including a foreground object;
extracting a foreground image from each of the obtained video images, and then carrying out binary processing with regard to the extracted foreground image so as to obtain a binary foreground image;
obtaining a set of vertexes of a minimum convex hull of the binary foreground image, and then creating areas of concern serving as candidate hand areas by letting the respective vertexes of the minimum convex hull of the binary foreground image be central points of the areas of concern; and
extracting hand imaging features from the respective created areas of concern, and then determining a hand area from the candidate hand areas by carrying out pattern recognition based on the extracted hand imaging features, whereinthe binary processing carried out with regard to the extracted foreground image includes;
calculating an edge image of the extracted foreground image;
calculating a gradient image of the extracted foreground image, and then carrying out binary processing with regard to the gradient image so as to obtain a first binary result;
carrying out binary processing with regard to the extracted foreground image by using a predetermined threshold value so as to obtain a second binary result;
combining the edge image, the first binary result, and the second binary result by using an OR logical operation so as to obtain a new binary image; and
filling a closed area in the obtained new binary image.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed are a hand positioning method and a human-computer interaction system. The method comprises a step of continuously capturing a current image so as to obtain a sequence of video images; a step of extracting a foreground image from each of the captured video images, and then carrying out binary processing so as to obtain a binary foreground image; a step of obtaining a vertex set of a minimum convex hull of the binary foreground image, and then creating areas of concern serving as candidate hand areas; and a step of extracting hand imaging features from the respective created areas of concern, and then determining a hand area from the candidate hand areas by carrying out pattern recognition based on the extracted hand imaging features.
16 Citations
9 Claims
-
1. A hand-positioning method used in a human-computer interaction system comprises:
-
continuously capturing a current image of a monitored display area so as to obtain a sequence of video images including a foreground object; extracting a foreground image from each of the obtained video images, and then carrying out binary processing with regard to the extracted foreground image so as to obtain a binary foreground image; obtaining a set of vertexes of a minimum convex hull of the binary foreground image, and then creating areas of concern serving as candidate hand areas by letting the respective vertexes of the minimum convex hull of the binary foreground image be central points of the areas of concern; and extracting hand imaging features from the respective created areas of concern, and then determining a hand area from the candidate hand areas by carrying out pattern recognition based on the extracted hand imaging features, wherein the binary processing carried out with regard to the extracted foreground image includes; calculating an edge image of the extracted foreground image; calculating a gradient image of the extracted foreground image, and then carrying out binary processing with regard to the gradient image so as to obtain a first binary result; carrying out binary processing with regard to the extracted foreground image by using a predetermined threshold value so as to obtain a second binary result; combining the edge image, the first binary result, and the second binary result by using an OR logical operation so as to obtain a new binary image; and filling a closed area in the obtained new binary image. - View Dependent Claims (2, 3, 4)
-
-
5. A human-computer interaction system comprises:
-
circuitry configured to; continuously capture a current image of a monitored display area so as to obtain a sequence of video images including a foreground object; extract a foreground image from each of the obtained video images, and then carry out binary processing with regard to the extracted foreground image so as to obtain a binary foreground image; obtain a set of vertexes of a minimum convex hull of the binary foreground image, and then create areas of concern serving as candidate hand areas by letting the respective vertexes of the minimum convex hull of the binary foreground image be central points of the areas of concern; and extract hand imaging features from the respective created areas of concern, and then determine a hand area from the candidate hand areas by carrying out pattern recognition based on the extracted hand imaging features, wherein the circuitry is configured to carry out the binary processing with regard to the extracted foreground image by; calculating an edge image of the extracted foreground image; calculating a gradient image of the extracted foreground image, and then carrying out binary processing with regard to the gradient image so as to obtain a first binary result; carrying out binary processing with regard to the extracted foreground image by using a predetermined threshold value so as to obtain a second binary result; combining the edge image, the first binary result, and the second binary result by using an OR logical operation so as to obtain a new binary image; and filling a closed area in the obtained new binary image. - View Dependent Claims (6, 7, 8)
-
-
9. A non-transitory computer-readable medium having machine-executable instructions for execution by a processing system, wherein, the machine-executable instructions are used for carrying out a hand-positioning method used in a human-computer interaction system, and the machine-executable instructions, when executed, cause the processing system to:
-
continuously capture a current image of a monitored display area so as to obtain a sequence of video images including a foreground object; extract a foreground image from each of the obtained video images, and then carry out binary processing with regard to the extracted foreground image so as to obtain a binary foreground image; obtain a set of vertexes of a minimum convex hull of the binary foreground image, and then create areas of concern serving as candidate hand areas by letting the respective vertexes of the minimum convex hull of the binary foreground image be central points of the areas of concern; and extract hand imaging features from the respective created areas of concern, and then determine a hand area from the candidate hand areas by carrying out pattern recognition based on the extracted hand imaging features, wherein the binary processing carried out with regard to the extracted foreground image includes; calculating an edge image of the extracted foreground image; calculating a gradient image of the extracted foreground image, and then carrying out binary processing with regard to the gradient image so as to obtain a first binary result; carrying out binary processing with regard to the extracted foreground image by using a predetermined threshold value so as to obtain a second binary result; combining the edge image, the first binary result, and the second binary result by using an OR logical operation so as to obtain a new binary image; and filling a closed area in the obtained new binary image.
-
Specification