Gesture pre-processing of video stream using a markered region
First Claim
1. A system for processing a raw video stream, comprising:
- a threshold detection stage comprising integrated circuitry configured to determine if a current frame of the video stream is sufficiently different from a previous frame of the video stream, and in response to the current frame not being sufficiently different than the previous frame, the threshold detection stage is further configured to stop further processing of the current frame by a subsequent stage;
a target presence stage comprising integrated circuitry configured to determine if the current frame includes skintone colored pixels, and in response to the current frame not including skintone colored pixels, the target presence stage is further configured to stop further processing of the current frame by a subsequent stage; and
a gesture processing stage configured to perform at least one of spatial and temporal interpretation of image data within frames passed by the threshold detection and target presence stages.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques are disclosed for processing a video stream to reduce platform power by employing a stepped and distributed pipeline process, wherein CPU-intensive processing is selectively performed. The techniques are particularly well-suited for hand-based navigational gesture processing. In one example case, for instance, the techniques are implemented in a computer system wherein initial threshold detection (image disturbance) and optionally user presence (hand image) processing components are proximate to or within the system'"'"'s camera, and the camera is located in or proximate to the system'"'"'s primary display. In some cases, image processing and communication of pixel information between various processing stages which lies outside a markered region is suppressed. In some embodiments, the markered region is aligned with, a mouse pad or designated desk area or a user input device such as a keyboard. Pixels evaluated by the system can be limited to a subset of the markered region.
24 Citations
27 Claims
-
1. A system for processing a raw video stream, comprising:
-
a threshold detection stage comprising integrated circuitry configured to determine if a current frame of the video stream is sufficiently different from a previous frame of the video stream, and in response to the current frame not being sufficiently different than the previous frame, the threshold detection stage is further configured to stop further processing of the current frame by a subsequent stage; a target presence stage comprising integrated circuitry configured to determine if the current frame includes skintone colored pixels, and in response to the current frame not including skintone colored pixels, the target presence stage is further configured to stop further processing of the current frame by a subsequent stage; and a gesture processing stage configured to perform at least one of spatial and temporal interpretation of image data within frames passed by the threshold detection and target presence stages. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A computer navigation method using a raw video stream, comprising:
-
determining, at a first processing stage, if a current frame of the video stream is sufficiently different from a previous frame of the video stream, and in response to the current frame not being sufficiently different than the previous frame, stopping further processing of the current frame by a subsequent stage; determining, at a second processing stage, if the current frame includes skintone colored pixels, and in response to the current frame not including skintone colored pixels, stopping further processing of the current frame by a subsequent stage; receiving, at a third processing stage, data from the current frame and separating pixels of interest from pixels that are not of interest, wherein pixels of interest are represented as blobs and pixels not of interest are represented by non-blobs; performing, at a fourth processing stage, spatial interpretation of blob data received from the third processing stage by mapping the blob data to human hand geometry; performing, at a fifth processing stage, temporal interpretation of blob data received from the fourth processing stage to identify a user navigation gesture; and directing a computing system based on the user navigation gesture; wherein the processing stages are configured in a stepped and distributed arrangement and image processing and communication of pixel information which lies outside a markered region is eliminated. - View Dependent Claims (21)
-
-
22. A non-transient computer program product encoded with instructions that when executed by one or more processors causes a navigation method to be carried out using a raw video stream, the method comprising:
-
determining, at a first processing stage, if a current frame of the video stream is sufficiently different from a previous frame of the video stream, and in response to the current frame not being sufficiently different than the previous frame, stopping further processing of the current frame by a subsequent stage; determining, at a second processing stage, if the current frame includes skintone colored pixels, and in response to the current frame not including skintone colored pixels, stopping further processing of the current frame by a subsequent stage; and performing, at a stage subsequent to the first and second processing stages, at least one of spatial and temporal interpretation of image data within frames passed by the threshold detection and target presence stages; wherein image processing and communication of pixel information which lies outside a markered region is eliminated. - View Dependent Claims (23, 24, 25, 26, 27)
-
Specification