Gesture pre-processing of video stream with hold-off period to reduce platform power
First Claim
1. A system for processing a raw video stream, comprising:
- a threshold detection stage comprising integrated circuitry configured to determine if a current frame of the video stream is sufficiently different from a previous frame of the video stream, and in response to the current frame not being sufficiently different than the previous frame, the threshold detection stage is further configured to stop further processing of the current frame by a subsequent stage; and
a target presence stage comprising integrated circuitry configured to determine if the current frame includes skin tone colored pixels, and in response to the current frame not including skin tone colored pixels, the target presence stage is further configured to stop further processing of the current frame by a subsequent stage;
wherein at least one of the threshold detection and/or target presence stages can be selectively disabled for a hold-off period.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques are disclosed for processing a video stream to reduce platform power by employing a stepped and distributed pipeline process, wherein CPU-intensive processing is selectively performed. In one example case, the techniques are implemented in a user'"'"'s computer system wherein initial threshold detection (image disturbance) and optionally user presence (e.g., hand image) processing components are proximate to or within the system'"'"'s camera, and the camera is located in or proximate to the system'"'"'s primary display. The threshold detection and/or target presence stages can be selectively disabled for a hold-off period. The hold-off period may be, for example, in the range of 50 to 1000 mSec and triggered in response to an indication that a user of the system is unlikely to be making navigational gestures or that the system is not ready to process video, thereby conserving power by avoiding processing of video frames free of navigation gestures.
-
Citations
25 Claims
-
1. A system for processing a raw video stream, comprising:
-
a threshold detection stage comprising integrated circuitry configured to determine if a current frame of the video stream is sufficiently different from a previous frame of the video stream, and in response to the current frame not being sufficiently different than the previous frame, the threshold detection stage is further configured to stop further processing of the current frame by a subsequent stage; and a target presence stage comprising integrated circuitry configured to determine if the current frame includes skin tone colored pixels, and in response to the current frame not including skin tone colored pixels, the target presence stage is further configured to stop further processing of the current frame by a subsequent stage; wherein at least one of the threshold detection and/or target presence stages can be selectively disabled for a hold-off period. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A system for processing a raw video stream, comprising:
-
a threshold detection stage comprising integrated circuitry configured to determine if a current frame of the video stream is sufficiently different from a previous frame of the video stream, and in response to the current frame not being sufficiently different than the previous frame, the threshold detection stage is further configured to stop further processing of the current frame by a subsequent stage; a target presence stage comprising integrated circuitry configured to determine if the current frame includes skin tone colored pixels, and in response to the current frame not including skin tone colored pixels, the target presence stage is further configured to stop further processing of the current frame by a subsequent stage; a segmentation stage configured to receive data from the current frame, and separate pixels of interest from pixels that are not of interest, wherein pixels of interest are represented as blobs and pixels not of interest are represented by non-blobs; a hand parser stage that is configured to perform spatial interpretation of blob data received from the segmentation stage; and a gesture recognition stage that is configured to perform temporal interpretation of blob data received from the hand parser stage and to identify user hand-based navigation gestures; wherein the threshold detection, target presence, segmentation, parser, and gesture recognition stages are configured in a stepped and distributed arrangement, and at least one of the threshold detection and/or target presence stages can be selectively disabled for a hold-off period. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A computer navigation method using a raw video stream, comprising:
-
determining, at a first processing stage, if a current frame of the video stream is sufficiently different from a previous frame of the video stream, and in response to the current frame not being sufficiently different than the previous frame, stopping further processing of the current frame by a subsequent stage; and determining, at a second processing stage, if the current frame includes skin tone colored pixels, and in response to the current frame not including skin tone colored pixels, stopping further processing of the current frame by a subsequent stage; wherein at least one of the first and second processing stages can be selectively disabled for a hold-off period, the hold-off period being in the range of 50 to 1000 mSec and triggered in response to an indication that no navigational gestures are being made, thereby conserving power by avoiding processing of video frames free of navigation gestures.
-
Specification