Head pose assessment methods and systems
First Claim
1. An apparatus for use with at least one display device and at least one image capturing device, the apparatus comprising:
- a processor;
one or more memories coupled to the processor, the one or more memories having stored instructions that configure the apparatus to implement;
display logic configurable to output at least one signal suitable for causing a display device to display at least two different selectable regions;
interface logic configurable to receive image data from an image capturing device;
pose estimation logic operatively coupled to said display logic and said interface logic and configured to determine a first head pose based on a first image and at least a second head pose based on a second image temporally subsequent to said first image, and automatically operatively switch an operative user input focus between said at least two selectable regions based on at least one difference between said first head pose and at least said second head pose; and
programmable filter logic operatively coupled to said display logic and said interface logic and configured to filter the second image with respect to the first image, the second image being logically weighted differently than the first image based on an amount of time between a capture of the first image and a capture of the second image.
2 Assignments
0 Petitions
Accused Products
Abstract
Improvements are provided to effectively assess a user'"'"'s face and head pose such that a computer or like device can track the user'"'"'s attention towards a display device(s). Then the region of the display or graphical user interface that the user is turned towards can be automatically selected without requiring the user to provide further inputs. A frontal face detector is applied to detect the user'"'"'s frontal face and then key facial points such as left/right eye center, left/right mouth corner, nose tip, etc., are detected by component detectors. The system then tracks the user'"'"'s head by an image tracker and determines yaw, tilt and roll angle and other pose information of the user'"'"'s head through a coarse to fine process according to key facial points and/or confidence outputs by pose estimator.
19 Citations
20 Claims
-
1. An apparatus for use with at least one display device and at least one image capturing device, the apparatus comprising:
-
a processor; one or more memories coupled to the processor, the one or more memories having stored instructions that configure the apparatus to implement; display logic configurable to output at least one signal suitable for causing a display device to display at least two different selectable regions; interface logic configurable to receive image data from an image capturing device; pose estimation logic operatively coupled to said display logic and said interface logic and configured to determine a first head pose based on a first image and at least a second head pose based on a second image temporally subsequent to said first image, and automatically operatively switch an operative user input focus between said at least two selectable regions based on at least one difference between said first head pose and at least said second head pose; and programmable filter logic operatively coupled to said display logic and said interface logic and configured to filter the second image with respect to the first image, the second image being logically weighted differently than the first image based on an amount of time between a capture of the first image and a capture of the second image. - View Dependent Claims (2)
-
-
3. An apparatus for use with at least one display device and at least one image capturing device, the apparatus comprising:
-
a processor; one or more memories coupled to the processor, the one or more memories having stored instructions that configure the apparatus to implement; display logic configurable to output at least one signal suitable for causing a display device to display at least two different selectable regions; interface logic configurable to receive image data from an image capturing device; pose estimation logic operatively coupled to said display logic and said interface logic and configured to determine a first head pose based on a first image and at least a second head pose based on a second image temporally subsequent to said first image, and automatically operatively switch an operative user input focus between said at least two selectable regions based on at least one difference between said first head pose and at least said second head pose; and memory operatively coupled to at least said pose estimation logic, said memory being configurable to store said first image and said second image, wherein said first image captures at least a first portion of a face of a user at a first time and said second image captures at least a second portion of said face of said user at a second subsequent time; wherein said pose estimation logic is further configured to; access said first image, detect at least said first portion of said face within said first image, detect at least one point within said detected first portion of said face, and store first tracking information associated with each of said at least one point within said detected first portion to said memory, and access said second image, track at least said second portion of said face within said second image, detect said at least one point within said detected second portion of said face, and store second tracking information associated with each of said at least one point within said detected second portion to said memory, and classify each of a plurality of portions of image data associated with said first image based on at least one classifying parameter to determine at least one facial region associated with said first portion of said face. - View Dependent Claims (4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
at least one display device; at least one image capturing device; and a computing device operatively coupled to said display device and said image capturing device and having; display logic configured to output at least one signal suitable for causing a display device to display at least two different selectable regions, interface logic configured to receive image data from an image capturing device, and pose estimation logic configured to determine a first head pose based on a first image and at least a second head pose based on a second image temporally subsequent to said first image, and automatically operatively switch an operative user input focus between said at least two selectable regions based on at least one difference between said first head pose and at least said second head pose, wherein said pose estimation logic is further configured to classify each of a plurality of portions of image data associated with said first image based on at least one classifying parameter to determine at least one facial region associated with at least one portion of a face of a user, wherein said face of said user is captured by said first image. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification