HEAD POSE ESTIMATION USING RGBD CAMERA
First Claim
1. A method comprising:
- capturing multiple images with depth data of a head of a subject; and
determining a rotation matrix and a translation vector associated with a pose of the head in the multiple images relative to a reference coordinate frame using the depth data.
1 Assignment
0 Petitions
Accused Products
Abstract
A three-dimensional pose of the head of a subject is determined based on depth data captured in multiple images. The multiple images of the head are captured, e.g., by an RGBD camera. A rotation matrix and translation vector of the pose of the head relative to a reference pose is determined using the depth data. For example, arbitrary feature points on the head may be extracted in each of the multiple images and provided along with corresponding depth data to an Extended Kalman filter with states including a rotation matrix and a translation vector associated with the reference pose for the head and a current orientation and a current position. The three-dimensional pose of the head with respect to the reference pose is then determined based on the rotation matrix and the translation vector.
63 Citations
24 Claims
-
1. A method comprising:
-
capturing multiple images with depth data of a head of a subject; and determining a rotation matrix and a translation vector associated with a pose of the head in the multiple images relative to a reference coordinate frame using the depth data. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. An apparatus comprising:
-
a Red, Green, Blue, Distance (RGBD) camera to capture images with depth data of a head of a subject; and a processor coupled to the RGBD camera to receive multiple images with the depth data of the head of the subject, the processor being configured to determine a rotation matrix and a translation vector associated with a pose of the head in the multiple images relative to a reference coordinate frame using the depth data. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. An apparatus comprising:
-
means for capturing multiple images with depth data of a head of a subject; and means for determining a rotation matrix and a translation vector associated with a pose of the head in the multiple images relative to a reference coordinate frame using the depth data. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A non-transitory computer-readable medium including program code stored thereon, comprising:
-
program code to receive multiple images with the depth data of a head of a subject; program code to determine a rotation matrix and a translation vector associated with a pose of the head in the multiple images relative to a reference coordinate frame using the depth data. - View Dependent Claims (20, 21, 22, 23, 24)
-
Specification