Body feature detection and human pose estimation using inner distance shape contexts
First Claim
1. A computer based method for detecting a feature point of an object in an image of the object, the method comprising:
- receiving a plurality of sequential images including the image and a previous image captured earlier in time than the image;
detecting a set of feature points from within the previous image;
estimating a pose of a human actor in a human model based on enforcing joint limitations and self-penetration avoidance based on the detected set of feature points from within the previous image;
segmenting an image region of the object from an image region of background in the image based on the estimated pose;
sampling a plurality of points along a contour of the segmented image region of the object;
determining Inner Distance Shape Context (IDSC) descriptors for the sampled plurality of points;
for each of the sampled plurality of points, comparing a threshold value with a difference between the IDSC descriptor of a point and a feature point IDSC descriptor of the feature point;
responsive to the threshold value exceeding differences associated with two or more of the sampled plurality of points, selecting one of the two or more of the sampled plurality of points as the feature point of the object in the image, wherein the object comprises a human actor;
augmenting a position of a missing feature point with the detected set of feature points based on the selected feature point; and
reconstructing a pose of the human actor based at least in part on the augmented missing feature point.
1 Assignment
0 Petitions
Accused Products
Abstract
A system, method, and computer program product for estimating human body pose are described. According to one aspect, a human figure silhouette is segmented from a depth image of a human actor. Contour points are sampled along the human figure silhouette. Inner Distance Shape Context (IDSC) descriptors of the sample contour points are determined and compared to IDSC descriptors of the feature points in an IDSC gallery for similarity. For each of the feature points, the sample contour point with the IDSC descriptor that is most similar to an IDSC of the feature point is identified as that feature point in the depth image. An estimated pose of a human model is estimated based on the detected feature points and kinematic constraints of the human model.
31 Citations
16 Claims
-
1. A computer based method for detecting a feature point of an object in an image of the object, the method comprising:
-
receiving a plurality of sequential images including the image and a previous image captured earlier in time than the image; detecting a set of feature points from within the previous image; estimating a pose of a human actor in a human model based on enforcing joint limitations and self-penetration avoidance based on the detected set of feature points from within the previous image; segmenting an image region of the object from an image region of background in the image based on the estimated pose; sampling a plurality of points along a contour of the segmented image region of the object; determining Inner Distance Shape Context (IDSC) descriptors for the sampled plurality of points; for each of the sampled plurality of points, comparing a threshold value with a difference between the IDSC descriptor of a point and a feature point IDSC descriptor of the feature point; responsive to the threshold value exceeding differences associated with two or more of the sampled plurality of points, selecting one of the two or more of the sampled plurality of points as the feature point of the object in the image, wherein the object comprises a human actor; augmenting a position of a missing feature point with the detected set of feature points based on the selected feature point; and reconstructing a pose of the human actor based at least in part on the augmented missing feature point. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer program product for detecting a feature point of an object in an image of the object, the computer program product comprising a computer-readable storage medium containing executable computer program code for performing a method comprising:
-
receiving a plurality of sequential images including the image and a previous image captured earlier in time than the image; detecting a set of feature points from within the previous image; estimating a pose of a human actor in a human model based on enforcing joint limitations and self-penetration avoidance based on the detected set of feature points from within the previous image; segmenting an image region of the object from an image region of background in the image based on the estimated pose; sampling a plurality of points along a contour of the segmented image region of the object; determining Inner Distance Shape Context (IDSC) descriptors for the sampled plurality of points; for each of the sampled plurality of points, comparing a threshold value with a difference between the IDSC descriptor of a point and a feature point IDSC descriptor of the feature point; responsive to the threshold value exceeding differences associated with two or more of the sampled plurality of points, selecting one of the two or more of the sampled plurality of points as the feature point of the object in the image, wherein the object comprises a human actor; augmenting a position of a missing feature point with the detected set of feature points based on the selected feature point; and reconstructing a pose of the human actor based at least in part on the augmented missing feature point.
-
-
16. A system for detecting a feature point of an object in an image of the object, the system comprising:
-
a computer processor for executing executable computer program code; a computer-readable storage medium containing the executable computer program code for performing a method comprising; receiving a plurality of sequential images including the image and a previous image captured earlier in time than the image; detecting a set of feature points from within the previous image; estimating a pose of a human actor in a human model based on enforcing joint limitations and self-penetration avoidance based on the detected set of feature points from within the previous image; segmenting an image region of the object from an image region of background in the image based on the estimated pose; sampling a plurality of points along a contour of the segmented image region of the object; determining Inner Distance Shape Context (IDSC) descriptors for the sampled plurality of points; for each of the sampled plurality of points, comparing a threshold value with a difference between the IDSC descriptor of a point and a feature point IDSC descriptor of the feature point; responsive to the threshold value exceeding differences associated with two or more of the sampled plurality of points, selecting one of the two or more of the sampled plurality of points as the feature point of the object in the image, wherein the object comprises a human actor; augmenting a position of a missing feature point with the detected set of feature points based on the selected feature point; and reconstructing a pose of the human actor based at least in part on the augmented missing feature point.
-
Specification