×

Object detection and tracking

  • US 9,729,865 B1
  • Filed: 06/18/2014
  • Issued: 08/08/2017
  • Est. Priority Date: 06/18/2014
  • Status: Active Grant
First Claim
Patent Images

1. A non-transitory computer-readable storage medium storing instructions that, when executed by a processor, cause a computing device to:

  • generate first right image data during a first period of time with a right camera;

    generate first left image data during the first period of time with a left camera, the right camera and the left camera having at least partially overlapping fields of view;

    detect, using a face detection algorithm, a first right representation of a face in the first right image data, the face detection algorithm returning a first right bounding box for the face in the first right image data;

    detect, using the face detection algorithm, a first left representation of the face in the first left image data, the face detection algorithm returning a first left bounding box for the face in the first left image data;

    detect, using a feature extraction algorithm, a set of features of the face in the first right image data by analyzing the first right image data within the first right bounding box;

    detect, using the feature extraction algorithm, the set of features of the face in the first left image data by analyzing the first left image data within the first left bounding box;

    determine a first right position for a point relative to the set of features in the first right image data;

    determine a first left position for the point relative to the set of features in the first left image data;

    generate second right image data during a second period of time with the right camera;

    generate second left image data during the second period of time with the left camera;

    detect, using the face detection algorithm, a second right representation of the face in the second right image data, the face detection algorithm returning a second right bounding box for the face in the second right image data;

    detect, using the face detection algorithm, a second left representation of the face in the second left image data, the face detection algorithm returning a second left bounding box for the face in the second left image data;

    detect, using the feature extraction algorithm, the set of features of the face in the second right image data by analyzing the second right image data within the second right bounding box;

    detect, using the feature extraction algorithm, the set of features of the face in the first left image data by analyzing the second left image data within the second left bounding box;

    determine a second right position for the point relative to the set of features in the second right image data;

    determine a second left position for the point relative to the set of features in the second left image data;

    determine a right two-dimensional (2D) change in position of the point relative to the set of features between the first right image data and the second right image data;

    determine a left 2D change in position of the point relative to the set of features between the first left image data and the second left image data;

    determine, using a feature tracking algorithm and based at least in part on the right 2D change in position, a right 2D output for the point relative to the set of features;

    determine, using the feature tracking algorithm and based at least in part on the left 2D change in position, a left 2D output for the point relative to the set of features;

    determine stereo disparity of the point relative to the set of features between the right 2D output and the left 2D output in the at least partially overlapping fields of view of the right camera and the left camera;

    determine a z-depth for the point relative to the set of features of the face using the stereo disparity and calibration information for the right camera and the left camera to determine a three-dimensional (3D) position for the point relative to the set of features;

    generate third right image data during a third period of time with the right camera;

    generate third left image data during the third period of time with the left camera;

    determine that a third right representation of the face is detected in the third right image data;

    determine that a third left representation of the face is not detected in the third left image data;

    generate a template of the face using information for the face from the third right image data; and

    use the template to detect the face in the third left image data.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×