Facial tracking with classifiers
First Claim
1. A computer-implemented method for facial detection comprising:
- obtaining a video that includes a face;
performing face detection to initialize locations for a first set of facial landmarks within a first frame from the video containing an image of the face wherein the face detection comprises;
performing facial landmark detection within the first frame from the video; and
estimating a rough bounding box for the face based on the facial landmark detection;
refining the locations for the first set of facial landmarks based on localized information around the first set of facial landmarks;
estimating future locations for landmarks within the first set of facial landmarks for a future frame from the first frame; and
training a classifier for a video clip for facial detection, wherein the training includes generating a scaled version of the image of the face directly from an initial frame, wherein the video is a chronological series of frames and the initial frame is an earliest frame containing the face in the chronological series of frames.
1 Assignment
0 Petitions
Accused Products
Abstract
Concepts for facial tracking with classifiers is disclosed. One or more faces are detected and tracked in a series of video frames that include at least one face. Video is captured and partitioned into the series of frames. A first video frame is analyzed using classifiers trained to detect the presence of at least one face in the frame. The classifiers are used to initialize locations for a first set of facial landmarks for the first face. The locations of the facial landmarks are refined using localized information around the landmarks, and a rough bounding box that contains the facial landmarks is estimated. The future locations for the facial landmarks detected in the first video frame are estimated for a future video frame. The detection of the facial landmarks and estimation of future locations of the landmarks are insensitive to rotation, orientation, scaling, or mirroring of the face.
197 Citations
26 Claims
-
1. A computer-implemented method for facial detection comprising:
-
obtaining a video that includes a face; performing face detection to initialize locations for a first set of facial landmarks within a first frame from the video containing an image of the face wherein the face detection comprises; performing facial landmark detection within the first frame from the video; and estimating a rough bounding box for the face based on the facial landmark detection; refining the locations for the first set of facial landmarks based on localized information around the first set of facial landmarks; estimating future locations for landmarks within the first set of facial landmarks for a future frame from the first frame; and training a classifier for a video clip for facial detection, wherein the training includes generating a scaled version of the image of the face directly from an initial frame, wherein the video is a chronological series of frames and the initial frame is an earliest frame containing the face in the chronological series of frames. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 22, 23, 24, 25, 26)
-
-
20. A computer program product embodied in a non-transitory computer readable medium for facial detection comprising:
-
code for obtaining a video that includes a face; code for performing face detection to initialize locations for a first set of facial landmarks within a first frame from the video containing an image of the face wherein the face detection comprises; performing facial landmark detection within the first frame from the video; and estimating a rough bounding box for the face based on the facial landmark detection; code for refining the locations for the first set of facial landmarks based on localized information around the first set of facial landmarks; code for estimating future locations for landmarks within the first set of facial landmarks for a future frame from the first frame; and code for training a classifier for a video clip for facial detection, wherein the training includes generating a scaled version of the image of the face directly from an initial frame, wherein the video is a chronological series of frames and the initial frame is an earliest frame containing the face in the chronological series of frames.
-
-
21. A computer system for facial detection comprising:
-
a memory which stores instructions; one or more processors attached to the memory wherein the one or more processors when executing the instructions which are stored, are configured to; obtain a video that includes a face; perform face detection to initialize locations for a first set of facial landmarks within a first frame from the video containing an image of the face wherein the face detection comprises; performing facial landmark detection within the first frame from the video; and estimating a rough bounding box for the face based on the facial landmark detection; refine the locations for the first set of facial landmarks based on localized information around the first set of facial landmarks; estimate future locations for landmarks within the first set of facial landmarks for a future frame from the first frame; and train a classifier for a video clip for facial detection, wherein the training includes generating a scaled version of the image of the face directly from an initial frame, wherein the video is a chronological series of frames and the initial frame is an earliest frame containing the face in the chronological series of frames.
-
Specification