Method and system using a data-driven model for monocular face tracking
First Claim
1. A method for image processing comprising:
- obtaining stereo data based on input image sequences of varying facial expressions;
- building a three-dimensional (3D) model using the obtained stereo data to obtain principal shape vectors; and
- tracking a second input image sequence using the 3D model to approximate a linear combination of the principal shape vectors of a facial expression in the second input image sequence, wherein the second input image sequence is a monocular image sequence.
Abstract
A method and system using a data-driven model for monocular face tracking are disclosed, which provide a versatile system for tracking three-dimensional (3D) images, e.g., a face, using a single camera. For one method, stereo data based on input image sequences is obtained. A 3D model is built using the obtained stereo data. A monocular image sequence is tracked using the built 3D model. Principal Component Analysis (PCA) can be applied to the stereo data to learn, e.g., possible facial deformations, and to build a data-driven 3D model (“3D face model”). The 3D face model can be used to approximate a generic shape (e.g., facial pose) as a linear combination of shape basis vectors based on the PCA analysis.
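The model-building idea in the abstract can be sketched as follows. This is a minimal illustration and not the patented implementation: it assumes the stereo data has already been reduced to N tracked 3D points per frame, and the names `build_shape_model`, `approximate_shape`, and `num_modes` are hypothetical. PCA is computed here with a singular value decomposition of the mean-centered shapes.

```python
import numpy as np

def build_shape_model(shapes, num_modes):
    """Learn a mean shape and PCA shape basis (hypothetical helper).

    shapes: (T, N, 3) array, N tracked 3D points over T stereo frames.
    Returns the mean (3N,) and an orthonormal basis (num_modes, 3N).
    """
    T = shapes.shape[0]
    X = shapes.reshape(T, -1)          # one flattened 3D shape per row
    mean = X.mean(axis=0)
    # Rows of Vt are the principal directions of the centered data.
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, Vt[:num_modes]

def approximate_shape(shape, mean, basis):
    """Express a shape as mean + linear combination of basis vectors."""
    x = shape.reshape(-1)
    coeffs = basis @ (x - mean)        # projection onto each basis vector
    recon = mean + coeffs @ basis      # linear combination of the basis
    return coeffs, recon.reshape(shape.shape)

# Synthetic stand-in for stereo-derived shapes: a neutral face plus
# random mixtures of two deformation modes (assumed data, not real).
rng = np.random.default_rng(0)
N = 5
neutral = rng.normal(size=(N, 3))
modes = rng.normal(size=(2, N, 3))
weights = rng.normal(size=(20, 2))
shapes = neutral + np.tensordot(weights, modes, axes=1)   # (20, N, 3)

mean, basis = build_shape_model(shapes, num_modes=2)
new_shape = neutral + 0.3 * modes[0] - 0.5 * modes[1]
coeffs, recon = approximate_shape(new_shape, mean, basis)
```

Because the synthetic shapes are generated from exactly two deformation modes, two basis vectors are enough to reconstruct `new_shape`; real stereo data would generally need more modes and would leave a residual.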
33 Citations
12 Claims
1. A method for image processing comprising:
- obtaining stereo data based on input image sequences of varying facial expressions;
- building a three-dimensional (3D) model using the obtained stereo data to obtain principal shape vectors; and
- tracking a second input image sequence using the 3D model to approximate a linear combination of the principal shape vectors of a facial expression in the second input image sequence, wherein the second input image sequence is a monocular image sequence.
Dependent claims: 2, 3, 4
5. A computing system comprising:
- an input unit to obtain stereo data based on input image sequences of varying facial expressions; and
- a processing unit to build a three-dimensional (3D) model using the obtained stereo data to approximate a generic shape as a linear combination of shape basis vectors and track a second input image sequence using the 3D model to approximate a linear combination of the principal shape vectors of a facial expression in the second input image sequence, wherein the second input image sequence is a monocular image sequence.
Dependent claims: 6, 7, 8
9. A non-transitory machine-readable medium providing instructions, which if executed by a processor, causes the processor to perform an operation comprising:
- obtaining stereo data based on input image sequences of varying facial expressions;
- building a three-dimensional (3D) model using the obtained stereo data to approximate a generic shape as a linear combination of shape basis vectors; and
- tracking a second input image sequence using the 3D model to approximate a linear combination of the principal shape vectors of a facial expression in the second input image sequence, wherein the second input image sequence is a monocular image sequence.
Dependent claims: 10, 11, 12
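The tracking step recited in the independent claims can be illustrated with a heavily simplified sketch. It is not the claimed method: it assumes an orthographic camera, a head pose (`R`, `t`) that is already known, and 2D point correspondences in the monocular frame, none of which the text above specifies. The function name `fit_coefficients` and all parameters are hypothetical. Under those assumptions, recovering the linear-combination coefficients reduces to a linear least-squares problem.

```python
import numpy as np

def fit_coefficients(points_2d, mean, basis, R, t):
    """Estimate shape coefficients from one monocular frame (sketch).

    points_2d: (N, 2) observed image points.
    mean:      (N, 3) mean 3D shape.
    basis:     (K, N, 3) shape basis vectors.
    R, t:      assumed-known rotation (3, 3) and 2D translation (2,).
    """
    P = R[:2]                                   # orthographic: keep x, y rows
    proj_mean = mean @ P.T + t                  # (N, 2) projected mean shape
    proj_basis = basis @ P.T                    # (K, N, 2) projected basis
    A = proj_basis.reshape(len(basis), -1).T    # (2N, K) design matrix
    b = (points_2d - proj_mean).reshape(-1)     # (2N,) residual to explain
    coeffs, *_ = np.linalg.lstsq(A, b, rcond=None)
    return coeffs

# Synthetic check with made-up data: project a known shape, then recover it.
rng = np.random.default_rng(1)
N, K = 6, 2
mean = rng.normal(size=(N, 3))
basis = rng.normal(size=(K, N, 3))
true_coeffs = np.array([0.4, -0.7])

c, s = np.cos(0.2), np.sin(0.2)
R = np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])  # rotation about y
t = np.array([1.0, -2.0])

shape_3d = mean + np.tensordot(true_coeffs, basis, axes=1)   # (N, 3)
points_2d = shape_3d @ R[:2].T + t
est_coeffs = fit_coefficients(points_2d, mean, basis, R, t)
```

A practical tracker would also have to estimate pose and establish the 2D correspondences frame by frame; the sketch isolates only the step of approximating a facial expression as a linear combination of shape vectors from a single view.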
Specification