Method and system for generating facial animation values based on a combination of visual and audio information
First Claim
1. Method for generating facial animation values using a sequence of facial image frames and synchronously captured audio data of a speaking actor, comprising the steps for:
providing a plurality of visual-facial-animation values based on tracking of facial features in the sequence of facial image frames of the speaking actor;
providing a plurality of audio-facial-animation values based on visemes detected using the synchronously captured audio voice data of the speaking actor; and
combining the plurality of visual facial animation values and the plurality of audio facial animation values to generate output facial animation values for use in facial animation.
Abstract
Facial animation values are generated using a sequence of facial image frames and synchronously captured audio data of a speaking actor. In the technique, a plurality of visual-facial-animation values are provided based on tracking of facial features in the sequence of facial image frames of the speaking actor, and a plurality of audio-facial-animation values are provided based on visemes detected using the synchronously captured audio voice data of the speaking actor. The plurality of visual facial animation values and the plurality of audio facial animation values are combined to generate output facial animation values for use in facial animation.
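As an illustration only, the combining step described above can be sketched as a per-frame blend of the two value sets. The text here does not state how the values are combined, so the linear weighting, the `audio_weight` parameter, the function names, and the parameter names such as `jaw_open` are all assumptions made for this sketch, not the patented method.

```python
def combine_animation_values(visual, audio, audio_weight=0.5):
    """Blend one frame of visual and audio facial animation values.

    Both inputs are dicts mapping a (hypothetical) animation parameter
    name to a float. Parameters present in both inputs are linearly
    blended; parameters present in only one input pass through unchanged.
    The linear blend is an assumption, not taken from the source.
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must be in [0, 1]")
    output = {}
    for name in visual.keys() | audio.keys():
        if name in visual and name in audio:
            output[name] = ((1.0 - audio_weight) * visual[name]
                            + audio_weight * audio[name])
        elif name in visual:
            output[name] = visual[name]
        else:
            output[name] = audio[name]
    return output


def combine_sequence(visual_frames, audio_frames, audio_weight=0.5):
    """Combine two synchronously captured sequences frame by frame."""
    if len(visual_frames) != len(audio_frames):
        raise ValueError("sequences must have the same number of frames")
    return [
        combine_animation_values(v, a, audio_weight)
        for v, a in zip(visual_frames, audio_frames)
    ]
```

In this sketch, mouth-related parameters that both sources estimate are averaged according to `audio_weight`, while parameters that only the visual tracking supplies (for example an eyebrow value) are carried into the output untouched.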
20 Claims
1. Method for generating facial animation values using a sequence of facial image frames and synchronously captured audio data of a speaking actor, comprising the steps for:
providing a plurality of visual-facial-animation values based on tracking of facial features in the sequence of facial image frames of the speaking actor;
providing a plurality of audio-facial-animation values based on visemes detected using the synchronously captured audio voice data of the speaking actor; and
combining the plurality of visual facial animation values and the plurality of audio facial animation values to generate output facial animation values for use in facial animation.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. Apparatus for generating facial animation values using a sequence of facial image frames and synchronously captured audio data of a speaking actor, comprising:
means for providing a plurality of visual-facial-animation values based on tracking of facial features in the sequence of facial image frames of the speaking actor;
means for providing a plurality of audio-facial-animation values based on visemes detected using the synchronously captured audio voice data of the speaking actor; and
means for combining the plurality of visual facial animation values and the plurality of audio facial animation values to generate output facial animation values for use in facial animation.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
Specification