Computing system for expressive three-dimensional facial animation

  • US 11,238,885 B2
  • Filed: 10/29/2018
  • Issued: 02/01/2022
  • Est. Priority Date: 10/29/2018
  • Status: Active Grant
First Claim

1. A computing device, comprising:

  • a processor;

    memory storing instructions, wherein the instructions, when executed by the processor, cause the processor to perform acts comprising:

    receiving an audio sequence reflective of words uttered by a speaker;

    based upon the audio sequence, generating a first set of coefficients that are indicative of lips of the speaker as the speaker utters the words, wherein the first set of coefficients are generated based upon latent content variables that have been generated via a computer-implemented model, wherein the computer-implemented model has been trained without utilization of motion capture data, wherein the latent content variables are generated by the computer-implemented model without utilization of motion capture techniques;

    based upon the audio sequence, generating a second set of coefficients that are indicative of facial features of the speaker other than the lips of the speaker as the speaker utters the words, wherein the second set of coefficients are generated based upon latent style variables that have been generated via the computer-implemented model, wherein the latent style variables comprise latent identity variables that are based upon identity factors of a plurality of speakers as the plurality of speakers speak and latent emotional variables that are based upon emotions of the plurality of speakers as the plurality of speakers speak, wherein the latent style variables are generated by the computer-implemented model without utilization of the motion capture techniques;

    generating a third set of coefficients based upon the first set of coefficients and the second set of coefficients; and

    causing a visual representation of a face to be animated on a display based upon the third set of coefficients such that movement of lips of the visual representation reflects the words uttered by the speaker while the visual representation is animated, wherein facial features of the visual representation of the face other than the lips are synced to the lips of the visual representation, and further wherein the visual representation of the face reflects an identity of the speaker and an emotion of the speaker as the speaker utters the words while the visual representation of the face is animated.
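
The data flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the claim does not disclose the model architecture, the coefficient format, or how the third set is derived from the first two. All names below (`Latents`, `encode_audio`, `lip_coefficients`, `non_lip_coefficients`, `combine`, `face_coefficients`) are hypothetical, the "model" is a toy stand-in built from simple signal statistics, and combining by concatenation is an assumption made only for illustration.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Latents:
    """Hypothetical container for the two latent groups named in the claim."""
    content: List[float]  # latent content variables (drive the lips)
    style: List[float]    # latent style variables (identity and emotion)


def encode_audio(audio: Sequence[float]) -> Latents:
    """Toy stand-in for the trained computer-implemented model.

    A real system would run a learned encoder here; this placeholder
    only derives simple statistics from the audio samples.
    """
    n = max(len(audio), 1)
    mean_abs = sum(abs(x) for x in audio) / n
    peak = max((abs(x) for x in audio), default=0.0)
    return Latents(content=[mean_abs, peak], style=[peak - mean_abs])


def _clamp01(x: float) -> float:
    """Clamp a value into the range [0, 1]."""
    return min(1.0, max(0.0, x))


def lip_coefficients(content: Sequence[float]) -> List[float]:
    """First set: coefficients indicative of the lips."""
    return [_clamp01(c) for c in content]


def non_lip_coefficients(style: Sequence[float]) -> List[float]:
    """Second set: coefficients for facial features other than the lips."""
    return [_clamp01(s) for s in style]


def combine(first: Sequence[float], second: Sequence[float]) -> List[float]:
    """Third set. Concatenation is an assumption; the claim only says the
    third set is based upon the first and second sets."""
    return list(first) + list(second)


def face_coefficients(audio: Sequence[float]) -> List[float]:
    """Run the claimed steps in order: audio -> latents -> three sets."""
    latents = encode_audio(audio)
    first = lip_coefficients(latents.content)
    second = non_lip_coefficients(latents.style)
    return combine(first, second)


if __name__ == "__main__":
    # The resulting list would be handed to a renderer to animate a face.
    print(face_coefficients([0.0, 0.5, -1.0, 0.5]))
```

Keeping the lip and non-lip coefficients in separate functions mirrors the claim's separation of latent content variables from latent style variables; a renderer that animates the face from the third set is outside the scope of this sketch.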
