Systems and methods for providing personal video services

US 8,842,154 B2
Filed: 07/03/2012
Issued: 09/23/2014
Est. Priority Date: 01/23/2007
Status: Active Grant


Abstract

Systems and methods for processing video are provided. Video compression schemes are provided to reduce the number of bits required to store and transmit digital media in video conferencing or videoblogging applications. A photorealistic avatar representation of a video conference participant is created. The avatar representation can be based on portions of a video stream that depict the conference participant. A face detector is used to identify, track and classify the face. Object models including density, structure, deformation, appearance and illumination models are created based on the detected face. An object based video compression algorithm, which uses machine learning face detection techniques, creates the photorealistic avatar representation from parameters derived from the density, structure, deformation, appearance and illumination models.

151 Citations

25 Claims

1. A method of video conferencing, the method comprising the computer implemented steps of:
- detecting a human face of a video conference participant depicted in portions of a video stream;
  
  creating, by explicitly modeling, one or more explicit object models to model the face of the video conference participant;
  
  generating one or more implicit object models relative to parameters obtained from the explicit object models to facilitate creation of a compact encoding of the video conference participant's face; and
  
  using the implicit object models, creating a photorealistic avatar representation of the video conference participant, wherein creating a photorealistic avatar representation of the video conference participant further includes enabling the video conference participant to adjust a gaze of their respective photorealistic avatar representation.
- Dependent claims (2-16):
  - 2. A method for providing video conferencing as in claim 1 wherein the face of the video conference participant is detected and tracked using a Viola/Jones face detection algorithm.
  - 3. A method for providing video conferencing as in claim 1 wherein the implicit object models provide an implicit representation of the face of the video conference participant.
  - 4. A method for providing video conferencing as in claim 3 wherein the implicit representation of the video conference participant is a simulated representation of the face of the video conference participant.
  - 5. A method for providing video conferencing as in claim 3 wherein the detecting and tracking comprise using a Viola/Jones face detection algorithm further includes the steps of:
    - identifying corresponding elements of at least one object associated with the face in two or more video frames from the video stream; and
      
      tracking and classifying the corresponding elements to identify relationships between the corresponding elements based on previously calibrated and modeled faces.
  - 6. A method for providing video conferencing as in claim 1 wherein the explicit object models include one or more object models for structure, deformation, pose, motion, illumination, and appearance.
  - 7. A method for providing video conferencing as in claim 1 wherein the implicit object models are configured using parameters obtained from the explicit object models, such that the explicit object model parameters are used as a ground truth for estimating portions of the video stream with the implicit object models.
  - 8. A method for providing video conferencing as in claim 7 wherein the explicit object model parameters are used to define expectations about how lighting interacts with the structure of the face of the video conference participant.
  - 9. A method for providing video conferencing as in claim 7 wherein the explicit object model parameters are used to limit a search space to the face or portions thereof for the implicit object modeling.
  - 10. A method for providing video conferencing as in claim 1 further includes periodically checking to determine whether the implicit object modeling is working optimally.
  - 11. A method for providing video conferencing as in claim 10 wherein periodically checking to determine whether the implicit object modeling is working optimally further includes determining that the implicit object models, which are used to create the photorealistic avatar representation, are working optimally by:
    - determining that reprojection error is low in the photorealistic avatar representation; and
      
      determining that there is a significant amount of motion in the photorealistic avatar representation.
  - 12. A method for providing video conferencing as in claim 10 wherein the determination that the implicit object modeling is working optimally causes subsequent instances of the photorealistic avatar representation of the conference participant to be created without relying on the step of detecting a human face in the portions of the video stream.
  - 13. A method for providing video conferencing as in claim 10 wherein determining that the implicit object modeling is not working optimally by:
    - determining that processing of the photorealistic avatar representation uses a disproportional amount of transmission bandwidth;
      
      or determining that the implicit object modeling is not working optimally if reprojection error is high.
  - 14. A method for providing video conferencing as in claim 10 further includes responding to the determination that the implicit object modeling is not working by processing the step of detecting a human face of a video conference participant;
    - and in response to detecting a human face, searching for existing calibration information for the detected human face.
  - 15. A method for providing video conferencing as in claim 14 wherein if a human face is undetectable, using a Viola-Jones face detector to facilitate detection.
  - 16. A method for providing video conferencing as in claim 1 wherein the gaze adjustment enables configuration of the gaze of the photorealistic avatar representation, such that it causes eyes of the photorealistic avatar representation to appear to focus directly in the direction of a video camera.
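
Claims 10 to 14 describe a periodic check on the implicit object modeling. The following is a minimal sketch of that decision logic, not the patented implementation. The claims give no numeric thresholds, so `MAX_ERROR`, `MIN_MOTION` and `MAX_BANDWIDTH_SHARE` are hypothetical values, as are the `ModelStats` fields and the function name. Checking the failure conditions first is also an assumption, since the claims do not say which test takes priority.

```python
from dataclasses import dataclass


@dataclass
class ModelStats:
    """Hypothetical per-interval measurements; the field names are assumptions."""
    reprojection_error: float  # error between source frames and the avatar
    motion: float              # amount of motion seen in the avatar
    bandwidth_share: float     # fraction of the transmission budget in use


# Hypothetical thresholds: the claims only say "low", "high",
# "significant" and "disproportional", without numbers.
MAX_ERROR = 0.05
MIN_MOTION = 1.0
MAX_BANDWIDTH_SHARE = 0.8


def check_modeling(stats: ModelStats) -> str:
    """Classify the state of the implicit modeling for one periodic check."""
    # Claim 13: disproportionate bandwidth OR high reprojection error.
    if (stats.bandwidth_share > MAX_BANDWIDTH_SHARE
            or stats.reprojection_error > MAX_ERROR):
        return "not_optimal"   # claim 14: run face detection again
    # Claim 11: low reprojection error AND significant motion.
    if stats.motion >= MIN_MOTION:
        return "optimal"       # claim 12: face detection can be skipped
    # Neither set of conditions is met; the claims do not cover this case.
    return "undetermined"
```

A caller could run `check_modeling` at a fixed interval and skip the face detection step for subsequent avatar frames while the result stays `"optimal"`.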

17. A computer program product for facilitating video conferencing, the computer program product being embodied on a non-transitory computer-readable medium and comprising code configured so as when executed on a computer to perform operations of:
- creating, by explicitly modeling, one or more explicit object models to model a detected face of a video conference participant;
  
  generating one or more implicit object models relative to parameters obtained from the explicit object models to facilitate creation of a compact encoding of the video conference participant's face;
  
  using the implicit object models, creating a photorealistic avatar representation of the video conference participant; and
  
  enabling the video conference participant to adjust a gaze of their respective photorealistic avatar representation.

18. A video conferencing system comprising:
- a face detector configured to detect a face of a video conference participant in a video stream;
  
  a calibrator configured to generate a calibration model calibrating the face of the video conference participant;
  
  an explicit object modeler configured to generate one or more explicit object models, in combination with the calibrator and face detector, the explicit object models modeling portions of the video stream depicting the face of the video conference participant based on the calibration model;
  
  an implicit object modeler configured to build one or more implicit object models relative to parameters from the explicit object models to facilitate creation of a compact encoding of the participant's face;
  
  the system operable to generate a photorealistic avatar representation of the video conference participant from the implicit models; and
  
  the system further operable to enable the video conference participant to adjust a gaze of their respective photorealistic avatar representation.

19. A method of video conferencing, the method comprising the computer implemented steps of:
- generating explicit object models to model a human face of a video conference participant depicted in portions of a video stream;
  
  using parameters from the explicit object models, generating implicit object models to create a photorealistic avatar representation of the video conference participant, where the explicit object model parameters are used to define expectations for the implicit object models regarding how lighting interacts with a structure of the face of the video conference participant; and
  
  enabling the video conference participant to adjust a gaze of their respective photorealistic avatar representation.

20. A video conferencing system comprising:
- a face detector configured to detect a face of a video conference participant in a video stream;
  
  a calibrator configured to generate a calibration model calibrating the face of the video conference participant;
  
  an explicit object modeler configured to generate one or more explicit object models, in combination with the calibrator and face detector, the explicit object models modeling portions of the video stream depicting the face of the video conference participant based on the calibration model;
  
  an implicit object modeler configured to build one or more implicit object models relative to parameters from the explicit object models to facilitate creation of a compact encoding of the participant's face;
  
  the system operable to generate a photorealistic avatar representation of the video conference participant from the implicit models; and
  
  the system operable to periodically check to determine whether the implicit object modeling is working optimally, where the determination that the implicit object modeling is working optimally causes subsequent instances of the photorealistic avatar representation of the conference participant to be created without relying on the step of detecting a human face in the portions of the video stream.

21. A video conferencing system comprising:
- a face detector configured to detect a face of a video conference participant in a video stream;
  
  a calibrator configured to generate a calibration model calibrating the face of the video conference participant;
  
  an explicit object modeler configured to generate one or more explicit object models, in combination with the calibrator and face detector, the explicit object models modeling portions of the video stream depicting the face of the video conference participant based on the calibration model;
  
  an implicit object modeler configured to build one or more implicit object models relative to parameters from the explicit object models to facilitate creation of a compact encoding of the participant's face;
  
  the system operable to generate a photorealistic avatar representation of the video conference participant from the implicit models; and
  
  the system operable to periodically check to determine whether the implicit object modeling is working optimally;
  
  wherein determining that the implicit object modeling is not working optimally by:
  
  determining that processing of the photorealistic avatar representation uses a disproportional amount of transmission bandwidth;
  
  or determining that the implicit object modeling is not working optimally if reprojection error is high.

22. A method of video conferencing, the method comprising the computer implemented steps of:
- detecting a human face of a video conference participant depicted in portions of a video stream;
  
  creating, by explicitly modeling, one or more explicit object models to model the face of the video conference participant;
  
  generating one or more implicit object models relative to parameters obtained from the explicit object models to facilitate creation of a compact encoding of the video conference participant's face;
  
  using the implicit object models, creating a photorealistic avatar representation of the video conference participant;
  
  wherein the implicit object models provide an implicit representation of the face of the video conference participant;
  
  wherein the detecting and tracking comprise using a Viola/Jones face detection algorithm further includes the steps of:
  
  identifying corresponding elements of at least one object associated with the face in two or more video frames from the video stream; and
  
  tracking and classifying the corresponding elements to identify relationships between the corresponding elements based on previously calibrated and modeled faces.
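
The claims name the Viola/Jones face detection algorithm without describing it. As general background, that detector evaluates Haar-like rectangle features in constant time using an integral image. The sketch below shows only those two building blocks in plain Python; the function names are illustrative, and the trained cascade of boosted classifiers that a full detector needs is not shown.

```python
def integral_image(img):
    """Return a (h+1) x (w+1) summed-area table for a 2D list of pixel values."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            # Sum of all pixels above and to the left, inclusive.
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, x, y, w, h):
    """Sum of the pixels in the w x h rectangle with top-left corner (x, y)."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def two_rect_feature(ii, x, y, w, h):
    """Two-rectangle Haar-like feature: left half minus right half."""
    half = w // 2
    return rect_sum(ii, x, y, half, h) - rect_sum(ii, x + half, y, half, h)


if __name__ == "__main__":
    image = [[1, 2], [3, 4]]
    table = integral_image(image)
    print(rect_sum(table, 0, 0, 2, 2))          # sum of the whole image
    print(two_rect_feature(table, 0, 0, 2, 2))  # left column minus right column
```

Each rectangle sum costs four table lookups regardless of the rectangle's size, which is what makes scanning many window positions and scales practical.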

23. A video conferencing system comprising:
- a face detector configured to detect a face of a video conference participant in a video stream;
  
  a calibrator configured to generate a calibration model calibrating the face of the video conference participant;
  
  an explicit object modeler configured to generate one or more explicit object models, in combination with the calibrator and face detector, the explicit object models modeling portions of the video stream depicting the face of the video conference participant based on the calibration model;
  
  an implicit object modeler configured to build one or more implicit object models relative to parameters from the explicit object models to facilitate creation of a compact encoding of the participant's face;
  
  the system operable to generate a photorealistic avatar representation of the video conference participant from the implicit models; and
  
  wherein the implicit object models provide an implicit representation of the face of the video conference participant;
  
  wherein the face detector includes a Viola/Jones face detector further includes the steps of:
  
  identifying corresponding elements of at least one object associated with the face in two or more video frames from the video stream; and
  
  tracking and classifying the corresponding elements to identify relationships between the corresponding elements based on previously calibrated and modeled faces.

24. A video conferencing system comprising:
- a face detector configured to detect a face of a video conference participant in a video stream;
  
  a calibrator configured to generate a calibration model calibrating the face of the video conference participant;
  
  an explicit object modeler configured to generate one or more explicit object models, in combination with the calibrator and face detector, the explicit object models modeling portions of the video stream depicting the face of the video conference participant based on the calibration model;
  
  an implicit object modeler configured to build one or more implicit object models relative to parameters from the explicit object models to facilitate creation of a compact encoding of the participant's face;
  
  the system operable to periodically check to determine whether the implicit object modeling is working optimally;
  
  the system operable to respond to the determination that the implicit object modeling is not working by processing the step of detecting a human face of a video conference participant; and
  
  in response to detecting a human face, the system operable to search for existing calibration information for the detected human face.
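
The recovery path in claim 24 (detect a face again, then search for existing calibration information) can be sketched as a lookup with a fallback. This is an illustration under stated assumptions: the claim does not say how a detected face is keyed, so the `face_id` returned by the hypothetical `detect_face` callable is an assumption, and generating a new calibration when none is found is inferred from the calibrator element rather than stated in this claim.

```python
from typing import Any, Callable, Dict, Optional


def recover_calibration(
    frame: Any,
    detect_face: Callable[[Any], Optional[str]],
    calibrations: Dict[str, Any],
    calibrate: Callable[[Any, str], Any],
) -> Optional[Any]:
    """Run after the periodic check reports that modeling is not working."""
    # Step 1: process the face detection step again.
    face_id = detect_face(frame)
    if face_id is None:
        return None  # no face found in this frame; the caller may retry later

    # Step 2: search for existing calibration information for this face.
    if face_id in calibrations:
        return calibrations[face_id]

    # Assumption: fall back to the calibrator when nothing is stored.
    calibrations[face_id] = calibrate(frame, face_id)
    return calibrations[face_id]
```

Passing the detector and calibrator in as callables keeps the sketch independent of any particular face detection library.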

25. A video conferencing system comprising:
- a face detector configured to detect a face of a video conference participant in a video stream;
  
  a calibrator configured to generate a calibration model calibrating the face of the video conference participant;
  
  an explicit object modeler configured to generate one or more explicit object models, in combination with the calibrator and face detector, the explicit object models modeling portions of the video stream depicting the face of the video conference participant based on the calibration model;
  
  an implicit object modeler configured to build one or more implicit object models relative to parameters from the explicit object models to facilitate creation of a compact encoding of the participant's face;
  
  the system operable to periodically check to determine whether the implicit object modeling is working optimally, the system operable to determine that the implicit object models, which are used to create the photorealistic avatar representation, are working optimally by:
  
  determining that reprojection error is low in the photorealistic avatar representation; and
  
  determining that there is a significant amount of motion in the photorealistic avatar representation.
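
The claims refer to a compact encoding and to reprojection error without defining either. One common reading, assumed here and not confirmed by the text above, is a linear model: a face vector is projected onto a small orthonormal basis (the compact encoding), reconstructed from the coefficients, and compared with the original. The basis and mean below are toy values chosen for illustration, and the function names are hypothetical.

```python
def encode(x, mean, basis):
    """Project a vector onto an orthonormal basis; returns the coefficients."""
    centered = [xi - mi for xi, mi in zip(x, mean)]
    return [sum(b * c for b, c in zip(vec, centered)) for vec in basis]


def decode(coeffs, mean, basis):
    """Reconstruct a vector from its coefficients."""
    out = list(mean)
    for c, vec in zip(coeffs, basis):
        for i, b in enumerate(vec):
            out[i] += c * b
    return out


def reprojection_error(x, mean, basis):
    """Mean squared error between a vector and its reconstruction."""
    recon = decode(encode(x, mean, basis), mean, basis)
    return sum((a - b) ** 2 for a, b in zip(x, recon)) / len(x)


if __name__ == "__main__":
    mean = [0.0, 0.0, 0.0]
    basis = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]  # toy 2-vector basis in 3D
    print(encode([3.0, 4.0, 2.0], mean, basis))
    print(reprojection_error([3.0, 4.0, 0.0], mean, basis))
    print(reprojection_error([3.0, 4.0, 2.0], mean, basis))
```

Under this reading, a low error means the basis still spans the incoming face data, while a rising error suggests the model no longer fits and should be rebuilt.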

Current Assignee: Euclid Discoveries LLC
Original Assignee: Euclid Discoveries LLC
Inventors: Pace, Charles P.
Primary Examiner(s): El-Zoobi, Maria
Application Number: US13/541,453
Publication Number: US 20120281063A1
Time in Patent Office: 812 Days
Field of Search: None
US Class Current: 348/14.01

CPC Class Codes

G06V 10/7557   based on appearance, e.g. a...
G06V 40/167   using comparisons between t...
H04N 21/23412   for generating or manipulat...
H04N 21/4223   Cameras H04N23/00 takes pre...
H04N 21/44012   involving rendering scenes ...
H04N 21/4415   using biometric characteris...
H04N 21/4532   involving end-user characte...
H04N 21/4788   communicating with other us...
H04N 7/147   Communication arrangements,...
H04N 7/157   defining a virtual conferen...
