Systems and methods for providing personal video services
First Claim
1. A method of video conferencing, the method comprising the computer implemented steps of:
- detecting a human face of a video conference participant depicted in portions of a video stream;
creating, by explicitly modeling, one or more explicit object models to model the face of the video conference participant;
generating one or more implicit object models relative to parameters obtained from the explicit object models to facilitate creation of a compact encoding of the video conference participant'"'"'s face; and
using the implicit object models, creating a photorealistic avatar representation of the video conference participant, wherein creating a photorealistic avatar representation of the video conference participant further includes enabling the video conference participant to adjust a gaze of their respective photorealistic avatar representation.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods for processing video are provided. Video compression schemes are provided to reduce the number of bits required to store and transmit digital media in video conferencing or videoblogging applications. A photorealistic avatar representation of a video conference participant is created. The avatar representation can be based on portions of a video stream that depict the conference participant. A face detector is used to identify, track and classify the face. Object models including density, structure, deformation, appearance and illumination models are created based on the detected face. An object based video compression algorithm, which uses machine learning face detection techniques, creates the photorealistic avatar representation from parameters derived from the density, structure, deformation, appearance and illumination models.
151 Citations
25 Claims
-
1. A method of video conferencing, the method comprising the computer implemented steps of:
-
detecting a human face of a video conference participant depicted in portions of a video stream; creating, by explicitly modeling, one or more explicit object models to model the face of the video conference participant; generating one or more implicit object models relative to parameters obtained from the explicit object models to facilitate creation of a compact encoding of the video conference participant'"'"'s face; and using the implicit object models, creating a photorealistic avatar representation of the video conference participant, wherein creating a photorealistic avatar representation of the video conference participant further includes enabling the video conference participant to adjust a gaze of their respective photorealistic avatar representation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer program product for facilitating video conferencing, the computer program product being embodied on a non-transitory computer-readable medium and comprising code configured so as when executed on a computer to perform operations of:
-
creating, by explicitly modeling, one or more explicit object models to model a detected face of a video conference participant; generating one or more implicit object models relative to parameters obtained from the explicit object models to facilitate creation of a compact encoding of the video conference participant'"'"'s face; using the implicit object models, creating a photorealistic avatar representation of the video conference participant; and enabling the video conference participant to adjust a gaze of their respective photorealistic avatar representation.
-
-
18. A video conferencing system comprising:
-
a face detector configured to detect a face of a video conference participant in a video stream; a calibrator configured to generate a calibration model calibrating the face of the video conference participant; an explicit object modeler configured to generate one or more explicit object models, in combination with the calibrator and face detector, the explicit object models modeling portions of the video stream depicting the face of the video conference participant based on the calibration model; an implicit object modeler configured to build one or more implicit object models relative to parameters from the explicit object models to facilitate creation of a compact encoding of the participant'"'"'s face; the system operable to generate a photorealistic avatar representation of the video conference participant from the implicit models; and the system further operable to enable the video conference participant to adjust a gaze of their respective photorealistic avatar representation.
-
-
19. A method of video conferencing, the method comprising the computer implemented steps of:
-
generating explicit object models to model a human face of a video conference participant depicted in portions of a video stream; using parameters from the explicit object models, generating implicit object models to create a photorealistic avatar representation of the video conference participant, where the explicit object model parameters are used to define expectations for the implicit object models regarding how lighting interacts with a structure of the face of the video conference participant; and enabling the video conference participant to adjust a gaze of their respective photorealistic avatar representation.
-
-
20. A video conferencing system comprising:
-
a face detector configured to detect a face of a video conference participant in a video stream; a calibrator configured to generate a calibration model calibrating the face of the video conference participant; an explicit object modeler configured to generate one or more explicit object models, in combination with the calibrator and face detector, the explicit object models modeling portions of the video stream depicting the face of the video conference participant based on the calibration model; an implicit object modeler configured to build one or more implicit object models relative to parameters from the explicit object models to facilitate creation of a compact encoding of the participant'"'"'s face; the system operable to generate a photorealistic avatar representation of the video conference participant from the implicit models; and the system operable to periodically check to determine whether the implicit object modeling is working optimally, where the determination that the implicit object modeling is working optimally causes subsequent instances of the photorealistic avatar representation of the conference participant to be created without relying on the step of detecting a human face in the portions of the video stream.
-
-
21. A video conferencing system comprising:
-
a face detector configured to detect a face of a video conference participant in a video stream; a calibrator configured to generate a calibration model calibrating the face of the video conference participant; an explicit object modeler configured to generate one or more explicit object models, in combination with the calibrator and face detector, the explicit object models modeling portions of the video stream depicting the face of the video conference participant based on the calibration model; an implicit object modeler configured to build one or more implicit object models relative to parameters from the explicit object models to facilitate creation of a compact encoding of the participant'"'"'s face; the system operable to generate a photorealistic avatar representation of the video conference participant from the implicit models; and the system operable to periodically check to determine whether the implicit object modeling is working optimally; wherein determining that the implicit object modeling is not working optimally by; determining that processing of the photorealistic avatar representation uses a disproportional amount of transmission bandwidth;
ordetermining that the implicit object modeling is not working optimally if reprojection error is high.
-
-
22. A method of video conferencing, the method comprising the computer implemented steps of:
-
detecting a human face of a video conference participant depicted in portions of a video stream; creating, by explicitly modeling, one or more explicit object models to model the face of the video conference participant; generating one or more implicit object models relative to parameters obtained from the explicit object models to facilitate creation of a compact encoding of the video conference participant'"'"'s face; using the implicit object models, creating a photorealistic avatar representation of the video conference participant; wherein the implicit object models provide an implicit representation of the face of the video conference participant; wherein the detecting and tracking comprise using a Viola/Jones face detection algorithm further includes the steps of; identifying corresponding elements of at least one object associated with the face in two or more video frames from the video stream; and tracking and classifying the corresponding elements to identify relationships between the corresponding elements based on previously calibrated and modeled faces.
-
-
23. A video conferencing system comprising:
-
a face detector configured to detect a face of a video conference participant in a video stream; a calibrator configured to generate a calibration model calibrating the face of the video conference participant; an explicit object modeler configured to generate one or more explicit object models, in combination with the calibrator and face detector, the explicit object models modeling portions of the video stream depicting the face of the video conference participant based on the calibration model; an implicit object modeler configured to build one or more implicit object models relative to parameters from the explicit object models to facilitate creation of a compact encoding of the participant'"'"'s face; the system operable to generate a photorealistic avatar representation of the video conference participant from the implicit models; and wherein the implicit object models provide an implicit representation of the face of the video conference participant; wherein the face detector includes a Viola/Jones face detector further includes the steps of; identifying corresponding elements of at least one object associated with the face in two or more video frames from the video stream; and tracking and classifying the corresponding elements to identify relationships between the corresponding elements based on previously calibrated and modeled faces.
-
-
24. A video conferencing system comprising:
-
a face detector configured to detect a face of a video conference participant in a video stream; a calibrator configured to generate a calibration model calibrating the face of the video conference participant; an explicit object modeler configured to generate one or more explicit object models, in combination with the calibrator and face detector, the explicit object models modeling portions of the video stream depicting the face of the video conference participant based on the calibration model; an implicit object modeler configured to build one or more implicit object models relative to parameters from the explicit object models to facilitate creation of a compact encoding of the participant'"'"'s face; the system operable to periodically check to determine whether the implicit object modeling is working optimally; the system operable to respond to the determination that the implicit object modeling is not working by processing the step of detecting a human face of a video conference participant; and in response to detecting a human face, the system operable to search for existing calibration information for the detected human face.
-
-
25. A video conferencing system comprising:
-
a face detector configured to detect a face of a video conference participant in a video stream; a calibrator configured to generate a calibration model calibrating the face of the video conference participant; an explicit object modeler configured to generate one or more explicit object models, in combination with the calibrator and face detector, the explicit object models modeling portions of the video stream depicting the face of the video conference participant based on the calibration model; an implicit object modeler configured to build one or more implicit object models relative to parameters from the explicit object models to facilitate creation of a compact encoding of the participant'"'"'s face; the system operable to periodically check to determine whether the implicit object modeling is working optimally, the system operable to determine that the implicit object models, which are used to create the photorealistic avatar representation, are working optimally by; determining that reprojection error is low in the photorealistic avatar representation; and determining that there is a significant amount of motion in the photorealistic avatar representation.
-
Specification