Method for generating an immersive video of a plurality of persons
First Claim
1. A method for generating an immersive video of a plurality of persons in a computer server, the method comprising:
receiving a plurality of video streams on a plurality of video channels from a plurality of client devices, each video stream including a silhouette of a person;
analyzing the video stream received from a client device to detect a silhouette metadata channel carrying silhouette data, the silhouette data being selected from the group consisting of coarse silhouette information representing coordinates of a characteristic point of the face of the person and fine silhouette information representing a mask corresponding to the shape and location of the person within a frame of the video stream;
analyzing the silhouette data when a silhouette metadata channel has been detected; and
in the case that a silhouette metadata channel is detected, extracting the person's silhouette as a function of the silhouette data and generating a filtered video stream representing only the silhouette of the person;
in the case that a silhouette metadata channel has not been detected, generating in the computer server silhouette data from the video stream received and extracting the person's silhouette from the incoming video stream as a function of the silhouette data generated; and
generating a video stream of a virtual scene comprising a plurality of the silhouettes extracted from the plurality of filtered video streams.
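The steps of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: frames are plain nested lists of RGB tuples, the fine silhouette information is assumed to be a 0/1 mask of the same size, and `naive_segmenter` is a hypothetical stand-in for whatever silhouette generation the server actually performs when no metadata channel is detected.

```python
from typing import Callable, List, Optional, Tuple

Pixel = Tuple[int, int, int]
Frame = List[List[Pixel]]   # rows of RGB pixels
Mask = List[List[int]]      # 1 = person, 0 = background (assumed encoding)


def naive_segmenter(frame: Frame, background: Pixel = (0, 0, 0)) -> Mask:
    """Hypothetical server-side fallback: treat every pixel that differs
    from a known flat background colour as part of the person."""
    return [[0 if px == background else 1 for px in row] for row in frame]


def extract_silhouette(
    frame: Frame,
    client_mask: Optional[Mask],
    segmenter: Callable[[Frame], Mask] = naive_segmenter,
) -> List[List[Optional[Pixel]]]:
    """Return the filtered frame: person pixels kept, everything else None.

    client_mask is None when no silhouette metadata channel was detected,
    in which case the mask is generated on the server."""
    mask = client_mask if client_mask is not None else segmenter(frame)
    return [
        [px if keep else None for px, keep in zip(row, mask_row)]
        for row, mask_row in zip(frame, mask)
    ]


def compose_scene(scene: Frame, silhouettes, offsets) -> Frame:
    """Paste each filtered frame onto a copy of the virtual scene at its
    (row, column) offset, skipping background (None) pixels."""
    out = [list(row) for row in scene]
    for silhouette, (r0, c0) in zip(silhouettes, offsets):
        for r, row in enumerate(silhouette):
            for c, px in enumerate(row):
                inside = 0 <= r0 + r < len(out) and 0 <= c0 + c < len(out[0])
                if px is not None and inside:
                    out[r0 + r][c0 + c] = px
    return out
```

A stream that arrives with a mask skips the segmenter entirely, which is the saving the metadata channel is meant to provide; a stream without one falls back to server-side processing.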
Abstract
A method for generating an immersive video of a plurality of persons in a computer server, the method comprising: receiving a plurality of video streams on a plurality of video channels from a plurality of client devices, each video stream including a silhouette of a person; extracting the person's silhouette from each incoming video stream to generate a filtered video stream representing only the silhouette of the person; and generating a video stream of a virtual scene comprising a plurality of the silhouettes extracted from the plurality of filtered video streams; the method further comprising: receiving silhouette data carried on a metadata channel from a client device in addition to the video stream, the silhouette data representing a position of a face of the person within a frame of the video stream; analyzing the silhouette data; and performing the silhouette extraction as a function of the silhouette data analyzed.
15 Claims
1. A method for generating an immersive video of a plurality of persons in a computer server, the method comprising:
receiving a plurality of video streams on a plurality of video channels from a plurality of client devices, each video stream including a silhouette of a person;
analyzing the video stream received from a client device to detect a silhouette metadata channel carrying silhouette data, the silhouette data being selected from the group consisting of coarse silhouette information representing coordinates of a characteristic point of the face of the person and fine silhouette information representing a mask corresponding to the shape and location of the person within a frame of the video stream;
analyzing the silhouette data when a silhouette metadata channel has been detected; and
in the case that a silhouette metadata channel is detected, extracting the person's silhouette as a function of the silhouette data and generating a filtered video stream representing only the silhouette of the person;
in the case that a silhouette metadata channel has not been detected, generating in the computer server silhouette data from the video stream received and extracting the person's silhouette from the incoming video stream as a function of the silhouette data generated; and
generating a video stream of a virtual scene comprising a plurality of the silhouettes extracted from the plurality of filtered video streams.
11. A method of generating a video stream in a client device including:
capturing a video stream from a capture device;
performing a silhouette extraction process from the captured video stream to generate silhouette data;
encoding the silhouette data generated by the silhouette extraction process; and
sending the video stream in a four channels video format including three channels of raw video data using a color space for the video stream and one metadata channel for the silhouette data generated by the silhouette extraction process.
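One way to picture the four-channel format of claim 11 is as interleaved bytes, three colour components followed by one metadata byte per pixel. The sketch below rests on assumptions the claim does not state: 8-bit components, a mask encoded as 255 (person) or 0 (background), and the hypothetical helper names `pack_frame` and `unpack_frame`.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]


def pack_frame(frame: List[List[Pixel]], mask: List[List[int]]) -> bytes:
    """Interleave three colour bytes and one silhouette byte per pixel.

    The 255/0 mask encoding is an assumption made for this sketch."""
    buf = bytearray()
    for row, mask_row in zip(frame, mask):
        for (c0, c1, c2), m in zip(row, mask_row):
            buf += bytes((c0, c1, c2, 255 if m else 0))
    return bytes(buf)


def unpack_frame(data: bytes, width: int):
    """Split a packed buffer back into (frame, mask), as a receiver might."""
    stride = 4 * width  # bytes per row
    if width <= 0 or len(data) % stride != 0:
        raise ValueError("buffer length does not match the frame width")
    frame, mask = [], []
    for start in range(0, len(data), stride):
        row = data[start:start + stride]
        frame.append([tuple(row[i:i + 3]) for i in range(0, stride, 4)])
        mask.append([1 if row[i + 3] else 0 for i in range(0, stride, 4)])
    return frame, mask
```

Carrying the mask in a fourth channel keeps it aligned with the pixels it describes, so the receiver needs no separate synchronisation between video and silhouette data.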
13. A video processing server including:
a video decoder block able to receive video streams from a plurality of client devices and to generate respective decoded video streams;
a client processing detection block able to detect a metadata channel in an incoming video stream;
a first silhouette extraction sub-processing block able to perform a silhouette extraction process in a respective decoded video stream to generate coarse silhouette information;
a second silhouette extraction sub-processing block able to perform a silhouette extraction process to generate fine silhouette information as a function of coarse silhouette information and the respective decoded video stream; and
an immersive rendering block able to use fine silhouette data to extract silhouette video streams of respective persons from respective video streams and mix a plurality of silhouette video streams to generate a virtual scene;
wherein the client processing detection block is able to transmit to the video decoder block instructions to send the decoded video stream:
to the first silhouette extraction sub-processing block when no metadata channel is detected or when the metadata channel does not contain any silhouette data;
to the second silhouette extraction sub-processing block when a metadata channel is detected and contains coarse silhouette information, the coarse silhouette information representing coordinates of a location of a person within a frame of the video stream; and
to the immersive rendering block when a metadata channel is detected and contains fine silhouette information representing a mask corresponding to the shape and location of the person within a frame of the video stream;
the video processing server further including a video encoder block able to encode and send a video stream comprising the virtual scene to a client device.
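The routing rule applied by the client processing detection block in claim 13 can be sketched as a lookup from the kind of silhouette data found to the first block that must run. The metadata layout (a dict with a `"mask"` or `"face_point"` key) and all names below are hypothetical; reading the three blocks as a chain, where each stage feeds the next, is an interpretation of the claim rather than something it states outright.

```python
from enum import Enum
from typing import List, Optional


class SilhouetteInfo(Enum):
    NONE = "none"      # no metadata channel, or a channel with no silhouette data
    COARSE = "coarse"  # e.g. coordinates locating the person in the frame
    FINE = "fine"      # a mask giving the person's shape and location


# Processing blocks in the order a stream would pass through them.
PIPELINE = ["coarse_extraction", "fine_extraction", "immersive_rendering"]

# Index of the first block required for each kind of silhouette data.
ENTRY_POINT = {
    SilhouetteInfo.NONE: 0,
    SilhouetteInfo.COARSE: 1,
    SilhouetteInfo.FINE: 2,
}


def classify(metadata: Optional[dict]) -> SilhouetteInfo:
    """Inspect the (hypothetical) decoded metadata channel of one stream."""
    if not metadata:
        return SilhouetteInfo.NONE
    if "mask" in metadata:
        return SilhouetteInfo.FINE
    if "face_point" in metadata:
        return SilhouetteInfo.COARSE
    return SilhouetteInfo.NONE


def stages_for(metadata: Optional[dict]) -> List[str]:
    """Return the blocks the decoded stream still has to pass through."""
    return PIPELINE[ENTRY_POINT[classify(metadata)]:]
```

The more work a client has already done, the later its stream enters the pipeline, so server load scales with the number of clients that send no silhouette data at all.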
15. A method for generating an immersive video in a computer server, the method comprising:
receiving a video stream from a client device of an associated user;
determining whether the video stream includes fine silhouette information representing a mask corresponding to the silhouette of the user suitable for identifying pixels in the video stream to be used in generation of the immersive video, coarse silhouette information representing face detection of the user suitable for aiding the generation of fine silhouette information, or no silhouette information describing a silhouette of the user;
under the condition that the video stream includes fine silhouette information, using the received fine silhouette information to select pixels in the video stream to be used in generating the immersive video;
under the condition that the video stream includes coarse silhouette information, processing the video stream guided by the coarse silhouette information to generate fine silhouette information;
under the condition that the video stream includes no information describing a silhouette of the user, processing the video stream to generate fine silhouette information;
using the generated fine silhouette information to select pixels in the video stream to be used in generation of the immersive video; and
generating a video stream of a virtual scene including representations based on the selected pixels.
Specification