Probabilistic model to compress images for three-dimensional video
First Claim
1. A method comprising:
- receiving head-tracking data that describe one or more positions of people while the people are viewing a three-dimensional video;
generating a probabilistic model of the one or more positions of the people based on the head-tracking data, wherein the probabilistic model identifies a probability of a viewer looking in a particular direction as a function of time;
generating video segments from the three-dimensional video based on scene boundaries in the three-dimensional video;
for each of the video segments;
determining a directional encoding format that projects latitudes and longitudes of locations of a surface of a sphere onto locations on a plane;
determining a cost function that identifies a region of interest on the plane based on the probabilistic model; and
generating optimal segment parameters that minimize a sum-over position for the region of interest;
re-encoding the three-dimensional video to include the optimal segment parameters for each of the video segments and to blur portions of each of the video segments based on the probability, wherein an intensity of a level of blur increases as the probability of the viewer looking in the particular direction decreases; and
providing a re-encoded video and the optimal segment parameters for each of the video segments to a viewing device, wherein the viewing device uses the optimal segment parameters for each of the video segments to un-distort the re-encoded video and texture the re-encoded video to the sphere to display the re-encoded video with the region of interest for each of the video segments displayed at a higher resolution than other regions in each of the video segments.
2 Assignments
0 Petitions
Accused Products
Abstract
A method includes receiving head-tracking data that describe one or more positions of people while the people are viewing a three-dimensional video. The method further includes generating a probabilistic model of the one or more positions of the people based on the head-tracking data, wherein the probabilistic model identifies a probability of a viewer looking in a particular direction as a function of time. The method further includes generating video segments from the three-dimensional video. The method further includes, for each of the video segments: determining a directional encoding format that projects latitudes and longitudes of locations of a surface of a sphere onto locations on a plane, determining a cost function that identifies a region of interest on the plane based on the probabilistic model, and generating optimal segment parameters that minimize a sum-over position for the region of interest.
-
Citations
20 Claims
-
1. A method comprising:
-
receiving head-tracking data that describe one or more positions of people while the people are viewing a three-dimensional video; generating a probabilistic model of the one or more positions of the people based on the head-tracking data, wherein the probabilistic model identifies a probability of a viewer looking in a particular direction as a function of time; generating video segments from the three-dimensional video based on scene boundaries in the three-dimensional video; for each of the video segments; determining a directional encoding format that projects latitudes and longitudes of locations of a surface of a sphere onto locations on a plane; determining a cost function that identifies a region of interest on the plane based on the probabilistic model; and generating optimal segment parameters that minimize a sum-over position for the region of interest; re-encoding the three-dimensional video to include the optimal segment parameters for each of the video segments and to blur portions of each of the video segments based on the probability, wherein an intensity of a level of blur increases as the probability of the viewer looking in the particular direction decreases; and providing a re-encoded video and the optimal segment parameters for each of the video segments to a viewing device, wherein the viewing device uses the optimal segment parameters for each of the video segments to un-distort the re-encoded video and texture the re-encoded video to the sphere to display the re-encoded video with the region of interest for each of the video segments displayed at a higher resolution than other regions in each of the video segments. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system comprises:
-
one or more processors coupled to a memory; a head tracking module stored in the memory and executable by the one or more processors, the head tracking module operable to receive head-tracking data that describe one or more positions of people while the people are viewing a set of three-dimensional videos, generate a set of probabilistic models of the one or more positions of the people based on the head-tracking data, and estimate a first probabilistic model for a first three-dimensional video, wherein the first probabilistic model identifies a probability of a viewer looking in a particular direction as a function of time and the first three-dimensional video is not part of the set of three-dimensional videos; a segmentation module stored in the memory and executable by the one or more processors, the segmentation module operable to generate video segments from the three-dimensional video, wherein the video segments are of equal length and of a predetermined length; a parameterization module stored in the memory and executable by the one or more processors, the parameterization module operable to, for each of the video segments; determine a directional encoding format that projects latitudes and longitudes of locations of a surface of a sphere onto locations on a plane; determine a cost function that identifies a region of interest on the plane based on the first probabilistic model; and generate optimal segment parameters that minimize a sum-over position for the region of interest; and an encoder module stored in the memory and executable by the one or more processors, the encoder module operable; to re-encode the three-dimensional video to include the optimal segment parameters for each of the video segments and blurring portions of each of the video segments based on the probability, wherein an intensity of a level of blur increases as the probability of the viewer looking in the particular direction decreases; and provide a re-encoded video and the optimal segment parameters for each of the video segments to a client device, wherein the client device uses the re-encoded video and the optimal segment parameters for each of the video segments to generate a two-dimensional video that automates head movement. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A non-transitory computer storage medium encoded with a computer program, the computer program comprising instructions that, when executed by one or more processors, cause the one or more processors to perform operations comprising:
-
receiving head-tracking data that describe one or more positions of people while the people are viewing a three-dimensional video; generating a probabilistic model of the one or more positions of the people based on the head-tracking data, wherein the probabilistic model identifies a probability of a viewer looking in a particular direction as a function of time; generating video segments from the three-dimensional video based on scene boundaries in the three-dimensional video; for each of the video segments; determining a directional encoding format that projects latitudes and longitudes of locations of a surface of a sphere onto locations on a plane; determining a cost function that identifies a region of interest on the plane based on the probabilistic model; generating optimal segment parameters that minimize a sum-over position for the region of interest; and identifying a region of low interest; re-encoding the three-dimensional video to include the optimal segment parameters for each of the video segments and blurring of the region of low interest; and providing a re-encoded video and the optimal segment parameters for each of the video segments to a viewing device, wherein the viewing device uses the optimal segment parameters for each of the video segments to un-distort the re-encoded video and texture the re-encoded video to the sphere to display the re-encoded video with the region of interest for each of the video segments displayed at a higher resolution than other regions in each of the video segments and the region of low interest displayed at a lower resolution than other regions in each of the video segments. - View Dependent Claims (17, 18, 19, 20)
-
Specification