TRUNCATED SQUARE PYRAMID GEOMETRY AND FRAME PACKING STRUCTURE FOR REPRESENTING VIRTUAL REALITY VIDEO CONTENT
First Claim
1. A method for encoding video data, comprising:
- obtaining virtual reality video data, wherein the virtual reality video data represents a 360-degree view of a virtual environment, wherein the virtual reality video data includes a plurality of frames, and wherein each frame from the plurality of frames includes corresponding spherical video data; and
mapping the spherical video data for a frame from the plurality of frames onto planes of a truncated square pyramid, wherein the planes of the truncated square pyramid include a base plane, a top plane, a left-side plane, a right-side plane, an up-side plane, and a bottom-side plane, wherein a size of the top plane is less than a size of the base plane, and wherein mapping the spherical video data includes;
mapping a first portion of the spherical video data onto the base plane at full resolution;
mapping a second portion of the spherical video data onto the top plane at a reduced resolution;
mapping a third portion of the spherical video data onto the left-side plane at a decreasing resolution;
mapping a fourth portion of the spherical video data onto the right-side plane at a decreasing resolution;
mapping a fifth portion of the spherical video data onto the up-side plane at a decreasing resolution; and
mapping a sixth portion of the spherical video data onto the bottom-side plane at a decreasing resolution.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques and systems are described for mapping 360-degree video data to a truncated square pyramid shape. A 360-degree video frame can include 360-degrees'"'"' worth of pixel data, and thus be spherical in shape. By mapping the spherical video data to the planes provided by a truncated square pyramid, the total size of the 360-degree video frame can be reduced. The planes of the truncated square pyramid can be oriented such that the base of the truncated square pyramid represents a front view and the top of the truncated square pyramid represents a back view. In this way, the front view can be captured at full resolution, the back view can be captured at reduced resolution, and the left, right, up, and bottom views can be captured at decreasing resolutions. Frame packing structures can also be defined for 360-degree video data that has been mapped to a truncated square pyramid shape.
71 Citations
30 Claims
-
1. A method for encoding video data, comprising:
-
obtaining virtual reality video data, wherein the virtual reality video data represents a 360-degree view of a virtual environment, wherein the virtual reality video data includes a plurality of frames, and wherein each frame from the plurality of frames includes corresponding spherical video data; and mapping the spherical video data for a frame from the plurality of frames onto planes of a truncated square pyramid, wherein the planes of the truncated square pyramid include a base plane, a top plane, a left-side plane, a right-side plane, an up-side plane, and a bottom-side plane, wherein a size of the top plane is less than a size of the base plane, and wherein mapping the spherical video data includes; mapping a first portion of the spherical video data onto the base plane at full resolution; mapping a second portion of the spherical video data onto the top plane at a reduced resolution; mapping a third portion of the spherical video data onto the left-side plane at a decreasing resolution; mapping a fourth portion of the spherical video data onto the right-side plane at a decreasing resolution; mapping a fifth portion of the spherical video data onto the up-side plane at a decreasing resolution; and mapping a sixth portion of the spherical video data onto the bottom-side plane at a decreasing resolution. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A device for encoding video data, comprising:
-
a memory configured to store video data; and a video encoding device in communication with the memory, wherein the video encoding device is configured to; obtain virtual reality video data, wherein the virtual reality video data represents a 360-degree view of a virtual environment, wherein the virtual reality video data includes a plurality of frames, and wherein each frame from the plurality of frames includes corresponding spherical video data; and map the spherical video data for a frame from the plurality of frames onto planes of a truncated square pyramid, wherein the planes of the truncated square pyramid include a base plane, a top plane, a left-side plane, a right-side plane, an up-side plane, and a bottom-side plane, wherein a size of the top plane is less than a size of the base plane, and wherein mapping the spherical video data includes; mapping a first portion of the spherical video data onto the base plane at full resolution; mapping a second portion of the spherical video data onto the top plane at a reduced resolution; mapping a third portion of the spherical video data onto the left-side plane at a decreasing resolution; mapping a fourth portion of the spherical video data onto the right-side plane at a decreasing resolution; mapping a fifth portion of the spherical video data onto the up-side plane at a decreasing resolution; and mapping a sixth portion of the spherical video data onto the bottom-side plane at a decreasing resolution. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method for decoding video data, comprising:
-
obtaining a frame of virtual reality video data, wherein the virtual reality video data represents a 360-degree view of a virtual environment, wherein the frame has a rectangular format; identifying a frame packing structure for the frame, wherein the frame packing structure provides positions for video data in the frame, wherein the frame packing structure includes planes of a truncated square pyramid, wherein the planes of the truncated square pyramid include a base plane, a top plane, a left-side plane, a right-side plane, an up-side plane, and a bottom-side plane, and wherein a size of the top plane is less than a size of the base plane; and displaying the frame using the frame packing structure. - View Dependent Claims (22, 23, 24)
-
-
25. A device for decoding video data, comprising:
-
a memory configured to store the video data; a video decoding device in communication with the memory, wherein the video decoding device is configured to; obtain a frame of virtual reality video data, wherein the virtual reality video data represents a 360-degree view of a virtual environment, wherein the frame has a rectangular format; identify a frame packing structure for the frame, wherein the frame packing structure provides positions for video data in the frame, wherein the frame packing structure includes planes of a truncated square pyramid, wherein the planes of the truncated square pyramid include a base plane, a top plane, a left-side plane, a right-side plane, an up-side plane, and a bottom-side plane, wherein a size of the top plane is less than a size of the base plane; and display the frame using the frame packing structure. - View Dependent Claims (26, 27, 28, 29, 30)
-
Specification