Excluding masked regions of virtual reality (VR) frames from encoder processing
First Claim
1. A method, comprising:
obtaining a two dimensional (2D) rectangular representation of virtual reality (VR) content, the VR content including one or more images corresponding to a plurality of faces of a virtual three dimensional (3D) polygonal projection space, the virtual 3D polygonal projection space corresponding to one of a hexahedron, a cube, an octahedron, a dodecahedron, or an icosahedron, the 2D rectangular representation including a first set of regions containing a plurality of non-display pixels and a second set of regions containing a plurality of display pixels, the second set of regions including a plurality of regions, each region of the second set of regions corresponding to a different one of the plurality of faces of the 3D projection space, the display pixels containing image content of the VR content and the non-display pixels not containing image content of the VR content, each of the display pixels and non-display pixels being represented by a corresponding set of pixel values including an intensity value;
modifying the 2D rectangular representation, wherein modifying the 2D rectangular representation includes modifying the intensity value of each pixel of at least a portion of the non-display pixels of a first region of the first set of regions within the 2D rectangular representation based, at least in part, on the intensity value of at least one pixel of at least a portion of the display pixels of a second region of the second set of regions within the 2D rectangular representation, the first region being adjacent to the second region within the 2D representation;
after modifying the 2D rectangular representation, obtaining a mask indicating positions of the non-display pixels and the display pixels within the 2D representation, the mask including a first value for each of the non-display pixels and a second value for each of the display pixels; and
encoding the 2D representation using the mask such that the non-display pixels of the first set of regions are excluded from motion estimation.
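The modifying and mask steps recited above can be illustrated with a minimal sketch. It is not the patented implementation: the rectangle-based layout, the row-wise nearest-neighbor fill, the names `pad_non_display`, `build_mask`, and `is_display`, and the choice of 1 and 0 as the mask's first and second values are all assumptions made for illustration.

```python
from typing import List, Tuple

Rect = Tuple[int, int, int, int]  # (x0, y0, x1, y1) with x1 and y1 exclusive
NON_DISPLAY, DISPLAY = 1, 0       # assumed stand-ins for the "first value" and "second value"


def is_display(x: int, y: int, display_rects: List[Rect]) -> bool:
    """True if pixel (x, y) lies inside any display region."""
    return any(x0 <= x < x1 and y0 <= y < y1 for x0, y0, x1, y1 in display_rects)


def pad_non_display(frame: List[List[int]], display_rects: List[Rect]) -> List[List[int]]:
    """Give each non-display pixel the intensity of the nearest display pixel in its row.

    Row-wise replication is only one possible way to base the new intensity
    on adjacent display pixels; it is an assumption of this sketch.
    """
    padded = [row[:] for row in frame]
    for y, row in enumerate(frame):
        display_xs = [x for x in range(len(row)) if is_display(x, y, display_rects)]
        if not display_xs:
            continue  # no display pixel in this row to borrow from
        for x in range(len(row)):
            if x not in display_xs:
                nearest = min(display_xs, key=lambda dx: abs(dx - x))
                padded[y][x] = row[nearest]
    return padded


def build_mask(width: int, height: int, display_rects: List[Rect]) -> List[List[int]]:
    """Mark every pixel position as DISPLAY or NON_DISPLAY."""
    return [
        [DISPLAY if is_display(x, y, display_rects) else NON_DISPLAY for x in range(width)]
        for y in range(height)
    ]


# A 4x2 frame whose middle two columns hold image content.
rects = [(1, 0, 3, 2)]
frame = [[0, 50, 60, 0],
         [0, 70, 80, 0]]
padded = pad_non_display(frame, rects)  # modify first ...
mask = build_mask(4, 2, rects)          # ... then obtain the mask
```

In this sketch the mask is derived from the layout rather than from pixel values, so it still identifies the non-display positions after their intensities have been overwritten.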
1 Assignment
0 Petitions
Abstract
Techniques are described that enable a two-dimensional (2D) representation of three-dimensional (3D) virtual reality (VR) content to be encoded. These techniques include encoding VR content while excluding non-display pixels of the VR content from motion estimation during encoder processing.
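One way to picture excluding non-display pixels from motion estimation is a block-matching search that consults the mask. The sketch below is a hedged illustration, not the encoder behavior described in the patent: the exhaustive search, the sum-of-absolute-differences cost, and the names `masked_sad` and `estimate_motion` are assumptions, as is the convention that 1 marks a non-display pixel.

```python
NON_DISPLAY, DISPLAY = 1, 0  # assumed mask values


def masked_sad(cur, ref, mask, bx, by, dx, dy, size):
    """Sum of absolute differences over the display pixels of one block only."""
    total = 0
    for y in range(by, by + size):
        for x in range(bx, bx + size):
            if mask[y][x] == NON_DISPLAY:
                continue  # non-display pixels contribute nothing to the cost
            total += abs(cur[y][x] - ref[y + dy][x + dx])
    return total


def estimate_motion(cur, ref, mask, size, search):
    """Return {(bx, by): (dx, dy)} for every block that has at least one display pixel."""
    height, width = len(cur), len(cur[0])
    vectors = {}
    for by in range(0, height - size + 1, size):
        for bx in range(0, width - size + 1, size):
            block = [(x, y) for y in range(by, by + size) for x in range(bx, bx + size)]
            if all(mask[y][x] == NON_DISPLAY for x, y in block):
                continue  # block is entirely non-display: no search is run for it
            best = None
            for dy in range(-search, search + 1):
                for dx in range(-search, search + 1):
                    inside = (0 <= by + dy and by + dy + size <= height
                              and 0 <= bx + dx and bx + dx + size <= width)
                    if not inside:
                        continue  # candidate block would fall outside the reference frame
                    cost = masked_sad(cur, ref, mask, bx, by, dx, dy, size)
                    if best is None or cost < best[0]:
                        best = (cost, dx, dy)
            vectors[(bx, by)] = (best[1], best[2])
    return vectors


# Left half of the frame is non-display; the right half moved up by one row.
mask = [[1, 1, 0, 0]] * 4
ref = [[0, 0, 10, 20], [0, 0, 30, 40], [0, 0, 50, 60], [0, 0, 70, 80]]
cur = [[0, 0, 30, 40], [0, 0, 50, 60], [0, 0, 70, 80], [0, 0, 70, 80]]
vectors = estimate_motion(cur, ref, mask, size=2, search=1)
```

Skipping whole blocks saves the search cost for regions that carry no image content, while the per-pixel check keeps partially masked blocks from being matched on filler values.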
20 Citations
23 Claims
1. A method, comprising:
obtaining a two dimensional (2D) rectangular representation of virtual reality (VR) content, the VR content including one or more images corresponding to a plurality of faces of a virtual three dimensional (3D) polygonal projection space, the virtual 3D polygonal projection space corresponding to one of a hexahedron, a cube, an octahedron, a dodecahedron, or an icosahedron, the 2D rectangular representation including a first set of regions containing a plurality of non-display pixels and a second set of regions containing a plurality of display pixels, the second set of regions including a plurality of regions, each region of the second set of regions corresponding to a different one of the plurality of faces of the 3D projection space, the display pixels containing image content of the VR content and the non-display pixels not containing image content of the VR content, each of the display pixels and non-display pixels being represented by a corresponding set of pixel values including an intensity value;
modifying the 2D rectangular representation, wherein modifying the 2D rectangular representation includes modifying the intensity value of each pixel of at least a portion of the non-display pixels of a first region of the first set of regions within the 2D rectangular representation based, at least in part, on the intensity value of at least one pixel of at least a portion of the display pixels of a second region of the second set of regions within the 2D rectangular representation, the first region being adjacent to the second region within the 2D representation;
after modifying the 2D rectangular representation, obtaining a mask indicating positions of the non-display pixels and the display pixels within the 2D representation, the mask including a first value for each of the non-display pixels and a second value for each of the display pixels; and
encoding the 2D representation using the mask such that the non-display pixels of the first set of regions are excluded from motion estimation.
- View Dependent Claims (2, 3, 4, 5, 21, 22, 23)
6. A computer program product, comprising one or more non-transitory computer readable media having computer program instructions stored therein, the computer program instructions being configured such that, when executed by one or more processors, the computer program instructions cause the one or more processors to:
obtain a two dimensional (2D) representation of virtual reality (VR) content, the VR content including one or more images corresponding to a plurality of faces of a virtual three dimensional (3D) projection space, the 2D representation including a first set of regions containing a plurality of non-display pixels and a second set of regions containing a plurality of display pixels, the second set of regions including a plurality of regions, each region of the second set of regions corresponding to a different one of the plurality of faces of the 3D projection space, the display pixels containing image content of the VR content and the non-display pixels not containing image content of the VR content;
obtain a mask indicating positions of the non-display pixels and the display pixels within the 2D representation, the mask including a first value for each of the non-display pixels and a second value for each of the display pixels; and
encode the 2D representation using the mask such that the non-display pixels of the first set of regions are excluded from motion estimation.
- View Dependent Claims (7, 8, 9, 10, 11, 12, 13)
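As an illustration of a 2D representation in which each display region corresponds to a different face of the projection space, the sketch below builds a mask for a cube unfolded into a 4x3 cross. The cross arrangement, the face names, the function name `cube_cross_mask`, and the mask values 1 and 0 are assumptions chosen for the example; the claim does not prescribe a particular layout.

```python
NON_DISPLAY, DISPLAY = 1, 0  # assumed mask values

# Hypothetical 4x3 cross layout: each named cell is one cube face,
# each None cell is a region with no image content.
CROSS_LAYOUT = [
    [None,   "top",    None,    None],
    ["left", "front",  "right", "back"],
    [None,   "bottom", None,    None],
]


def cube_cross_mask(face_size):
    """Return a per-pixel mask for the cross layout with square faces of face_size pixels."""
    mask = []
    for layout_row in CROSS_LAYOUT:
        pixel_row = []
        for cell in layout_row:
            value = DISPLAY if cell is not None else NON_DISPLAY
            pixel_row.extend([value] * face_size)
        # Every pixel row inside this band of faces has the same pattern.
        mask.extend(pixel_row[:] for _ in range(face_size))
    return mask


mask = cube_cross_mask(face_size=2)
display_count = sum(row.count(DISPLAY) for row in mask)
non_display_count = sum(row.count(NON_DISPLAY) for row in mask)
```

With this layout half of the frame area is non-display, which is the portion an encoder following the claim would leave out of motion estimation.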
14. A system, comprising:
an encoder configured to encode video frames according to a particular encoding standard; and
one or more processors and memory configured to:
obtain a two dimensional (2D) representation of virtual reality (VR) content, the VR content including one or more images corresponding to a plurality of faces of a virtual three dimensional (3D) projection space, the 2D representation including a first set of regions containing a plurality of non-display pixels and a second set of regions containing a plurality of display pixels, the second set of regions including a plurality of regions, each region of the second set of regions corresponding to a different one of the plurality of faces of the 3D projection space, the display pixels containing image content of the VR content and the non-display pixels not containing image content of the VR content;
obtain a mask indicating positions of the non-display pixels and the display pixels within the 2D representation, the mask including a first value for each of the non-display pixels and a second value for each of the display pixels; and
encode the 2D representation using the mask such that the non-display pixels of the first set of regions are excluded from motion estimation.
- View Dependent Claims (15, 16, 17, 18, 19, 20)
Specification