Three-dimensional video with asymmetric spatial resolution
Abstract
A video coding device may be configured to code a bitstream including multiple views plus depth information. Two of the views may have reduced resolutions, while a third view may have a full resolution. The third view may be predicted relative to upsampled versions of the two reduced-resolution views. Each view may include texture data and depth data, such that a view component may include a texture component and a depth component. Moreover, the texture and depth components may be arranged within an access unit according to a particular order, which may simplify component extraction from the access unit.
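The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation and rests on several assumptions that do not come from the source: pictures are plain lists of sample rows, the reduced-resolution views have half the horizontal resolution of the full view, upsampling is simple sample repetition, and the third view is predicted by averaging the two upsampled pictures and adding a decoded residual. All function names are hypothetical.

```python
from typing import List

Picture = List[List[int]]  # rows of samples; a stand-in for a decoded picture


def upsample_2x_horizontal(picture: Picture) -> Picture:
    """Double the picture width by repeating each sample.

    Assumption: a real codec would use an interpolation filter; repetition
    keeps the sketch short.
    """
    return [[sample for sample in row for _ in range(2)] for row in picture]


def predict_from_references(ref_a: Picture, ref_b: Picture) -> Picture:
    """Average two same-sized reference pictures, rounding half up (assumed rule)."""
    return [
        [(a + b + 1) // 2 for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(ref_a, ref_b)
    ]


def reconstruct_full_view(ref_a: Picture, ref_b: Picture, residual: Picture) -> Picture:
    """Add a decoded residual to the prediction formed from the upsampled views."""
    prediction = predict_from_references(ref_a, ref_b)
    return [
        [p + r for p, r in zip(pred_row, res_row)]
        for pred_row, res_row in zip(prediction, residual)
    ]


# Two already-decoded reduced-resolution pictures (first and second views).
first_picture = [[10, 20], [30, 40]]
second_picture = [[12, 22], [32, 42]]

# Upsample both to the full (second) resolution.
first_upsampled = upsample_2x_horizontal(first_picture)
second_upsampled = upsample_2x_horizontal(second_picture)

# Decode the full-resolution third view relative to the upsampled pictures.
third_residual = [[1, 0, -1, 0], [0, 0, 0, 2]]
third_picture = reconstruct_full_view(first_upsampled, second_upsampled, third_residual)
print(third_picture)  # [[12, 11, 20, 21], [31, 31, 41, 43]]
```

The point of the sketch is the data dependency: the full-resolution view can only be reconstructed after both reduced-resolution pictures have been decoded and upsampled to the same resolution.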
51 Claims
1. A method of coding video data, the method comprising:
decoding video data of a first coded view that comprises a first view component comprising a first texture component having the first resolution and a first depth component having the first resolution to produce a first picture having the first resolution at least in part by:
predicting a first block of the first texture component using a first reference block indicated by a motion vector for the first block; and
predicting a second block of the first depth component using a second reference block indicated by the motion vector for the first block, wherein the second block is spatially collocated, within the first depth component, with the first block of the first texture component;
decoding video data of a second coded view that comprises a second view component comprising a second texture component having the first resolution and a second depth component having the first resolution to produce a second picture having the first resolution;
upsampling the first picture to form a first upsampled picture having a second resolution, wherein the second resolution is greater than the first resolution;
upsampling the second picture to form a second upsampled picture having the second resolution; and
decoding video data of a third coded view that comprises a third view component comprising a third texture component having the second resolution and a third depth component having the second resolution relative to the first upsampled picture and the second upsampled picture to produce a third picture having the second resolution.
Dependent claims: 2-18, 48, 49
19. A device for coding video data, the device comprising:
a memory configured to store the video data; and
a video coder configured to:
decode a first coded view of the video data, the first coded view comprising a first view component comprising a first texture component having the first resolution and a first depth component having the first resolution, to produce a first picture having a first resolution, wherein, to decode the first coded view, the one or more processors are configured to:
predict a first block of the first texture component using a first reference block indicated by a motion vector for the first block; and
predict a second block of the first depth component using a second reference block indicated by the motion vector for the first block, wherein the second block is spatially collocated, within the first depth component, with the first block of the first texture component,
decode a second coded view of the video data, the second coded view comprising a second view component comprising a second texture component having the first resolution and a second depth component having the first resolution to produce a second picture having the first resolution,
upsample the first picture to form a first upsampled picture having a second resolution, wherein the second resolution is greater than the first resolution,
upsample the second picture to form a second upsampled picture having the second resolution, and
decode a third coded view of the video data, the third coded view comprising a third view component comprising a third texture component having the second resolution and a third depth component having the second resolution relative to the first upsampled picture and the second upsampled picture to produce a third picture having the second resolution.
Dependent claims: 20-29, 50, 51
30. A device for coding video data, the device comprising:
means for decoding video data of a first coded view that comprises a first view component comprising a first texture component having the first resolution and a first depth component having the first resolution to produce a first picture having first resolution, the means for decoding comprising:
means for predicting a first block of the first texture component using a first reference block indicated by a motion vector for the first block; and
means for predicting a second block of the first depth component using a second reference block indicated by the motion vector for the first block, wherein the second block is spatially collocated, within the first depth component, with the first block of the first texture component;
means for decoding video data of a second coded view that comprises a second view component comprising a second texture component having the first resolution and a second depth component having the first resolution to produce a second picture having the first resolution;
means for upsampling the first picture to form a first upsampled picture having a second resolution, wherein the second resolution is greater than the first resolution;
means for upsampling the second picture to form a second upsampled picture having the second resolution; and
means for decoding video data of a third coded view relative to the first upsampled picture and the second upsampled picture to produce a third picture having the second resolution.
Dependent claims: 31-38
39. A non-transitory computer-readable storage medium having stored thereon instructions that, when executed, cause one or more processors of a video coding device to:
decode video data of a first coded view that comprises a first view component comprising a first texture component having the first resolution and a first depth component having the first resolution to produce a first picture having the first resolution, the instructions to decode comprising instructions that, when executed, cause the one or more processors to:
predict a first block of the first texture component using a first reference block indicated by a motion vector for the first block; and
predict a second block of the first depth component using a second reference block indicated by the motion vector for the first block, wherein the second block is spatially collocated, within the first depth component, with the first block of the first texture component;
decode video data of a second coded view that comprises a second view component comprising a second texture component having the first resolution and a second depth component having the first resolution to produce a second picture having the first resolution;
upsample the first picture to form a first upsampled picture having a second resolution, wherein the second resolution is greater than the first resolution;
upsample the second picture to form a second upsampled picture having the second resolution; and
decode video data of a third coded view that comprises a third view component comprising a third texture component having the second resolution and a third depth component having the second resolution relative to the first upsampled picture and the second upsampled picture to produce a third picture having the second resolution.
Dependent claims: 40-47
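Each independent claim (1, 19, 30, and 39) recites predicting a texture block and its spatially collocated depth block with the same motion vector. A minimal sketch of that idea follows. It makes assumptions that are not in the claims: reference pictures are plain lists of sample rows, the motion vector is in whole samples, and the block is square. The names `fetch_block` and `predict_texture_and_depth` are hypothetical.

```python
from typing import List, Tuple

Plane = List[List[int]]          # rows of samples (texture or depth)
MotionVector = Tuple[int, int]   # (dx, dy) in whole samples (assumed precision)


def fetch_block(ref: Plane, x: int, y: int, size: int, mv: MotionVector) -> Plane:
    """Return the size x size reference block displaced from (x, y) by mv."""
    dx, dy = mv
    x0, y0 = x + dx, y + dy
    # Reject displacements that leave the reference plane (no padding in this sketch).
    if x0 < 0 or y0 < 0 or y0 + size > len(ref) or x0 + size > len(ref[0]):
        raise ValueError("reference block falls outside the reference plane")
    return [row[x0:x0 + size] for row in ref[y0:y0 + size]]


def predict_texture_and_depth(
    texture_ref: Plane,
    depth_ref: Plane,
    x: int,
    y: int,
    size: int,
    mv: MotionVector,
) -> Tuple[Plane, Plane]:
    """Predict a texture block and the collocated depth block with one motion vector."""
    texture_pred = fetch_block(texture_ref, x, y, size, mv)
    # The depth block sits at the same (x, y) and reuses the texture block's vector.
    depth_pred = fetch_block(depth_ref, x, y, size, mv)
    return texture_pred, depth_pred


# 4x4 reference planes whose values encode row and column (row * 10 + column).
texture_ref = [[r * 10 + c for c in range(4)] for r in range(4)]
depth_ref = [[100 + r * 10 + c for c in range(4)] for r in range(4)]

texture_pred, depth_pred = predict_texture_and_depth(
    texture_ref, depth_ref, x=0, y=0, size=2, mv=(1, 2)
)
print(texture_pred)  # [[21, 22], [31, 32]]
print(depth_pred)    # [[121, 122], [131, 132]]
```

In this sketch only one motion vector is supplied for the pair of blocks, which mirrors the claim language in which the depth block's reference block is "indicated by the motion vector for the first block".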
Specification