Frame packing for asymmetric stereo video
First Claim
1. A method of encoding video data, the method comprising:
- receiving a first picture of a first view of a scene having a first resolution;
receiving a second picture of a second view of the scene having a reduced resolution relative to the first resolution;
forming an asymmetric frame comprising the first picture and the second picture, wherein the asymmetric frame comprises the first picture and the second picture as a single frame;
encoding the asymmetric frame, wherein encoding the asymmetric frame comprises encoding the single frame;
forming a supplemental enhancement information (SEI) message comprising an asymmetric packing indicator value that indicates that the asymmetric frame is an asymmetric frame, a frame packing arrangement type value indicating that the first picture and the second picture have one of a side-by-side arrangement and a top-bottom arrangement, and information indicating a location of the first picture relative to the second picture in the asymmetric frame;
encapsulating the asymmetric frame in a video coding layer (VCL) network abstraction layer (NAL) unit;
encapsulating the SEI message in a non-VCL NAL unit;
encapsulating the VCL NAL unit including the asymmetric frame and the non-VCL NAL unit including the SEI message in an access unit, wherein the access unit corresponds to a specific time instance; and
outputting the access unit.
1 Assignment
0 Petitions
Accused Products
Abstract
An asymmetric frame of a coded video bitstream may include a full resolution picture of a left view and a reduced resolution picture of a right view, where the left and right views form a stereo view pair for three-dimensional video playback. In one example, an apparatus includes a video encoder configured to receive a first picture of a first view of a scene having a first resolution, receive a second picture of a second view of the scene having a reduced resolution relative to the first resolution, form an asymmetric frame including the first picture and the second picture, and encode the asymmetric frame. In this manner, decoders of varying capabilities may receive the same bitstream, and the bitstream may consume less bandwidth than one or more bitstreams having full resolution pictures of a stereo view pair. The bitstream may have better quality than a bitstream having subsampled pictures.
216 Citations
50 Claims
-
1. A method of encoding video data, the method comprising:
-
receiving a first picture of a first view of a scene having a first resolution; receiving a second picture of a second view of the scene having a reduced resolution relative to the first resolution; forming an asymmetric frame comprising the first picture and the second picture, wherein the asymmetric frame comprises the first picture and the second picture as a single frame; encoding the asymmetric frame, wherein encoding the asymmetric frame comprises encoding the single frame; forming a supplemental enhancement information (SEI) message comprising an asymmetric packing indicator value that indicates that the asymmetric frame is an asymmetric frame, a frame packing arrangement type value indicating that the first picture and the second picture have one of a side-by-side arrangement and a top-bottom arrangement, and information indicating a location of the first picture relative to the second picture in the asymmetric frame; encapsulating the asymmetric frame in a video coding layer (VCL) network abstraction layer (NAL) unit; encapsulating the SEI message in a non-VCL NAL unit; encapsulating the VCL NAL unit including the asymmetric frame and the non-VCL NAL unit including the SEI message in an access unit, wherein the access unit corresponds to a specific time instance; and outputting the access unit. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. An apparatus for encoding video data, the apparatus comprising:
-
a memory configured to store video data and a video encoder configured to; receive a first picture of a first view of a scene having a first resolution, receive a second picture of a second view of the scene having a reduced resolution relative to the first resolution, form an asymmetric frame of the video data, the asymmetric frame comprising the first picture and the second picture, wherein the asymmetric frame comprises the first picture and the second picture as a single frame, encode the asymmetric frame, wherein to encode the asymmetric frame, the video encoder is configured to encode the single frame, form a supplemental enhancement information (SEI) message comprising an asymmetric packing indicator value that indicates that the asymmetric frame is an asymmetric frame, a frame packing arrangement type value indicating that the first picture and the second picture have one of a side-by-side arrangement and a top-bottom arrangement, and information indicating a location of the first picture relative to the second picture in the asymmetric frame, encapsulate the asymmetric frame in a video coding layer (VCL) network abstraction layer (NAL) unit, encapsulate the SEI message in a non-VCL NAL unit, and encapsulate the VCL NAL unit including the asymmetric frame and the non-VCL NAL unit including the SEI message in an access unit, wherein the access unit corresponds to a specific time instance. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. An apparatus for encoding video data, the apparatus comprising:
-
means for receiving a first picture of a first view of a scene having a first resolution; means for receiving a second picture of a second view of the scene having a reduced resolution relative to the first resolution; means for forming an asymmetric frame comprising the first picture and the second picture, wherein the asymmetric frame comprises the first picture and the second picture as a single frame; means for encoding the asymmetric frame, wherein the means for encoding the asymmetric frame comprises means for encoding the single frame; means for forming a supplemental enhancement information (SEI) message comprising an asymmetric packing indicator value that indicates that the asymmetric frame is an asymmetric frame, a frame packing arrangement type value indicating that the first picture and the second picture have one of a side-by-side arrangement and a top-bottom arrangement, and information indicating a location of the first picture relative to the second picture in the asymmetric frame; means for encapsulating the asymmetric frame in a video coding layer (VCL) network abstraction layer (NAL) unit; means for encapsulating the SEI message in a non-VCL NAL unit; and means for encapsulating the VCL NAL unit including the asymmetric frame and the non-VCL NAL unit including the SEI message in an access unit, wherein the access unit corresponds to a specific time instance. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A computer program product comprising a non-transitory computer-readable storage medium having stored thereon instructions that, when executed, cause a processor of a device for encoding video data to:
-
receive a first picture of a first view of a scene having a first resolution; receive a second picture of a second view of the scene having a reduced resolution relative to the first resolution; form an asymmetric frame comprising the first picture and the second picture, wherein the asymmetric frame comprises the first picture and the second picture as a single frame; encode the asymmetric frame, wherein the instructions that cause the processor to encode the asymmetric frame comprise instructions that cause the processor to encode the single frame; form a supplemental enhancement information (SEI) message comprising an asymmetric packing indicator value that indicates that the asymmetric frame is an asymmetric frame, a frame packing arrangement type value indicating that the first picture and the second picture have one of a side-by-side arrangement and a top-bottom arrangement, and information indicating a location of the first picture relative to the second picture in the asymmetric frame; encapsulate the asymmetric frame in a video coding layer (VCL) network abstraction layer (NAL) unit; encapsulate the SEI message in a non-VCL NAL unit; encapsulate the VCL NAL unit including the asymmetric frame and the non-VCL NAL unit including the SEI message in an access unit, wherein the access unit corresponds to a specific time instance; and output the access unit. - View Dependent Claims (20, 21, 22, 23, 24)
-
-
25. A method of decoding video data, the method comprising:
-
receiving an access unit corresponding to a specific time instance, the access unit comprising an encoded asymmetric frame, encapsulated in a video coding layer (VCL) network abstraction layer (NAL) unit, the encoded asymmetric frame comprising a first resolution picture of a first view of a scene and a reduced resolution picture of a second view of the scene, the reduced resolution picture having a reduced resolution relative to a first resolution of the first resolution picture, the asymmetric frame comprising the first resolution picture and the reduced resolution picture as a single frame, the access unit further comprising a supplemental enhancement information (SEI) message, encapsulated in a non-VCL NAL unit, the SEI message comprising an asymmetric packing indicator value that indicates that the asymmetric frame is an asymmetric frame, a frame packing arrangement type value indicating that the first resolution picture and the reduced resolution picture have one of a side-by-side arrangement and a top-bottom arrangement, and information indicating a location of the first picture relative to the second picture in the asymmetric frame; decoding the asymmetric frame based on the SEI message, wherein decoding the asymmetric frame comprises decoding the single frame; separating the decoded asymmetric frame into the first resolution picture and the reduced resolution picture based on the SEI message; upsampling the reduced resolution picture to produce a second picture of the second view of the scene having the first resolution of the first resolution picture; and outputting the first resolution picture, as a first picture, and the second picture, wherein the first picture and the second picture form a stereo image pair. - View Dependent Claims (26, 27, 28, 29, 30)
-
-
31. An apparatus for decoding video data, the apparatus comprising:
-
a memory configured to store video data and a video decoder configured to; receive an access unit of the video data, the access unit corresponding to a specific time instance, the access unit comprising an encoded asymmetric frame encapsulated in a video coding layer (VCL) network abstraction layer (NAL) unit of the video data, the encoded asymmetric frame comprising a first resolution picture of a first view of a scene and a reduced resolution picture of a second view of the scene, the reduced resolution picture having a reduced resolution relative to a first resolution of the first resolution picture, the asymmetric frame comprising the first resolution picture and the reduced resolution picture as a single frame, the access unit further comprising a supplemental enhancement information (SEI) message, encapsulated in a non-VCL NAL unit, the SEI message comprising an asymmetric packing indicator value that indicates that the asymmetric frame is an asymmetric frame, a frame packing arrangement type value indicating that the first resolution picture and the reduced resolution picture have one of a side-by-side arrangement and a top-bottom arrangement, and information indicating a location of the first resolution picture relative to the reduced resolution picture in the asymmetric frame, decode the asymmetric frame based on the SEI message, wherein to decode the asymmetric frame, the video decoder is configured to decode the single frame, and wherein the video decoder is further configured to; separate the decoded asymmetric frame into the first resolution picture and the reduced resolution picture based on the SEI message, and upsample the reduced resolution picture to produce a second picture of the second view of the scene having the first resolution the first resolution picture, wherein the first resolution picture, when output as a first picture, and the second picture form a stereo image pair. - View Dependent Claims (32, 33, 34, 35, 36, 37, 38)
-
-
39. An apparatus for decoding video data, the apparatus comprising:
-
means for receiving an access unit corresponding to a specific time instance, the access unit comprising an encoded asymmetric frame, encapsulated in a video coding layer (VCL) network abstraction layer (NAL) unit, the encoded asymmetric frame comprising a first resolution picture of a first view of a scene and a reduced resolution picture of a second view of the scene, the reduced resolution picture having a reduced resolution relative to a first resolution of the first resolution picture, the asymmetric frame comprising the first resolution picture and the reduced resolution picture as a single frame, the access unit further comprising a supplemental enhancement information (SEI) message, encapsulated in a non-VCL NAL unit, the SEI message comprising an asymmetric packing indicator value that indicates that the asymmetric frame is an asymmetric frame, a frame packing arrangement type value indicating that the first resolution picture and the reduced resolution picture have one of a side-by-side arrangement and a top-bottom arrangement, and information indicating a location of the first resolution picture relative to the reduced resolution picture in the asymmetric frame; means for decoding the asymmetric frame based on the SEI message, wherein the means for decoding the asymmetric frame comprises means for decoding the single frame; means for separating the decoded asymmetric frame into the first resolution picture and the reduced resolution picture based on the SEI message; and means for upsampling the reduced resolution picture to produce a second picture of the second view of the scene having the first resolution of the first resolution picture, wherein the first resolution picture, when output as a first picture, and the second picture form a stereo image pair. - View Dependent Claims (40, 41, 42, 43, 44)
-
-
45. A computer program product comprising a non-transitory computer-readable storage medium having stored thereon instructions that, when executed, cause a processor of a device for decoding video data to:
-
receive an access unit corresponding to a specific time instance, the access unit comprising an encoded asymmetric frame, encapsulated in a video coding layer (VCL) network abstraction layer (NAL) unit, the encoded asymmetric frame comprising a first resolution picture of a first view of a scene and a reduced resolution picture of a second view of the scene, the reduced resolution picture having a reduced resolution relative to a first resolution of the first resolution picture, the asymmetric frame comprising the first resolution picture and the reduced resolution picture as a single frame, the access unit further comprising a supplemental enhancement information (SEI) message, encapsulated in a non-VCL NAL unit, the SEI message comprising an asymmetric packing indicator value that indicates that the asymmetric frame is an asymmetric frame, a frame packing arrangement type value indicating that the first resolution picture and the reduced resolution picture have one of a side-by-side arrangement and a top-bottom arrangement, and information indicating a location of the first resolution picture relative to the reduced resolution picture in the asymmetric frame; decode the asymmetric frame based on the SEI message, wherein the instructions that cause the processor to decode the asymmetric frame comprise instructions that cause the processor to decode the single frame; separate the decoded asymmetric frame into the first resolution picture and the reduced resolution picture based on the SEI message; upsample the reduced resolution picture to produce a second picture of the second view of the scene having the first resolution of the first resolution picture; and output the first resolution picture, as a first picture, and the second picture, wherein the first picture and the second picture form a stereo image pair. - View Dependent Claims (46, 47, 48, 49, 50)
-
Specification