Prediction mode information upsampling for scalable video coding
First Claim
1. An apparatus configured to code video data, the apparatus comprising:
- a memory unit configured to store video data associated with a base layer having a first spatial resolution and an enhancement layer having a second spatial resolution greater than the first spatial resolution, the video data associated with the base layer including at least a base layer block and base layer prediction mode information associated with the base layer block, the base layer block including a plurality of sub-blocks where each sub-block is associated with respective prediction mode data of the base layer prediction mode information, the respective prediction mode data including non-pixel information; and
a processor implemented in circuitry and in communication with the memory unit, the processor being configured to;
upsample pixel data of a picture of the base layer to form an inter-layer picture comprising a plurality of blocks, the inter-layer picture having the second spatial resolution associated with the enhancement layer and being usable for inter-prediction of one or more pictures of the enhancement layer;
for every respective block of the inter-layer picture;
identify a single sub-block of the base layer block corresponding to the respective block of the inter-layer picture, the identified sub-block neighbored on each side by a respective sub-block of the plurality of sub-blocks of the base layer block;
select respective prediction mode data associated with the identified sub-block of the plurality of sub-blocks of the base layer block;
upsample the selected respective prediction mode data associated with the identified sub-block of the plurality of sub-blocks of the base layer block; and
associate the upsampled prediction mode data with the respective block of the inter-layer picture;
add the inter-layer picture to a reference picture list, pictures within the reference picture list being usable for inter-prediction of the enhancement layer having the second spatial second resolution;
inter-predict a first block of a first picture of the one or more pictures of the enhancement layer from the inter-layer picture to form a predicted block for the first block, wherein to predict the first block, the processor is configured to code a reference index of the first block, the reference index identifying the inter-layer picture in the reference picture list; and
predict prediction mode data of a second block of a second picture of the one or more pictures of the enhancement layer using the upsampled prediction mode data of the inter-layer picture of the reference picture list.
1 Assignment
0 Petitions
Accused Products
Abstract
In one embodiment, an apparatus configured to code video data includes a processor and a memory unit. The memory unit stores video data associated with a first layer having a first spatial resolution and a second layer having a second spatial resolution. The video data associated with the first layer includes at least a first layer block and first layer prediction mode information associated with the first layer block, and the first layer block includes a plurality of sub-blocks where each sub-block is associated with respective prediction mode data of the first layer prediction mode information. The processor derives the predication mode data associated with one of the plurality of sub-blocks based at least on a selection rule, upsamples the derived prediction mode data and the first layer block, and associates the upsampled prediction mode data with each upsampled sub-block of the upsampled first layer block.
-
Citations
22 Claims
-
1. An apparatus configured to code video data, the apparatus comprising:
-
a memory unit configured to store video data associated with a base layer having a first spatial resolution and an enhancement layer having a second spatial resolution greater than the first spatial resolution, the video data associated with the base layer including at least a base layer block and base layer prediction mode information associated with the base layer block, the base layer block including a plurality of sub-blocks where each sub-block is associated with respective prediction mode data of the base layer prediction mode information, the respective prediction mode data including non-pixel information; and a processor implemented in circuitry and in communication with the memory unit, the processor being configured to; upsample pixel data of a picture of the base layer to form an inter-layer picture comprising a plurality of blocks, the inter-layer picture having the second spatial resolution associated with the enhancement layer and being usable for inter-prediction of one or more pictures of the enhancement layer; for every respective block of the inter-layer picture; identify a single sub-block of the base layer block corresponding to the respective block of the inter-layer picture, the identified sub-block neighbored on each side by a respective sub-block of the plurality of sub-blocks of the base layer block; select respective prediction mode data associated with the identified sub-block of the plurality of sub-blocks of the base layer block; upsample the selected respective prediction mode data associated with the identified sub-block of the plurality of sub-blocks of the base layer block; and associate the upsampled prediction mode data with the respective block of the inter-layer picture; add the inter-layer picture to a reference picture list, pictures within the reference picture list being usable for inter-prediction of the enhancement layer having the second spatial second resolution; inter-predict a first block of a first picture of the one or more pictures of the enhancement layer from the inter-layer picture to form a predicted block for the first block, wherein to predict the first block, the processor is configured to code a reference index of the first block, the reference index identifying the inter-layer picture in the reference picture list; and predict prediction mode data of a second block of a second picture of the one or more pictures of the enhancement layer using the upsampled prediction mode data of the inter-layer picture of the reference picture list. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method of coding video data, the method comprising:
-
storing video data associated with a base layer having a first spatial resolution and an enhancement layer having a second spatial resolution greater than the first spatial resolution in a memory unit, the video data associated with the base layer including at least a base layer block and base layer prediction mode information associated with the base layer block, the base layer block including a plurality of sub-blocks where each sub-block is associated with respective prediction mode data of the base layer prediction mode information, the respective prediction mode data including non-pixel information; upsampling, by a processor in communication with the memory unit, pixel data of a picture of the base layer to form an inter-layer picture comprising a plurality of blocks, the inter-layer picture having the second spatial resolution associated with the enhancement layer and being usable for inter-prediction of one or more pictures of the enhancement layer; for every respective block of the inter-layer picture; identifying, by the processor, a single sub-block of the base layer block corresponding to the respective block of the inter-layer picture, the identified sub-block neighbored on each side by a respective sub-block of the plurality of sub-blocks of the base layer block; selecting, by the processor, respective prediction mode data associated with the identified sub-block of the plurality of sub-blocks of the base layer block; upsampling, by the processor, the selected respective prediction mode data associated with the identified sub-block of the plurality of sub-blocks of the base layer block; and associating, by the processor, the upsampled prediction mode data with the respective block of the inter-layer picture; adding, by the processor, the inter-layer picture to a reference picture list, pictures within the reference picture list being usable for inter-prediction of the enhancement layer having the second spatial second resolution; inter-predicting a first block of a first picture of the one or more pictures of the enhancement layer from the inter-layer picture to form a predicted block for the first block, wherein inter-predicting the first block comprises coding a reference index of the first block, the reference index identifying the inter-layer picture in the reference picture list; and predicting prediction mode data of a second block of a second picture of the one or more pictures of the enhancement layer using the upsampled prediction mode data of the inter-layer picture of the reference picture list. - View Dependent Claims (11, 12, 13, 14, 22)
-
-
15. An apparatus configured to code video data, the apparatus comprising:
-
means for storing video data associated with a base layer having a first spatial resolution and an enhancement layer having a second spatial resolution greater than the first spatial resolution, the video data associated with the base layer including at least a base layer block and base layer prediction mode information associated with the base layer block, the base layer block including a plurality of sub-blocks where each sub-block is associated with respective prediction mode data of the base layer prediction mode information, the respective prediction mode data including non-pixel information; and means for upsampling pixel data of a picture of the base layer to form an inter-layer picture comprising a plurality of blocks, the inter-layer picture having the second spatial resolution associated with the enhancement layer and being usable for inter-prediction of one or more pictures of the enhancement layer; means for identifying, for every respective block of the inter-layer picture, a single sub-block of the base layer block corresponding to the respective block of the inter-layer picture, the identified sub-block neighbored on each side by a respective sub-block of the plurality of sub-blocks of the base layer block; means for selecting, for every respective block of the inter-layer picture, respective prediction mode data associated with the identified sub-block of the plurality of sub-blocks of the base layer block; means for upsampling, for every respective block of the inter-layer picture, the selected respective prediction mode data associated with the identified sub-block of the plurality of sub-blocks of the base layer block; means for associating, for every respective block of the inter-layer picture, the upsampled prediction mode data with the respective block of the inter-layer picture; means for adding the inter-layer picture to a reference picture list, pictures within the reference picture list being usable for inter-prediction of the enhancement layer having the second spatial second resolution; means for inter-predicting a first block of a first picture of the one or more pictures of the enhancement layer from the inter-layer picture to form a predicted block for the first block, wherein the means for inter-predicting the first block comprises means for coding a reference index of the first block, the reference index identifying the inter-layer picture in the reference picture list; and means for predicting prediction mode data of a second block of a second picture of the one or more pictures of the enhancement layer using the upsampled prediction mode data of the inter-layer picture of the reference picture list. - View Dependent Claims (16, 17, 18)
-
-
19. A non-transitory computer storage that stores executable program instructions that, when executed, direct an apparatus for coding video data to perform a process that comprises:
-
storing video data associated with a base layer having a first spatial resolution and an enhancement layer having a second spatial resolution greater than the first spatial resolution in a memory unit, the video data associated with the base layer including at least a base layer block and base layer prediction mode information associated with the base layer block, the base layer block including a plurality of sub-blocks where each sub-block is associated with respective prediction mode data of the base layer prediction mode information, the respective prediction mode data including non-pixel information; upsampling, by a processor in communication with the memory unit, pixel data of a picture of the base layer to form an inter-layer picture comprising a plurality of blocks, the inter-layer picture being usable for inter-prediction of one or more pictures of the enhancement layer; for every respective block of the inter-layer picture; identifying a single sub-block of the base layer block corresponding to the respective block of the inter-layer picture, the identified sub-block neighbored on each side by a respective sub-block of the plurality of sub-blocks of the base layer block; selecting respective prediction mode data associated with the identified sub-block of the plurality of sub-blocks of the base layer block; upsampling the selected respective prediction mode data associated with the identified sub-block of the plurality of sub-blocks of the base layer block; and associating the upsampled prediction mode data with the respective block of the inter-layer picture; adding the inter-layer picture to a reference picture list, pictures within the reference picture list being usable for inter-prediction of the enhancement layer having the second spatial second resolution; inter-predicting a first block of a first picture of the one or more pictures of the enhancement layer from the inter-layer picture to form a predicted block for the first block, wherein inter-predicting the first block comprises coding a reference index of the first block, the reference index identifying the inter-layer picture in the reference picture list; and predicting prediction mode data of a second block of a second picture of the one or more pictures of the enhancement layer using the upsampled prediction mode data of the inter-layer picture of reference picture list. - View Dependent Claims (20, 21)
-
Specification