SCALABLE VIDEO CODING WITH FILTERING OF LOWER LAYERS
Abstract
A first improvement is described for predicting the motion vectors used to predict video data in an enhancement layer. Because pixelblock partitioning may differ arbitrarily between base layer data and enhancement layer data, it is difficult to identify which base layer motion vectors should serve as prediction sources for enhancement layer motion vectors. The disclosed method develops enhancement layer motion vectors by scaling a base layer pixelblock partition map according to a size difference between the base layer video image and the enhancement layer video image, then identifying the scaled base layer pixelblocks that are co-located with the enhancement layer pixelblock for which motion vector prediction is to be performed. Motion vectors from the scaled co-located base layer pixelblocks are averaged, weighted according to a degree of overlap between each base layer pixelblock and the enhancement layer pixelblock. Another improvement is obtained by filtering recovered base layer image data before it is provided to an enhancement layer decoder. When a specified filter requires image data outside a prediction region available from a base layer decoder, the prediction region data may be supplemented with previously-decoded data from an enhancement layer at a border of the prediction region.
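The overlap-weighted averaging described in the abstract can be illustrated with a minimal Python sketch. The `Block` type, the function names, the use of rectangular overlap area as the "degree of overlap", and the separate horizontal and vertical scale factors `sx` and `sy` are all assumptions made for illustration; they are not taken from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    """Hypothetical pixelblock: top-left corner, size, and motion vector."""
    x: float
    y: float
    w: float
    h: float
    mv_x: float = 0.0
    mv_y: float = 0.0


def scale_block(b, sx, sy):
    # Scale both the block geometry and its motion vector by the size difference.
    return Block(b.x * sx, b.y * sy, b.w * sx, b.h * sy, b.mv_x * sx, b.mv_y * sy)


def overlap_area(a, b):
    # Area of the rectangle shared by two blocks (0 if they do not intersect).
    w = min(a.x + a.w, b.x + b.w) - max(a.x, b.x)
    h = min(a.y + a.h, b.y + b.h) - max(a.y, b.y)
    return max(w, 0.0) * max(h, 0.0)


def predict_mv(enh_block, base_blocks, sx, sy):
    """Overlap-weighted average of the scaled, co-located base layer motion vectors."""
    sum_x = sum_y = total = 0.0
    for b in base_blocks:
        scaled = scale_block(b, sx, sy)
        area = overlap_area(enh_block, scaled)
        if area > 0.0:  # only co-located blocks contribute
            sum_x += scaled.mv_x * area
            sum_y += scaled.mv_y * area
            total += area
    if total == 0.0:
        return None  # no co-located base layer block
    return (sum_x / total, sum_y / total)


# Two 8x8 base layer blocks, enhancement layer twice as large in each dimension.
base = [Block(0, 0, 8, 8, 2, 0), Block(8, 0, 8, 8, 4, 2)]
enh = Block(10, 0, 8, 8)
print(predict_mv(enh, base, 2.0, 2.0))  # (5.0, 1.0)
```

In this example the enhancement layer block overlaps the first scaled base block over 48 pixels and the second over 16, so the first block's scaled vector (4, 0) receives three times the weight of the second block's scaled vector (8, 4).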
20 Claims
1. A method of predicting motion vectors in a multi-layer video decoding process, comprising:
determining a size difference between recovered video data obtained solely by a base layer decode process and recovered video data obtained from an enhancement layer decode process;
scaling a base layer pixelblock partition map according to the determined size difference;
predicting a motion vector of an enhancement layer pixelblock by:
determining which base layer pixelblock(s), when scaled according to the size difference, are co-located with the enhancement layer pixelblock,
scaling motion vectors of the co-located base layer pixelblock(s) according to the size difference, and
averaging the scaled motion vectors of the co-located base layer pixelblock(s), wherein the averaging weights the contribution of each scaled motion vector according to a degree of overlap between the enhancement layer pixelblock and the respective scaled base layer pixelblock.
Dependent claims: 2, 3, 4, 5
6. A multi-layer video decoder, comprising:
a base layer decoder to generate recovered base layer image data from base layer coded video provided in a channel according to temporal prediction techniques, and
an enhancement layer decoder to generate recovered enhancement layer image data from enhancement layer coded video provided in a channel according to temporal prediction techniques, comprising:
a partition map that stores information representing pixelblock partitioning of the base layer image data and of the enhancement layer image data, and
a motion compensation predictor that predicts recovered enhancement layer image data from previously decoded image data according to motion vectors, a motion vector of at least one enhancement layer pixelblock being predicted by:
determining which base layer pixelblock(s), when scaled according to a size difference between base layer pixelblocks and enhancement layer pixelblocks, are co-located with the enhancement layer pixelblock,
scaling motion vectors of the co-located base layer pixelblock(s) according to the size difference, and
averaging the scaled motion vectors of the co-located base layer pixelblock(s), wherein the averaging weights the contribution of each scaled motion vector according to a degree of overlap between the enhancement layer pixelblock and the respective scaled base layer pixelblock.
Dependent claims: 7, 8, 9, 10
11. A video decoding method comprising:
decoding recovered prediction region data from base layer coded video provided in a channel according to temporal prediction techniques,
generating composite image data as a merger of the recovered prediction region data with border data taken from previously-decoded recovered enhancement layer data,
filtering the composite image data, and
generating new recovered enhancement layer image data from the filtered composite image data and from enhancement layer coded video provided in a channel according to temporal prediction techniques.
Dependent claims: 12, 13, 14, 15
16. A video decoder, comprising:
a base layer decoder to generate recovered base layer image data from base layer coded video provided in a channel according to temporal prediction techniques;
an enhancement layer decoder to generate recovered enhancement layer image data from enhancement layer coded video provided in a channel according to temporal prediction techniques, the enhancement layer decoder having storage for reference frames of recovered enhancement layer image data;
a composite image generator having inputs for recovered base layer image data and reference frames of recovered enhancement layer image data, the generator to merge prediction region data from the recovered base layer image data with a border region from the reference frames of recovered enhancement layer image data, the prediction region having been scaled to account for any size difference between the recovered base layer data and the recovered enhancement layer image data, wherein the border region is taken from a spatial area that borders a spatial area occupied by the prediction region;
a filter that applies image filtering to the merged data, wherein an output of the filter is input to the enhancement layer decoder as reference image data for temporal prediction.
Dependent claims: 17, 18, 19, 20
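The composite-then-filter arrangement recited in claims 11 and 16 can be sketched as follows. This is only an illustration under stated assumptions: frames are plain lists of rows, the prediction region is already scaled to enhancement layer size, the filter is a 3x3 box filter (the claims do not name a specific filter), and the function names `build_composite` and `box_filter_3x3` are hypothetical.

```python
def build_composite(pred, ref, top, left, border):
    """Merge a prediction region with surrounding border pixels.

    pred   -- prediction region from the base layer (already scaled), list of rows
    ref    -- previously decoded enhancement layer reference frame, list of rows
    top, left -- position of the prediction region inside the frame
    border -- border width in pixels taken from ref around the region
    """
    h, w = len(pred), len(pred[0])
    max_r, max_c = len(ref) - 1, len(ref[0]) - 1
    composite = []
    for r in range(top - border, top + h + border):
        row = []
        for c in range(left - border, left + w + border):
            if top <= r < top + h and left <= c < left + w:
                row.append(pred[r - top][c - left])  # base layer data
            else:
                # Border data from the enhancement layer, clamped at frame edges.
                rr = min(max(r, 0), max_r)
                cc = min(max(c, 0), max_c)
                row.append(ref[rr][cc])
        composite.append(row)
    return composite


def box_filter_3x3(composite, border):
    """Filter only the prediction region; requires border >= 1."""
    h = len(composite) - 2 * border
    w = len(composite[0]) - 2 * border
    out = []
    for r in range(border, border + h):
        row = []
        for c in range(border, border + w):
            total = sum(composite[r + dr][c + dc]
                        for dr in (-1, 0, 1) for dc in (-1, 0, 1))
            row.append(total / 9)
        out.append(row)
    return out


# A 2x2 prediction region of zeros inside a 4x4 reference frame of nines.
ref = [[9] * 4 for _ in range(4)]
pred = [[0, 0], [0, 0]]
composite = build_composite(pred, ref, top=1, left=1, border=1)
print(box_filter_3x3(composite, border=1))  # [[5.0, 5.0], [5.0, 5.0]]
```

Each filtered pixel here averages four prediction region samples and five border samples, which shows how the border data lets the filter run up to the edge of the prediction region without needing base layer data that is not available.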
Specification