Prediction apparatus and method for improving coding efficiency in scalable video coding
First Claim
1. A prediction method for improving coding efficiency by reducing temporal redundancy in an enhancement error signal of a multiple layer video encoding system, the prediction method comprising:
- dividing an input video sequence into a plurality of video object layers each consisting of a base layer and several enhancement layers, and each of the video object layers are further made up of a series of video object planes (VOP) comprising a two dimensional array of sampled pixel data at a particular time reference;
encoding a VOP of the base layer to obtain a compressed bitstream and a corresponding locally decoded VOP of the base layer;
obtaining a pixel classification criteria from information in current and previous locally decoded VOP of the base layer;
constructing a merged VOP, from a previous locally decoded VOP of an enhancement layer, the previous locally decoded VOP of the base layer, the current locally decoded VOP of the base layer and the pixel classification criteria;
predicting an enhancement layer VOP from;
the merged VOP, the previous VOP in the same layer, and the VOP from the base layer with the same time reference;
entropy coding information for prediction modes as header information for the decoder;
coding prediction errors of the enhancement layer VOP which was predicted according to one of several prediction modes based on a mode decision, together with motion vector information, and transmitting them in a compressed bitstream to a decoder; and
repeating the above prediction steps on the enhancement layers treating a lower of two enhancement layers as the base layer.
1 Assignment
0 Petitions
Accused Products
Abstract
A prediction method, merged method, has been introduced in the enhancement layer of a multiple layer video coding. This merged method was designed to efficiently handle the prediction of the non moving parts in coding of the enhancement layer VOP or frame. All the information for this merged mode prediction is obtained from the base layer, and no additional side information is transmitted. This prediction mode when used together with combination of the existing forward mode, backward mode, and interpolated mode, can improve the coding efficiency for enhancement layer video coding, especially in low bit rate coding. The method can be used in most multiple layer video coding schemes, especially in spatial scalability video coding.
-
Citations
36 Claims
-
1. A prediction method for improving coding efficiency by reducing temporal redundancy in an enhancement error signal of a multiple layer video encoding system, the prediction method comprising:
-
dividing an input video sequence into a plurality of video object layers each consisting of a base layer and several enhancement layers, and each of the video object layers are further made up of a series of video object planes (VOP) comprising a two dimensional array of sampled pixel data at a particular time reference; encoding a VOP of the base layer to obtain a compressed bitstream and a corresponding locally decoded VOP of the base layer; obtaining a pixel classification criteria from information in current and previous locally decoded VOP of the base layer; constructing a merged VOP, from a previous locally decoded VOP of an enhancement layer, the previous locally decoded VOP of the base layer, the current locally decoded VOP of the base layer and the pixel classification criteria; predicting an enhancement layer VOP from;
the merged VOP, the previous VOP in the same layer, and the VOP from the base layer with the same time reference;entropy coding information for prediction modes as header information for the decoder; coding prediction errors of the enhancement layer VOP which was predicted according to one of several prediction modes based on a mode decision, together with motion vector information, and transmitting them in a compressed bitstream to a decoder; and repeating the above prediction steps on the enhancement layers treating a lower of two enhancement layers as the base layer. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 26, 31, 32, 33, 34, 35)
-
-
22. A prediction method for improving coding efficiency by reducing temporal redundancy in an enhancement error signal of a multiple layer video decoding system, the method comprising:
-
decoding a VOP of a base layer from a compressed bitstream to obtain a corresponding decoded VOP of the base layer; obtaining a pixel classification criteria from information in current and previous decoded VOP of the base layer; constructing a merged VOP, from a previous decoded VOP of an enhancement layer, the previous decoded VOP of the base layer, the current decoded VOP of the base layer and the pixel classification criteria; decoding information for prediction modes from header information transmitted by the encoder; decoding prediction errors of an enhancement layer VOP which was predicted according to a transmitted prediction mode, together with motion vector information received in the compressed bitstream; reconstructing the enhancement layer VOP from the merged VOP, the previous VOP in the same layer, and the VOP from the base layer with a same time reference; and repeating the above reconstruction steps on the enhancement layer treating a lower of two enhancement layers as the base layer. - View Dependent Claims (23, 24, 25, 27, 28, 29, 30)
-
-
36. A prediction apparatus for improving coding efficiency by reducing the temporal redundancy in a enhancement error signal of a multiple layer video encoding system, the apparatus comprising:
-
a divider which divides an input video sequence into a plurality of video object layers consisting of a base layer and several enhancement layers, and each video object layer further comprises a series of video object planes (VOP) comprising a two dimensional array of sampled pixel data at a particular time reference; an encoder which encodes a VOP of the base layer to obtain a compressed bitstream and a corresponding locally decoded VOP of the base layer, an obtaining part which obtains a pixel classification criteria from information in the current and previous locally decoded VOP of the base layer; a constructing part which constructs a merged VOP, from a previous locally decoded VOP of an enhancement layer, the previous locally decoded VOP of the base layer, the current locally decoded VOP of the base layer and the pixel classification criteria; a prediction part which predicts an enhancement layer VOP from;
the merged VOP, the previous VOP in the same layer, and the VOP from the base layer with the same time reference;a coder which entropy codes information for prediction modes as header information for the decoder; and a coder which codes prediction errors of the enhancement layer VOP which was predicted according to one of several prediction modes based on a mode decision, together with motion vector information, and transmitting them in a compressed bitstream to a decoder; wherein the prediction apparatus further operates by treating a lower of two enhancement layers as the base layer.
-
Specification