Motion-compensated prediction of inter-layer residuals
First Claim
1. A method comprising:
- encoding base layer video to produce at least part of a base layer bit stream;
encoding inter-layer residual video to produce at least part of an enhancement layer bit stream, wherein the inter-layer residual video represents differences between the base layer video and input video, and wherein the encoding the inter-layer residual video includes, for a given block of a current picture of the inter-layer residual video;
performing motion compensated prediction of the given block of the current picture of the inter-layer residual video relative to one or more reference pictures of previously reconstructed inter-layer residual video, wherein multiple reference pictures of the previously reconstructed inter-layer residual video are available for use in the motion compensated prediction for the given block of the current picture of the inter-layer residual video;
determining residual values between the given block of the current picture of the inter-layer residual video and the motion compensated prediction for the given block;
performing a frequency transform of the residual values to produce transform coefficients;
quantizing the transform coefficients; and
performing arithmetic coding of the quantized transform coefficients; and
signaling the at least part of the base layer bit stream and the at least part of the enhancement layer bit stream.
3 Assignments
0 Petitions
Accused Products
Abstract
Techniques and tools are described for scalable video encoding and decoding. In some embodiments, an encoding tool encodes base layer video and outputs encoded base layer video in a base layer bit stream. The encoding tool encodes inter-layer residual video (representing differences between input video and reconstructed base layer video) using motion compensation relative to previously reconstructed inter-layer residual video. For the inter-layer residual video, the encoding tool outputs motion information and motion-compensated prediction residuals in an enhancement layer bit stream. A decoding tool receives the base layer bit stream and enhancement layer bit stream, reconstructs base layer video, reconstructs inter-layer residual video, and combines the reconstructed base layer video and reconstructed inter-layer residual video. Using motion compensation for the inter-layer residual video facilitates the use of separate motion vectors and separate codecs for the base layer video and inter-layer residual video.
-
Citations
20 Claims
-
1. A method comprising:
-
encoding base layer video to produce at least part of a base layer bit stream; encoding inter-layer residual video to produce at least part of an enhancement layer bit stream, wherein the inter-layer residual video represents differences between the base layer video and input video, and wherein the encoding the inter-layer residual video includes, for a given block of a current picture of the inter-layer residual video; performing motion compensated prediction of the given block of the current picture of the inter-layer residual video relative to one or more reference pictures of previously reconstructed inter-layer residual video, wherein multiple reference pictures of the previously reconstructed inter-layer residual video are available for use in the motion compensated prediction for the given block of the current picture of the inter-layer residual video; determining residual values between the given block of the current picture of the inter-layer residual video and the motion compensated prediction for the given block; performing a frequency transform of the residual values to produce transform coefficients; quantizing the transform coefficients; and performing arithmetic coding of the quantized transform coefficients; and signaling the at least part of the base layer bit stream and the at least part of the enhancement layer bit stream. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. One or more computer-readable memory storing instructions for causing a computing device programmed thereby to perform a method comprising:
-
decoding at least part of a base layer bit stream to reconstruct base layer video; decoding at least part of an enhancement layer bit stream to reconstruct inter-layer residual video that represents differences between the base layer video and input video from encoding, including, for a given block of a current picture of the inter-layer residual video; using motion compensation to predict the given block of the current picture of the inter-layer residual video relative to one or more reference pictures of previously reconstructed inter-layer residual video, wherein multiple reference pictures of the previously reconstructed inter-layer residual video are available for use in the motion compensation for the given block of the current picture of the inter-layer residual video; performing arithmetic decoding of quantized transform coefficients for residual values between the given block of the current picture of the inter-layer residual video and the motion compensated prediction for the given block; inverse quantizing the quantized transform coefficients; performing an inverse frequency transform on the transform coefficients to reconstruct the residual values for the given block; and combining the reconstructed residual values for the given block and the motion compensated prediction for the given block; and combining the reconstructed base layer video and the reconstructed inter-layer residual video to reconstruct output video. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A system comprising:
-
a base layer decoder for decoding base layer video; an inter-layer prediction residual decoder for decoding inter-layer residual video that represents differences between the base layer video and input video from encoding, wherein the inter-layer prediction residual decoder is configured to; with buffers, store reference pictures of previously reconstructed inter-layer residual video; and with a motion compensator, predict the inter-layer residual video relative to one or more of the reference pictures of the previously reconstructed inter-layer residual video; with an entropy decoder, perform arithmetic decoding of quantized transform coefficients; with an inverse quantizer, inverse quantize the quantized transform coefficients; with an inverse frequency transformer, perform an inverse frequency transform on the transform coefficients to reconstruct residual values; and combine the reconstructed residual values and corresponding motion compensated predictions; and an inverse scaler for scaling samples of the inter-layer residual video from a first sample depth to a second sample depth higher than the first sample depth; wherein the system is further configured to combine the base layer video and the inverse scaled inter-layer residual video to reconstruct output video. - View Dependent Claims (19, 20)
-
Specification