×

Encoding method for the compression of a video sequence

  • US 6,519,284 B1
  • Filed: 07/14/2000
  • Issued: 02/11/2003
  • Est. Priority Date: 07/20/1999
  • Status: Expired due to Term
First Claim
Patent Images

1. An encoding method for the compression of a video sequence divided in groups of frames decomposed by means of a tridimensional (3D) wavelet transform leading to a given number of successive resolution levels, said method being based on a hierarchical subband encoding process called “

  • set partitioning in hierarchical trees”

    (SPIHT) and leading from the original set of picture elements (pixels) of each group of frames to transform coefficients encoded with a binary format and constituting a hierarchical pyramid, said coefficients being ordered by means of magnitude tests involving the pixels represented by three ordered lists called list of insignificant sets (LIS), list of insignificant pixels (LIP) and list of significant pixels (LSP), said tests being carried out in order to divide said original set of picture elements into partitioning subsets according to a division process that continues until each significant coefficient is encoded within said binary representation, and a spatio-temporal orientation tree—

    in which the roots are formed with the pixels of the approximation subband resulting from the 3D wavelet transform and the offspring of each of these pixels is formed with the pixels of the higher subbands corresponding to the image volume defined by these root pixels—

    defining the spatio-temporal relationship inside said hierarchical pyramid, said method, applied to the band-pass subbands of the spatio-temporal tree, being further characterized in that;

    (A) a vectorial differential pulse code modulation (DPCM) is used to separately encode the lowest frequency spatio-temporal subband, or approximation subband, according to the following conditions;

    (a) a spatio-temporal predictor, using not only values at the same location in past frames of the video sequence but also neighbouring values in the current frame, is constructed for each vector of coefficients having components in each frame of the approximation subband, said vectorial coding feature coming from the fact that the lowest frequency subband contains spatial low frequency subbands from at least two frames;

    (b) said DPCM uses constant prediction coefficients;

    (B) the quantification of the prediction error is carried out by means of a scalar quantization of the two vector components, followed by an assignment of a unique binary code associated to the probability computed for each given couple of quantized values;

    (C) the binary stream resulting from the steps (A) and (B) is encoded by a lossless process minimizing the entropy of the whole message.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×