SEPARABLE DIRECTIONAL TRANSFORMS

US 20140112387A1
Filed: 12/27/2013
Published: 04/24/2014
Est. Priority Date: 06/15/2007
Status: Active Grant

Abstract

This disclosure describes techniques for transforming residual blocks of video data. In particular, a plurality of different transforms are selectively applied to the residual blocks based on the prediction mode of the video blocks. At least a portion of the plurality of transforms are separable directional transforms, each specifically trained for a corresponding prediction mode to provide better energy compaction for the residual blocks of that prediction mode. Separable directional transforms offer lower computational complexity and smaller storage requirements than non-separable directional transforms. Additionally, the scan order used to scan the coefficients of the residual block may be adjusted when applying separable directional transforms. In particular, the scan order may be adjusted based on statistics associated with one or more previously coded blocks so that non-zero coefficients are more likely to be grouped near the front of the one-dimensional coefficient vector, which improves the effectiveness of entropy coding.
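
The scan-order adjustment described in the abstract can be illustrated with a minimal sketch. It assumes a zigzag scan as the starting order and uses, as the statistic, a per-position count of how often a coefficient was non-zero in previously coded blocks. The class `AdaptiveScan`, the function `zigzag_order`, and the choice of statistic are hypothetical and are not taken from the patent text.

```python
def zigzag_order(n):
    """Positions of an n x n block in zigzag order (assumed initial scan)."""
    positions = [(r, c) for r in range(n) for c in range(n)]
    # Walk anti-diagonals, alternating direction on odd and even diagonals.
    return sorted(positions,
                  key=lambda p: (p[0] + p[1], p[0] if (p[0] + p[1]) % 2 else p[1]))


class AdaptiveScan:
    """Hypothetical helper: reorders the scan using non-zero coefficient counts."""

    def __init__(self, n):
        self.order = zigzag_order(n)
        self.counts = {pos: 0 for pos in self.order}

    def scan(self, block):
        """Flatten a 2-D coefficient block into a 1-D vector in the current order."""
        return [block[r][c] for r, c in self.order]

    def update(self, block):
        """Fold one coded block into the statistics, then re-sort the scan."""
        for r, c in self.counts:
            if block[r][c] != 0:
                self.counts[(r, c)] += 1
        # sorted() is stable, so positions with equal counts keep their order.
        self.order = sorted(self.order, key=lambda pos: -self.counts[pos])


# Example: coefficients concentrated in the first column of the block.
block = [[9, 0, 0, 0],
         [5, 0, 0, 0],
         [3, 0, 0, 0],
         [1, 0, 0, 0]]
scanner = AdaptiveScan(4)
before = scanner.scan(block)   # zigzag order scatters the non-zero values
scanner.update(block)
after = scanner.scan(block)    # non-zero values now lead the vector
```

After one update, the four non-zero coefficients move to the front of the vector, leaving a single run of trailing zeros for the entropy coder. A practical coder would likely update the order less aggressively than once per block; that policy is left open here.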

76 Citations

20 Claims

1. A method of decoding video data, the method comprising:
- decoding, from an encoded video bitstream, data associated with a predicted video block having a partition size, wherein the data comprises a prediction mode for predicting pixel values of the predicted video block and a plurality of transform coefficients indicative of a residual associated with the predicted video block, wherein the prediction mode comprises one of a plurality of prediction modes for predicting pixel values in a specified direction;
  
  selecting, based on the prediction mode, one or more separable transforms from a plurality of separable transforms for the partition size, wherein for a first one of the prediction modes having a first prediction direction and a second one of the prediction modes having a second, different, prediction direction, different transforms are selected based on the prediction mode, and wherein, for the first one of the prediction modes, selecting the one or more transforms comprises selecting a combination of a separable DCT transform and at least one other separable transform;
  
  applying the selected transforms to the plurality of transform coefficients to generate a block of residual values;
  
  generating predicted pixel values of the predicted video block based on the prediction mode; and
  
  generating the predicted video block based on the predicted pixel values and the generated block of residual values.
- Dependent claims (2, 3, 4):
  - 2. The method of claim 1, wherein the first prediction direction is horizontal and the second prediction direction is vertical.
  - 3. The method of claim 1, wherein the separable DCT transform comprises a separable DCT-like integer transform.
  - 4. The method of claim 1, wherein the separable transforms each comprise a column transform matrix of size N×N and a row transform matrix of size N×N, where N×N is a dimension of the partition size of the predicted video block.
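
The decoding steps of claims 1 to 4 can be sketched as follows, under stated assumptions. Each separable transform is modeled as a pair of N×N matrices, a column transform C and a row transform R, with forward transform Y = C·X·Rᵀ and, for orthonormal matrices, inverse X = Cᵀ·Y·R. The table `TRANSFORMS`, the assignment of matrices to modes, and the identity matrix `PLACEHOLDER` (standing in for a trained directional transform, whose values the claims do not give) are all hypothetical.

```python
import math


def matmul(a, b):
    """Plain dense matrix product of two lists of lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(m):
    return [list(row) for row in zip(*m)]


def dct_matrix(n):
    """Orthonormal DCT-II matrix; row k holds the k-th basis vector."""
    return [[math.sqrt((1 if k == 0 else 2) / n)
             * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
             for i in range(n)] for k in range(n)]


N = 4
DCT = dct_matrix(N)
# Hypothetical stand-in for a trained directional transform (must be orthonormal).
PLACEHOLDER = [[1.0 if i == j else 0.0 for j in range(N)] for i in range(N)]

# Assumed mapping: prediction mode -> (column transform, row transform).
TRANSFORMS = {
    "vertical": (PLACEHOLDER, DCT),
    "horizontal": (DCT, PLACEHOLDER),
}


def forward(residual, mode):
    """Y = C * X * R^T, with (C, R) selected by the prediction mode."""
    col, row = TRANSFORMS[mode]
    return matmul(matmul(col, residual), transpose(row))


def inverse(coeffs, mode):
    """X = C^T * Y * R; valid because C and R are orthonormal."""
    col, row = TRANSFORMS[mode]
    return matmul(matmul(transpose(col), coeffs), row)


def reconstruct(prediction, coeffs, mode):
    """Add the inverse-transformed residual to the predicted pixel values."""
    residual = inverse(coeffs, mode)
    return [[p + r for p, r in zip(p_row, r_row)]
            for p_row, r_row in zip(prediction, residual)]
```

Pairing a DCT with one other matrix is only one reading of "a combination of a separable DCT transform and at least one other separable transform"; the sketch does not settle how the claim should be construed.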

5. A method of encoding video data, the method comprising:
- generating predicted pixel values of a video block having a partition size based on a prediction mode, wherein the prediction mode comprises one of a plurality of prediction modes for predicting pixel values in a specified direction;
  
  generating a block of residual values based on the video block and the predicted pixel values;
  
  selecting, based on the prediction mode, one or more separable transforms from a plurality of separable transforms for the partition size, wherein for a first one of the prediction modes having a first prediction direction and a second one of the prediction modes having a second, different, prediction direction, different transforms are selected based on the prediction mode, and wherein, for the first one of the prediction modes, selecting the one or more transforms comprises selecting a combination of a separable DCT transform and at least one other separable transform;
  
  applying the selected transforms to the block of residual values to generate a plurality of transform coefficients; and
  
  entropy encoding data indicative of the prediction mode and the transform coefficients.
- Dependent claims (6, 7, 8):
  - 6. The method of claim 5, wherein the first prediction direction is horizontal and the second prediction direction is vertical.
  - 7. The method of claim 5, wherein the separable DCT transform comprises a separable DCT-like integer transform.
  - 8. The method of claim 5, wherein the separable transforms each comprise a column transform matrix of size N×N and a row transform matrix of size N×N, where N×N is a dimension of the partition size of the predicted video block.
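
As an illustration of the encoder side and of the "DCT-like integer transform" named in claim 7, the sketch below uses the 4×4 forward core transform matrix of H.264/AVC, which is one well-known transform of that kind. The claims do not say which matrix is meant, so this choice is an assumption, and the function name `encode_block` is hypothetical. Scaling, quantization, and entropy coding are omitted.

```python
# 4x4 forward core transform matrix of H.264/AVC (integer approximation of a DCT).
CORE = [[1, 1, 1, 1],
        [2, 1, -1, -2],
        [1, -1, -1, 1],
        [1, -2, 2, -1]]


def matmul(a, b):
    """Plain dense matrix product of two lists of lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(m):
    return [list(row) for row in zip(*m)]


def encode_block(block, prediction):
    """Form the residual, then apply the separable transform Y = C * X * C^T."""
    residual = [[b - p for b, p in zip(b_row, p_row)]
                for b_row, p_row in zip(block, prediction)]
    return matmul(matmul(CORE, residual), transpose(CORE))


# Example: a flat residual of 3 leaves only the DC coefficient non-zero.
block = [[103] * 4 for _ in range(4)]
prediction = [[100] * 4 for _ in range(4)]
coeffs = encode_block(block, prediction)
```

The rows of `CORE` are mutually orthogonal but not unit length, so a real codec folds the missing normalization into the quantization step. The sketch uses the same matrix for rows and columns; the mode-dependent selection in claim 5 would swap in different column and row matrices.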

9. A device for coding video data, the device comprising:
- a memory configured to store a plurality of separable transforms for use in transforming between residual pixel values of a video block and residual transform coefficients of the video block, each of the plurality of separable transforms being associated with a partition size; and
  
  a processor configured to:
  
  generate predicted pixel values of a video block having a partition size based on a prediction mode, wherein the prediction mode comprises one of a plurality of prediction modes for predicting pixel values in a specified direction;
  
  select, based on the prediction mode, one or more separable transforms from the plurality of separable transforms for the partition size, wherein for a first one of the prediction modes having a first prediction direction and a second one of the prediction modes having a second, different, prediction direction, different transforms are selected based on the prediction mode, and wherein, for the first one of the prediction modes, selecting the one or more transforms comprises selecting a combination of a separable DCT transform and at least one other separable transform; and
  
  apply the selected separable transforms to transform between residual pixel values associated with the predicted pixel values and residual transform coefficients.
- Dependent claims (10, 11, 12, 13, 14, 15, 16):
  - 10. The device of claim 9, wherein the first prediction direction is horizontal and the second prediction direction is vertical.
  - 11. The device of claim 9, wherein the separable DCT transform comprises a separable DCT-like integer transform.
  - 12. The device of claim 9, wherein the separable transforms each comprise a column transform matrix of size N×N and a row transform matrix of size N×N, where N×N is a dimension of the partition size of the predicted video block.
  - 13. The device of claim 9, wherein the processor comprises a video encoder, wherein the transforms are transforms for use in transforming residual pixel values of the video block to residual transform coefficients, and wherein the processor is further configured to:
    - generate residual values of the video block based on the video block and the predicted pixel values;
      
      apply the selected transforms to transform the residual values to the residual transform coefficients; and
      
      encode data indicative of the residual transform coefficients and the prediction mode.
  - 14. The device of claim 9, wherein the processor comprises a video decoder, wherein the transforms are inverse transforms for use in transforming residual transform coefficients to residual pixel values of the video block, and wherein the processor is further configured to:
    - decode data indicative of the prediction mode and residual transform coefficients of the video block;
      
      apply the selected transforms to transform the residual transform coefficients to the residual values of the video block; and
      
      generate a video block based on the residual values of the video block and the predicted video block.
  - 15. The device of claim 13, wherein the device comprises a wireless communication device having a display, the display being configured to display the coded video data.
  - 16. The device of claim 9, wherein the device comprises an integrated circuit device.
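
The memory recited in claim 9 stores the transforms for each partition size, and the abstract states that separable transforms need less storage and computation than non-separable ones. The rough count below makes that comparison concrete. It assumes naive dense matrix multiplication with no fast algorithm, a separable transform stored as two N×N matrices, and a non-separable transform stored as one N²×N² matrix applied to the flattened block. The function name `transform_costs` is hypothetical.

```python
def transform_costs(n):
    """Per-transform storage and multiplication counts for an n x n block."""
    return {
        "separable": {
            "stored_entries": 2 * n * n,    # column matrix + row matrix
            "multiplications": 2 * n ** 3,  # two n x n matrix products
        },
        "non_separable": {
            "stored_entries": n ** 4,       # one (n*n) x (n*n) matrix
            "multiplications": n ** 4,      # matrix times length-(n*n) vector
        },
    }


for size in (4, 8):
    costs = transform_costs(size)
    print(size, costs["separable"], costs["non_separable"])
```

For a 4×4 block this gives 32 stored entries against 256, and 128 multiplications against 256. For an 8×8 block the gap widens to 128 entries against 4096, and 1024 multiplications against 4096. The totals scale with the number of prediction modes that have their own trained transform.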

17. A non-transitory computer-readable medium upon which is stored instructions that upon execution in a device cause the device to code video blocks, wherein the instructions cause the device to:
- generate predicted pixel values of a video block having a partition size based on a prediction mode, wherein the prediction mode comprises one of a plurality of prediction modes for predicting pixel values in a specified direction;
  
  select, based on the prediction mode, one or more separable transforms from a plurality of separable transforms for use in transforming between residual pixel values of a video block and residual transform coefficients of the video block, each of the plurality of separable transforms being associated with a partition size, wherein for a first one of the prediction modes having a first prediction direction and a second one of the prediction modes having a second, different, prediction direction, different transforms are selected based on the prediction mode, and wherein, for the first one of the prediction modes, selecting the one or more transforms comprises selecting a combination of a separable DCT transform and at least one other separable transform; and
  
  apply the selected separable transforms to transform between residual pixel values associated with the predicted pixel values and residual transform coefficients.

18. A device for coding video data, the device comprising:
- means for storing a plurality of separable transforms for use in transforming between residual pixel values of a video block and residual transform coefficients of the video block, each of the plurality of separable transforms being associated with a partition size; and
  
  means for processing video data configured to:
  
  generate predicted pixel values of a video block having a partition size based on a prediction mode, wherein the prediction mode comprises one of a plurality of prediction modes for predicting pixel values in a specified direction;
  
  select, based on the prediction mode, one or more separable transforms from the plurality of separable transforms for the partition size, wherein for a first one of the prediction modes having a first prediction direction and a second one of the prediction modes having a second, different, prediction direction, different transforms are selected based on the prediction mode, and wherein, for the first one of the prediction modes, selecting the one or more transforms comprises selecting a combination of a separable DCT transform and at least one other separable transform; and
  
  apply the selected separable transforms to transform between residual pixel values associated with the predicted pixel values and residual transform coefficients.
- Dependent claims (19, 20):
  - 19. The device of claim 18, wherein the video processing means comprises means for encoding video, wherein the transforms are transforms for use in transforming residual pixel values of the video block to residual transform coefficients, and wherein the video processing means is further configured to:
    - generate residual values of the video block based on the video block and the predicted pixel values;
      
      apply the selected transforms to transform the residual values to the residual transform coefficients; and
      
      encode data indicative of the residual transform coefficients and the prediction mode.
  - 20. The device of claim 18, wherein the video processing means comprises means for decoding video data, wherein the transforms are inverse transforms for use in transforming residual transform coefficients to residual pixel values of the video block, and wherein the video processing means is further configured to:
    - decode data indicative of the prediction mode and residual transform coefficients of the video block;
      
      apply the selected transforms to transform the residual transform coefficients to the residual values of the video block; and
      
      generate a video block based on the residual values of the video block and the predicted video block.

Current Assignee
Qualcomm, Inc.
Original Assignee
Qualcomm, Inc.
Inventors
Ye, Yan; Karczewicz, Marta

Granted Patent

US 9,578,331 B2
US Class Current

375/240.02
CPC Class Codes

H04N 19/103   Selection of coding mode or...

H04N 19/11   among a plurality of spatia...

H04N 19/12   Selection from among a plur...

H04N 19/122   Selection of transform size...

H04N 19/129   Scanning of coding units, e...

H04N 19/13   Adaptive entropy coding, e....

H04N 19/147   according to rate distortio...

H04N 19/157   Assigned coding mode, i.e. ...

H04N 19/176   the region being a block, e...

H04N 19/18   the unit being a set of tra...

H04N 19/19   using optimisation based on...

H04N 19/196   being specially adapted for...

H04N 19/197   including determination of ...

H04N 19/42   characterised by implementa...

H04N 19/46   Embedding additional inform...

H04N 19/463   by compressing encoding par...

H04N 19/48   using compressed domain pro...

H04N 19/593   involving spatial predictio...

H04N 19/61   in combination with predict...

H04N 19/625   using discrete cosine trans...

H04N 19/70   characterised by syntax asp...
