VIDEO CODING SYSTEM USING TEXTURE ANALYSIS AND SYNTHESIS IN A SCALABLE CODING FRAMEWORK
First Claim
1. A method of efficiently coding a high-resolution video sequence, comprising:
- spatial downsampling of a high-resolution video sequence received as input to obtain downsampled video having a lower spatial resolution video signal;
coding of a base-layer of said downsampled video utilizing conventional video coding;
performing texture analysis and classification from said downsampled video;
determining structural information from said decoded base-layer;
performing texture synthesis on blocks, which are determined to be texture synthesis compatible, in response to said structural information;
coding an enhancement-layer in response to structure analysis of the base-layer for acceptably synthesized blocks and using a traditional scalable coding framework for unacceptably synthesized blocks; and
outputting at least one bit stream containing the coded base-layer and enhancement-layer.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus or method of efficiently coding high resolution video using texture analysis and synthesis techniques in a scalable video coding framework. A high-resolution video signal is spatially downsampled and encoded into a base-layer. Texture and structure information are extracted from the downsampled signal and base-layer for use by a texture synthesizer. The texture synthesizer is structurally and texturally aware utilizing edge information from a base-layer of the synthesizer to improve synthesis. After synthesis, a video quality assessor directs enhancement-layer coding of unacceptably synthesized areas by alternate (non-textural) coding means, such as conventional AVC or MPEG-2 coding. In one mode, the quality assessor iteratively improves synthesis of certain blocks to make them acceptable for enhancement-layer coding. The apparatus or method outputs a bit stream, or bit streams, containing both the coded base-layer and the enhancement-layer.
91 Citations
20 Claims
-
1. A method of efficiently coding a high-resolution video sequence, comprising:
-
spatial downsampling of a high-resolution video sequence received as input to obtain downsampled video having a lower spatial resolution video signal; coding of a base-layer of said downsampled video utilizing conventional video coding; performing texture analysis and classification from said downsampled video; determining structural information from said decoded base-layer; performing texture synthesis on blocks, which are determined to be texture synthesis compatible, in response to said structural information; coding an enhancement-layer in response to structure analysis of the base-layer for acceptably synthesized blocks and using a traditional scalable coding framework for unacceptably synthesized blocks; and outputting at least one bit stream containing the coded base-layer and enhancement-layer. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method of efficiently coding a high-resolution video sequence, comprising:
-
spatial downsampling of a high-resolution video sequence received as input to obtain downsampled video having a lower spatial resolution video signal; coding of a base-layer of said downsampled video utilizing conventional video coding; performing texture analysis of said downsampled video into texture blocks (T blocks) upon which texture synthesis can be performed, and non-texture blocks (NT blocks) upon which alternate non-textural coding is to be performed; performing texture classification of T blocks into various categories varying from highly rigid to non-rigid so that effective synthesis techniques can be performed dependent upon the classification; obtaining structural information from said base-layer; performing texture synthesis in response to said structural information; performing texture quality assessment of texture blocks to classify visual acceptability of synthesized areas as acceptable, non-acceptable and marginally non-acceptable; wherein determining of acceptably synthesized blocks is performed by quality assessor programming which utilizes iterative processing toward bringing marginally non-acceptable blocks up to an acceptable level of visual quality; coding an enhancement-layer in response to structure analysis of the base-layer for acceptably synthesized blocks and using said alternate non-textural coding for unacceptably synthesized blocks; and outputting at least one bit stream containing the coded base-layer and the enhancement-layer. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. An apparatus for efficient coding of a high-resolution video sequence, comprising:
-
a computer configured for processing high-definition video signals; and programming executable on said computer for, spatial downsampling of a high-resolution video sequence input to obtain downsampled video having a lower spatial resolution video signal, coding of a base-layer of said downsampled video into a decoded base layer utilizing non-textural video coding, performing texture analysis and classification, determining structural information from said decoded base-layer, performing texture synthesis on blocks, which are determined to be texture synthesis compatible, in response to said structural information from said decoded base layer, coding an enhancement-layer in response to structural information from the coded base-layer for blocks which were acceptably synthesized, and using a non-textural coding framework for blocks which were not synthesized or for which acceptable synthesized results were not obtained, and outputting at least one bit stream containing the coded base-layer and the enhancement-layer.
-
-
20. A computer-readable media containing a computer program executable on a computer configured for processing high-resolution video and causing the computer to generate scalable coding in response to steps, comprising:
-
spatial downsampling of a high-resolution video sequence received as input to obtain downsampled video having a lower spatial resolution video signal; coding of a base-layer of said downsampled video into a decoded base layer utilizing non-textural video coding; performing texture analysis and classification; determining structural information from said decoded base-layer; performing texture synthesis on blocks, which are determined to be texture synthesis compatible, in response to receipt said structural information obtained from said decoded base layer; coding an enhancement-layer in response to structural information from the base-layer for blocks which were acceptably synthesized, and using a non-textural coding framework for blocks which were not synthesized or for which acceptable synthesized results were not obtained; and outputting at least one bit stream containing the coded base-layer and the enhancement-layer.
-
Specification