Method for visual optimisation of embedded block codes to exploit visual masking phenomena
First Claim
1. A method of compressing a digital image to produce a scalable bit stream having a plurality of quality layers, the method including the steps of:
- a) decomposing the image into a set of distinct frequency bands using a space frequency transform;
b) partitioning the samples in each frequency band into code blocks;
c) for each code-block, generating an embedded bit-stream to represent the contents of the respective code block;
d) determining a rate-distortion optimal set of truncation points, nil for each code-block, Bi, and each quality layer, l, subject to a constraint on the overall bit-rate or distortion for the layer in a manner which is sensitive to the masking property of the Human Visual System (HVS); and
e) storing the embedded bit-streams for each code-block.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for lossy compression of images reduces visual distortion for a given compressed bit-rate or, equivalently, requires a lower bit-rate for a given level of visual distortion. An image is decomposed using a space-frequency transform and frequency bands are then partitioned into small blocks. The blocks are independently quantized and coded using an embedded block coder, so that each block bit-stream contains a large number of finely spaced truncation points. A visual distortion measure is computed for each block at each truncation point, where the metric is sensitive to masking properties of the Human Visual System. The distortion values and bit-stream lengths corresponding to each block'"'"'s truncation point are used to optimise overall visual distortion at one or more target bit-rates or to minimise the bit-rate corresponding to one or more target visual distortion levels. A computationally and memory efficient procedure is described for computing the visual distortion measure for each block'"'"'s truncation point, within each frequency band, as required by the subject compression system.
90 Citations
29 Claims
-
1. A method of compressing a digital image to produce a scalable bit stream having a plurality of quality layers, the method including the steps of:
-
a) decomposing the image into a set of distinct frequency bands using a space frequency transform;
b) partitioning the samples in each frequency band into code blocks;
c) for each code-block, generating an embedded bit-stream to represent the contents of the respective code block;
d) determining a rate-distortion optimal set of truncation points, nil for each code-block, Bi, and each quality layer, l, subject to a constraint on the overall bit-rate or distortion for the layer in a manner which is sensitive to the masking property of the Human Visual System (HVS); and
e) storing the embedded bit-streams for each code-block. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29)
unpacking the layered compressed bit-stream to recover the truncated embedded bit-streams corresponding to each code-block;
decoding and assembling the code-blocks into a set of frequency bands; and
synthesising a reconstructed image from the frequency bands through the inverse transform.
-
-
28. The method as claimed in claim 27, wherein the blocks are decoded on demand, as the relevant frequency band samples are requested by the inverse transform.
-
29. The method as claimed in claim 27, wherein the synthesis operation proceeds incrementally, requesting frequency samples and using them to synthesise new image samples, as those image samples are requested by the application.
Specification