Method for high-quality compression of binary text images
First Claim
1. Method for high-quality compression of binary text images for storage and possible future processing including transmission over data links, involving the scanning thereof on the record carrier on which they are presented in a pixel-by-pixel raster scan and deriving a string of digital data from the scanner output, characterized by the step of determining the degree of compressibility of the individual features of the original image based on the nature of the scan data derived from a specific vicinity of pixels in the original image, and, depending on the frequency of the information content of the data from said vicinity, assigning one of at least two different compression ratios to the data from each vicinity, and compressing said data in accordance with said one compression ratio.
2 Assignments
0 Petitions
Accused Products
Abstract
The invention relates to a method for the compression and decompression of binary test images. The method distinguishes between large low-frequency areas and small high-frequency areas in the original frame. For the low-frequency areas, a scheme for lossy compression is used, whereas for the high-frequency areas, a scheme permitting lossless compression is applied. The compression/decompression process involves five stages; namely prefiltering to remove all black patches (e.g. by removing all black pixels, except where they belong to a large black segment), fast evaluation of compressibility by partitioning the images into mutually exclusive segments and applying different compression modes to each segment, connectivity-oriented subsampling to reduce the reslolution in horizontal and vertical directions which cause the image to be segmented into blocks and a 1-pixel representation for each block is determined, lossless compression and decompression where the reduced file is compressed by conventional techniques, and reconstruction by sequence reversal so that lossless decompression will retrieve the subsampled file, expansion of the subsampled file through replacement of each pixel by a block having equal value and postfiltering.
120 Citations
12 Claims
- 1. Method for high-quality compression of binary text images for storage and possible future processing including transmission over data links, involving the scanning thereof on the record carrier on which they are presented in a pixel-by-pixel raster scan and deriving a string of digital data from the scanner output, characterized by the step of determining the degree of compressibility of the individual features of the original image based on the nature of the scan data derived from a specific vicinity of pixels in the original image, and, depending on the frequency of the information content of the data from said vicinity, assigning one of at least two different compression ratios to the data from each vicinity, and compressing said data in accordance with said one compression ratio.
- 8. Method for high-quality compression of binary text images for storage and possible future processing including transmission over data links, involving the scanning thereof on the record carrier on which they are presented in a pixel-by-pixel raster scan and deriving a string of digital data from the scanner output, characterized by the step of determining the degree of compressibility of the individual features of the original image based on the nature of the scan data and depending on the frequency of the information content of the data derived from a specific vicinity of pixels in the original document, said vicinity of pixels are defined by partitioning the entire image area into mutually exclusive segments of uniform size, that for each segment the number (m) of nonzero bytes is counted and each such byte is translated into a binary number which equals 1 if the byte under consideration contains a small black or white interval, that is, that byte does not equal either 00000000 or 11111111 and the first bit of the byte equals the last one, and which equals 0 in all other cases, that the number (n) of pairs of consecutive "1'"'"'s" is counted, and that lossy compression is performed if n is smaller than a predetermined threshold and/or if n is smaller than a predetermined fraction of m, assigning one of at least two different compression ratios to the data from each vicinity, and compressing said data in accordance with said one compression ratio.
-
12. Method for high-quality compression of binary text images for storage and possible future processing including transmission over data links, involving the scanning thereof on the record carrier on which they are presented in a pixel-by-pixel raster scan and deriving a string of digital data from the scanner output, characterized by the step of determining the degree of compressibility of the individual features of the original image based on the nature of the scan data derived from a specific vicinity of pixels in the original image, and depending on the frequency of the information content of the data from said vicinity, assigning one of at least two different compression ratios to the data from each vicinity, and compressing said data in accordance with said one compression ratio, with a compression ratio being determined by a compression operation involving a reconstruction and postfiltering step applicable to vicinities in accordance with an array of the type ##EQU14## which is executed as follows:
- - if C=1 and G=F=H=0 (case of a small curve), then said array should be transformed to the following new array;
##EQU15## - and if C=D=E=F=0 (case of a diagonal line), then said array should be transformed to the following new array;
##EQU16## in order to preserve curves and sharp angles between lines, and to smooth the reconstructions of diagonal lines.
- - if C=1 and G=F=H=0 (case of a small curve), then said array should be transformed to the following new array;
Specification