COMPRESSION OF DIGITAL IMAGES OF SCANNED DOCUMENTS
First Claim
1. A method for creating a binary mask image from an inputted digital image of a scanned document, comprising the steps of:
- a) creating a binarized image by binarizing said inputted digital image,b) detecting in said binarized image first text regions representing light text on a dark background in said inputted digital image,c) inverting said first text regions in said binarized image, such that a transformed binary image is formed in which the inverted first text regions are interpretable in the same way as dark text on a light background,wherein the steps of detecting and inverting the first text regions are performed on the basis of pixel blobs contained in the binarized image without interpreting the represented text.
0 Assignments
0 Petitions
Accused Products
Abstract
A first aspect of the invention relates to a method for creating a binary mask image from an a inputted digital image of a scanned document, comprising the steps of creating a binarized image by binarizing the inputted digital image, detecting first text regions representing light text on a dark background, and inverting the first text regions, such that the inverted first text regions are interpretable in the same way as dark text on a light background. A second aspect of the invention relates to a method for comparing in a binary image a first pixel blob with a second pixel blob to determine whether they represent matching symbols, comprising the steps of detecting a line in one blob not present in the other and/or determining if one of the blobs represents an italicized symbol where the other does not.
17 Citations
14 Claims
-
1. A method for creating a binary mask image from an inputted digital image of a scanned document, comprising the steps of:
-
a) creating a binarized image by binarizing said inputted digital image, b) detecting in said binarized image first text regions representing light text on a dark background in said inputted digital image, c) inverting said first text regions in said binarized image, such that a transformed binary image is formed in which the inverted first text regions are interpretable in the same way as dark text on a light background, wherein the steps of detecting and inverting the first text regions are performed on the basis of pixel blobs contained in the binarized image without interpreting the represented text. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for text recognition, comprising the steps of:
-
creating a binary image from an inputted digital image of a scanned document by means of the following steps; a. creating a binarized image by binarizing said inputted digital image; b. detecting in said binarized image first text regions representing light text on a dark background in said inputted digital image; c. detecting in said binarized image second text regions representing dark text on a light background in said inputted digital image; and d. inverting said first text regions in said binarized image, such that a transformed binary image is formed in which the inverted first text regions are interpretable in the same way as the second text regions; wherein the steps of detecting and inverting the first text regions are performed on the basis of pixel blobs contained in the binarized image without interpreting the represented text; and applying a text recognition technique in which the inverted first text regions are recognized along with the second text regions.
-
-
9. A compression method for compressing an inputted digital image of a scanned document, said compression method comprising the steps of:
-
a) segmenting said inputted digital image into multiple image layers comprising a foreground image containing color information for foreground elements of said document, a background image containing color information for background elements of said document and a binary mask image for selecting between pixels in said foreground image and said background image upon decompressing said compressed digital image, and b) compressing each of the image layers by means of a suitable compression technique, thereby obtaining a compressed digital image, wherein creating the binary mask image involves the steps of; creating a binarized image by binarizing said inputted digital image, detecting in said binarized image first text regions representing light text on a dark background in said inputted digital image, inverting said first text regions in said binarized image, such that a transformed binary image is formed in which the inverted first text regions are interpretable in the same way as dark text on a light background, wherein the steps of detecting and inverting the first text regions are performed on the basis of pixel blobs contained in the binarized image without interpreting the represented text. - View Dependent Claims (10, 11, 12)
-
-
13. A computer program product directly loadable into a memory of a computer, comprising software code portions for performing the following steps when said product is run on a computer:
-
a) segmenting an inputted digital image of a scanned document into multiple image layers comprising a foreground image containing color information for foreground elements of said document, a background image containing color information for background elements of said document and a binary mask image for selecting between pixels in said foreground image and said background image upon decompressing said compressed digital image, and b) compressing each of the image layers by means of a suitable compression technique, thereby obtaining a compressed digital image, wherein creating the binary mask image involves the steps of; creating a binarized image by binarizing said inputted digital image, detecting in said binarized image first text regions representing light text on a dark background in said inputted digital image, inverting said first text regions in said binarized image, such that a transformed binary image is formed in which the inverted first text regions are interpretable in the same way as dark text on a light background, wherein the steps of detecting and inverting the first text regions are performed on the basis of pixel blobs contained in the binarized image without interpreting the represented text. - View Dependent Claims (14)
-
Specification