Page decomposition using local orthogonal transforms and a map optimization
First Claim
Patent Images
1. A computer-implemented method of page decomposition comprising the steps of:
- segmenting an image into a plurality of blocks according to an amount of information contained in each block, wherein the segmenting comprises;
dividing, iteratively, the plurality of blocks into a plurality of child blocks, wherein each of the plurality of blocks has a set of corresponding child blocks;
determining an entropy for each of the plurality of blocks and the set of corresponding child blocks; and
performing a further iteration of the dividing on at least one of the plurality of blocks upon determining the entropy of the at least one of the plurality of blocks is greater than the entropy of the set of corresponding child blocks;
classifying each block as one of a text, an image and a graphics block according to a corresponding frequency spectrum and a corresponding value of a neighborhood; and
refining a classification of each block by iterating the classifying of each block.
3 Assignments
0 Petitions
Accused Products
Abstract
Document processing and understanding can be important for a variety of applications such as office automation, creation of electronic manuals, online documentation and annotation, etc. The first step towards this process can involve the decomposition of a page into separate units describing the text, graphics, pictures etc. Towards that end, a method of page decomposition comprises segmenting an image into a plurality of blocks according to an amount of information contained in each block, classifying each block according to a corresponding frequency spectrum, and refining a classification of each block.
-
Citations
16 Claims
-
1. A computer-implemented method of page decomposition comprising the steps of:
-
segmenting an image into a plurality of blocks according to an amount of information contained in each block, wherein the segmenting comprises; dividing, iteratively, the plurality of blocks into a plurality of child blocks, wherein each of the plurality of blocks has a set of corresponding child blocks; determining an entropy for each of the plurality of blocks and the set of corresponding child blocks; and performing a further iteration of the dividing on at least one of the plurality of blocks upon determining the entropy of the at least one of the plurality of blocks is greater than the entropy of the set of corresponding child blocks; classifying each block as one of a text, an image and a graphics block according to a corresponding frequency spectrum and a corresponding value of a neighborhood; and refining a classification of each block by iterating the classifying of each block. - View Dependent Claims (2, 3, 4, 5, 6, 14, 15, 16)
-
-
7. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for page decomposition, the method steps comprising:
-
segmenting an image into a plurality of blocks according to an amount of information contained in each block, wherein the segmenting comprises; dividing, iteratively, the plurality of blocks into a plurality of child blocks, wherein each of the plurality of blocks has a set of corresponding child blocks; determining an entropy for each of the plurality of blocks and the set of corresponding child blocks; and performing a father iteration of the dividing on at least one of the plurality of blocks upon determined the entropy of the at least one of the plurality of blocks is greater than the entropy of the set of corresponding child blocks; classifying each block as one of a text, an image and a graphics block according to a corresponding frequency spectrum; and refining a classification of each block. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A system adapted to decompose a page comprising:
-
a segmenting module for excepting an image, and dividing the image into a plurality of blocks according to an entropy of each block, wherein the segmenting module divides, iteratively, the plurality of blocks into a plurality of child blocks, wherein each of the plurality of blocks has a set of corresponding child blocks, determines an entropy for each of the plurality of blocks and the set of corresponding child blocks, and performs a further iteration of the dividing on at least one of the plurality of blocks upon determining the entropy of the at least one of the plurality of blocks is greater than the entropy of the set of corresponding child blocks; and a classifying module coupled to the segmenting module for excepting the plurality of blocks, wherein each block is classified as one of a text, an image and a graphics block according to a concentration component, a variance component of the frequency spectrum, and a neighborhood value, and wherein the classifying module iterates through the plurality of blocks and determines a percentage of classification changes of the plurality of blocks from a prior iteration to a threshold, and upon determining the percentage to be at or below the threshold outputting a resulting classified image comprising a plurality of classified blocks.
-
Specification