OBJECT EXTRACTION IN COLOUR COMPOUND DOCUMENTS
Abstract
Disclosed is a computer implemented method of text extraction in colour compound documents. The method connects similarly coloured pixels of an image of a colour compound document into connected components (CCs); classifies each CC as either text or non-text; refines the text CC classification for each text CC using global colour context statistics; groups text CCs into text blocks; recovers misclassified non-text CCs into a nearby text block; and removes extraneous CCs from each text block using local colour context statistics, thereby providing the extracted text in the text blocks.
Also disclosed is a computer implemented method of locating graphics objects in a colour compound document image. The method connects similarly coloured pixels of said image into connected components (CCs) and places the CCs in an enclosure tree; classifies (330, 730) each CC into one of a plurality of classes wherein at least one class (862) represents salient graphics components; identifies (1140) a graphics container (441) to perform semantic analysis for each CC of said class representing salient graphics components; profiles (1170) descendents of said graphics container in said tree to obtain semantic context statistics; and decides (1710) whether the graphics container contains a whole or part of a graphics object based on said semantic context statistics.
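The first step common to both disclosed methods, connecting similarly coloured pixels into connected components, can be illustrated with a minimal sketch. It is not the patented implementation: the function name `label_components`, the per-channel tolerance `tol`, the comparison against each region's seed colour, and the use of 4-connectivity are all assumptions made for illustration.

```python
from collections import deque


def label_components(image, tol=30):
    """Label 4-connected regions of similarly coloured pixels.

    image: list of rows, each row a list of (r, g, b) tuples.
    A pixel joins a region when every channel is within `tol` of the
    region's seed colour (an assumed similarity test, not the patent's).
    Returns (labels, count), where labels[y][x] is the region index.
    """
    height, width = len(image), len(image[0])
    labels = [[-1] * width for _ in range(height)]
    count = 0
    for y in range(height):
        for x in range(width):
            if labels[y][x] != -1:
                continue  # already assigned to an earlier region
            seed = image[y][x]
            labels[y][x] = count
            queue = deque([(y, x)])
            while queue:  # breadth-first flood fill from the seed pixel
                cy, cx = queue.popleft()
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if 0 <= ny < height and 0 <= nx < width and labels[ny][nx] == -1:
                        if all(abs(a - b) <= tol for a, b in zip(image[ny][nx], seed)):
                            labels[ny][nx] = count
                            queue.append((ny, nx))
            count += 1
    return labels, count
```

Comparing against the seed colour rather than the neighbouring pixel keeps a slow colour gradient from merging into a single region; whether that is desirable depends on the scanned material.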
12 Claims
1. A computer implemented method of identifying at least one of a photo region and a graphics region in a colour bitmap image of a compound document, said method comprising:
(a) connecting similarly coloured pixels of said image into connected components and placing the connected components in an enclosure tree, the enclosure tree forming a hierarchy of ancestor and descendent connected components where an ancestor connected component encloses a descendent connected component wherein each node of the enclosure tree is a connected component;
(b) classifying each connected component into one of a plurality of classes wherein at least one class represents non-text connected components;
(c) selecting a connected component from one of the non-text connected components of the enclosure tree;
(d) calculating context statistics for the selected connected component based on descendent and touching sibling connected components of the selected connected component;
(e) identifying whether the selected connected component contains at least one of a photo region and a graphics region based on the context statistics; and
(f) storing the selected connected component identified as either graphics or photo object to memory.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A computer readable storage medium having a computer program recorded thereon, the program being executable by computer apparatus to identify at least one of a photo region and a graphics region in a colour bitmap image of a compound document, said program comprising:
code for connecting similarly coloured pixels of said image into connected components and placing the connected components in an enclosure tree, the enclosure tree forming a hierarchy of ancestor and descendent connected components where an ancestor connected component encloses a descendent connected component wherein each node of the enclosure tree is a connected component;
code for classifying each connected component into one of a plurality of classes wherein at least one class represents non-text connected components;
code for selecting a connected component from one of the non-text connected components of the enclosure tree;
code for calculating context statistics for the selected connected component based on descendent and touching sibling connected components of the selected connected component;
code for identifying whether the selected connected component contains at least one of a photo region and a graphics region based on the context statistics; and
code for storing the selected connected component identified as either graphics or photo object to memory.
11. Computer apparatus, said apparatus comprising a processor and a memory coupled to the processor, the memory having recorded thereon a program for identifying at least one of a photo region and a graphics region in a colour bitmap image of a compound document, the image being stored in the memory, the program comprising:
code means executable by the processor for connecting similarly coloured pixels of said image into connected components and placing the connected components in an enclosure tree, the enclosure tree forming a hierarchy of ancestor and descendent connected components where an ancestor connected component encloses a descendent connected component wherein each node of the enclosure tree is a connected component;
code means executable by the processor for classifying each connected component into one of a plurality of classes wherein at least one class represents non-text connected components;
code means executable by the processor for selecting a connected component from one of the non-text connected components of the enclosure tree;
code means executable by the processor for calculating context statistics for the selected connected component based on descendent and touching sibling connected components of the selected connected component;
code means executable by the processor for identifying whether the selected connected component contains at least one of a photo region and a graphics region based on the context statistics; and
code means executable by the processor for storing in the memory the selected connected component identified as either graphics or photo object.
12. A copier comprising:
a scanner for scanning a compound document to form a colour bitmap image;
a processor coupled to the scanner;
a memory coupled to the processor for storing the colour bitmap image, said memory having recorded thereon a program executable by the processor for identifying at least one of a photo region and a graphics region in the colour bitmap image of the compound document, the program comprising:
code means for connecting similarly coloured pixels of said image into connected components and placing the connected components in an enclosure tree, the enclosure tree forming a hierarchy of ancestor and descendent connected components where an ancestor connected component encloses a descendent connected component wherein each node of the enclosure tree is a connected component;
code means for classifying each connected component into one of a plurality of classes wherein at least one class represents non-text connected components;
code means for selecting a connected component from one of the non-text connected components of the enclosure tree;
code means for calculating context statistics for the selected connected component based on descendent and touching sibling connected components of the selected connected component;
code means for identifying whether the selected connected component contains at least one of a photo region and a graphics region based on the context statistics; and
code means for storing in the memory the selected connected component identified as either graphics or photo object.
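The enclosure tree, context statistics and identification steps recited in the independent claims can be pictured with the following sketch. It rests on several assumptions that the claims do not state: connected components are approximated by inclusive bounding boxes, "touching" is approximated by boxes that overlap or abut, and the `identify` rule with its `photo_min` threshold is purely hypothetical. All names (`CC`, `build_enclosure_tree`, `context_stats`, `identify`) are illustrative only.

```python
from dataclasses import dataclass, field


@dataclass
class CC:
    name: str
    bbox: tuple               # (x0, y0, x1, y1), inclusive; assumed stand-in for pixel shape
    kind: str = "non-text"    # class assigned by an earlier classification step
    children: list = field(default_factory=list)


def area(b):
    return (b[2] - b[0] + 1) * (b[3] - b[1] + 1)


def encloses(outer, inner):
    return (outer != inner and outer[0] <= inner[0] and outer[1] <= inner[1]
            and outer[2] >= inner[2] and outer[3] >= inner[3])


def build_enclosure_tree(ccs):
    """Attach each CC to the smallest CC whose box encloses it; return the roots."""
    roots = []
    for cc in ccs:
        parents = [p for p in ccs if p is not cc and encloses(p.bbox, cc.bbox)]
        if parents:
            min(parents, key=lambda p: area(p.bbox)).children.append(cc)
        else:
            roots.append(cc)
    return roots


def descendants(cc):
    for child in cc.children:
        yield child
        yield from descendants(child)


def touches(a, b):
    # Boxes overlap or are directly adjacent (no gap between them).
    return (a[0] <= b[2] + 1 and b[0] <= a[2] + 1
            and a[1] <= b[3] + 1 and b[1] <= a[3] + 1)


def context_stats(cc, siblings):
    """Count descendants and touching siblings of the selected CC."""
    desc = list(descendants(cc))
    return {
        "descendants": len(desc),
        "non_text_descendants": sum(d.kind == "non-text" for d in desc),
        "touching_siblings": sum(s is not cc and touches(cc.bbox, s.bbox) for s in siblings),
    }


def identify(stats, photo_min=3):
    # Hypothetical rule: many non-text fragments suggest a photo,
    # a container with some content suggests graphics, otherwise undecided.
    if stats["non_text_descendants"] + stats["touching_siblings"] >= photo_min:
        return "photo"
    if stats["descendants"] > 0:
        return "graphics"
    return None
```

A real enclosure test would work on pixel topology rather than bounding boxes, since one component can lie inside another's box without being enclosed by it.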
Specification