POST-OCR IMAGE SEGMENTATION INTO SPATIALLY SEPARATED TEXT ZONES
First Claim
1. A computer based method of processing text on a document comprising:
- receiving an electronic image of a document with text;
processing the electronic image to obtain words and word positions for the text on the document;
generating word bounding boxes around each word based;
dilating the word bounding boxes by a dilation factor; and
grouping together the words that have intersecting word bounding boxes intersect.
4 Assignments
0 Petitions
Accused Products
Abstract
This invention describes a post-recognition procedure to group text recognized by an Optical Character Reader (OCR) from a document image into zones. Once the recognized text and the corresponding word bounding boxes for each word of the text are received, the procedure described dilates (expands) these word bounding boxes by a factor and records those which cross. Two word bounding boxes will cross upon dilation if the corresponding words are very close to each other on the original document. The text is then grouped into zones using the rule that two words will belong to the same zone if their word bounding boxes cross upon dilation. The text zones thus identified are sorted and returned.
60 Citations
16 Claims
-
1. A computer based method of processing text on a document comprising:
-
receiving an electronic image of a document with text;
processing the electronic image to obtain words and word positions for the text on the document;
generating word bounding boxes around each word based;
dilating the word bounding boxes by a dilation factor; and
grouping together the words that have intersecting word bounding boxes intersect. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer system of processing text on a document comprising:
-
a scanning device for creating a electronic image of the document;
a computing device in communication with the scanning device; and
software execution on the scanning device or the computing device for performing the following steps;
processing the electronic image to obtain words and position of word edges for the text on the document;
generating word bounding boxes around each word based on the word edges;
dilating the word bounding boxes by a dilation factor; and
grouping together the words that have intersecting word bounding boxes intersect. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16)
-
Specification