POST-OCR IMAGE SEGMENTATION INTO SPATIALLY SEPARATED TEXT ZONES

US 20070041642A1
Filed: 08/18/2006
Published: 02/22/2007
Est. Priority Date: 08/18/2005
Status: Abandoned Application

First Claim

Patent Images

1. A computer based method of processing text on a document comprising:

receiving an electronic image of a document with text;

processing the electronic image to obtain words and word positions for the text on the document;

generating word bounding boxes around each word based;

dilating the word bounding boxes by a dilation factor; and

grouping together the words that have intersecting word bounding boxes intersect.

View all claims

4 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

This invention describes a post-recognition procedure to group text recognized by an Optical Character Reader (OCR) from a document image into zones. Once the recognized text and the corresponding word bounding boxes for each word of the text are received, the procedure described dilates (expands) these word bounding boxes by a factor and records those which cross. Two word bounding boxes will cross upon dilation if the corresponding words are very close to each other on the original document. The text is then grouped into zones using the rule that two words will belong to the same zone if their word bounding boxes cross upon dilation. The text zones thus identified are sorted and returned.

60 Citations

View as Search Results

16 Claims

1. A computer based method of processing text on a document comprising:
- receiving an electronic image of a document with text;
  
  processing the electronic image to obtain words and word positions for the text on the document;
  
  generating word bounding boxes around each word based;
  
  dilating the word bounding boxes by a dilation factor; and
  
  grouping together the words that have intersecting word bounding boxes intersect.
- View Dependent Claims (2, 3, 4, 5, 6, 7)
- - 2. The method of claim 1 wherein the step of grouping is accomplished by:
    - creating a vertex for each word bounding box;
      
      connecting with lines the vertices that represent word bounding boxes that overlap; and
      
      grouping together the words that are represented by vertices that are interconnected with lines.
  - 3. The method of claim 1 wherein the word bounding boxes are generated based upon the position word edges.
  - 4. The method of claim 1 wherein the dilation factor is preset or is adjusted during the process of dilation.
  - 5. The method of claim 1 wherein the dilation factor is approximately in the range of 0.1 and 0.3.
  - 6. The method of claim 1 wherein the document is a receipt, business card, invoice, article or web page.
  - 7. The method of claim 1 wherein the image is created by scanning, digital photography or faxing.

8. A computer system of processing text on a document comprising:
- a scanning device for creating a electronic image of the document;
  
  a computing device in communication with the scanning device; and
  
  software execution on the scanning device or the computing device for performing the following steps;
  
  processing the electronic image to obtain words and position of word edges for the text on the document;
  
  generating word bounding boxes around each word based on the word edges;
  
  dilating the word bounding boxes by a dilation factor; and
  
  grouping together the words that have intersecting word bounding boxes intersect.
- View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16)
- - 9. The computer system of claim 8 wherein the step of grouping is accomplished by:
    - creating a vertex for each word bounding box;
      
      connecting with lines the vertices that represent word bounding boxes that overlap;
      
      and grouping together the words that are represented by vertices that are interconnected with lines.
  - 10. The computer system of claim 8 wherein the word bound boxes are generated based upon the position word edges.
  - 11. The computer system of claim 8 wherein the dilation factor is preset or is adjusted during the process of dilation.
  - 12. The computer system of claim 8 wherein the dilation factor is approximately in the range of 0.1 and 0.3.
  - 13. The computer system of claim 8 wherein the document is a receipt, business card, invoice, article or web page.
  - 14. The computer system of claim 8 wherein the image is created by scanning, digital photography, or faxing.
  - 15. The computer system of claim 8 wherein the scanning device is an optical scanner, fax, or digital camera.
  - 16. The computer system of claim 8 wherein the scanning device is stationary or portable.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Digital Businesses Processess, Inc.
Original Assignee
Digital Businesses Processess, Inc.
Inventors
SINGH, Sarabjit, SPERO, Leslie, ROMANOFF, Harris

Application Number

US11/465,505
Publication Number

US 20070041642A1
Time in Patent Office

Days
Field of Search
US Class Current

382/177
CPC Class Codes

G06V 30/414 Extracting the geometrical ...

POST-OCR IMAGE SEGMENTATION INTO SPATIALLY SEPARATED TEXT ZONES

First Claim

4 Assignments

0 Petitions

Accused Products

Abstract

60 Citations

16 Claims

Specification

Solutions

Use Cases

Quick Links

POST-OCR IMAGE SEGMENTATION INTO SPATIALLY SEPARATED TEXT ZONES

First Claim

4 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

60 Citations

16 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links