Data processing system and method for field extraction of scanned images of document forms
First Claim
Patent Images
1. In a data processing system, a method for field extraction of scanned images of document forms, comprising the steps of:
- creating a form definition for each form, which includes a form definition data set;
marking each field which is to be subjected to field extraction, on a master form;
defining a field viewpoint structure for each field, including field viewpoint primitive shapes including line ends, box ends, crossed lines, and blobs;
breaking the area within and surrounding a marked field into regions, an inner region enclosed by the marked field and eight outer regions which surround the marked field;
identifying types and locations of viewpoint primitives in each region;
linking the defined primitive into a view relation;
defining a field viewpoint structure as a set of all the viewpoint primitives and the view relation for a field;
storing the coordinates of the field mark and the field viewpoint structure for each field;
defining a collection of coordinates and field viewpoint structures for all fields on a form as a form definition data set for the form;
storing the form definition data set in the system until a particular filled in copy of the form is scanned into the system.
1 Assignment
0 Petitions
Accused Products
Abstract
An improved data processing system and method are disclosed for field extraction of scanned images of document forms. The process makes use of a viewpoint structure which characterizes the interior and surrounding regions of a field from which an image is to be extracted for character recognition.
-
Citations
2 Claims
-
1. In a data processing system, a method for field extraction of scanned images of document forms, comprising the steps of:
-
creating a form definition for each form, which includes a form definition data set; marking each field which is to be subjected to field extraction, on a master form; defining a field viewpoint structure for each field, including field viewpoint primitive shapes including line ends, box ends, crossed lines, and blobs; breaking the area within and surrounding a marked field into regions, an inner region enclosed by the marked field and eight outer regions which surround the marked field; identifying types and locations of viewpoint primitives in each region; linking the defined primitive into a view relation; defining a field viewpoint structure as a set of all the viewpoint primitives and the view relation for a field; storing the coordinates of the field mark and the field viewpoint structure for each field; defining a collection of coordinates and field viewpoint structures for all fields on a form as a form definition data set for the form; storing the form definition data set in the system until a particular filled in copy of the form is scanned into the system. - View Dependent Claims (2)
-
Specification