Extracting user data from a scanned image of a pre-printed form
First Claim
1. A method for extracting data pixels from a composite image, the composite image consisting of digital data defining a rectilinear array of pixels representative of a pre-printed form having data entered thereon, the method comprising the steps of:
- identifying all shared pixels in the composite image;
defining as non-shared data pixels all (black) data pixels that are not common to the pre-printed form;
attaching to the non-shared data pixels all shared pixels that are non-diagonally adjacent to at least one non-shared data pixel; and
removing from the composite image all pixels that are part of the pre-printed form except the attached shared pixels, thereby forming a data image having at least a useful approximation of the data pixels.
2 Assignments
0 Petitions
Accused Products
Abstract
Human and machine readability of pre-printed forms that have been completed with user data is impeded where the user data overlaps zone descriptions, constraint boxes or other markings of the pre-printed form. A “form fracturing” methodology is described that includes processing the composite-image data so as to attach one or more shared pixels to a non-diagonally adjacent data pixel. The remaining form pixels can be removed, resulting in at least a useful approximation and often a complete recovery of the user data. Where blank-form data is not available, a “virtual dropout” technique allows for recovering user data from a pre-printed form using limited speckle size and configurations, constraining gray-scale value, or a combination of the two. The disclosed methodologies are conveniently implemented in software on any digital processor.
31 Citations
13 Claims
-
1. A method for extracting data pixels from a composite image, the composite image consisting of digital data defining a rectilinear array of pixels representative of a pre-printed form having data entered thereon, the method comprising the steps of:
-
identifying all shared pixels in the composite image;
defining as non-shared data pixels all (black) data pixels that are not common to the pre-printed form;
attaching to the non-shared data pixels all shared pixels that are non-diagonally adjacent to at least one non-shared data pixel; and
removing from the composite image all pixels that are part of the pre-printed form except the attached shared pixels, thereby forming a data image having at least a useful approximation of the data pixels. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-implemented method for extracting user data from a composite image, the composite image consisting of digital data defining a matrix of pixels responsive to user data entered on a pre-printed form, or a portion thereof, the method comprising the steps of:
-
receiving a first matrix F of form pixel data defining the pre-printed form;
receiving a second matrix X of composite image pixel data defining the composite image;
determining an intersection of the F and X matrices to form a third matrix S defining shared pixels;
subtracting the third matrix S from the second matrix X to form a fourth matrix Y defining non-shared user data pixels;
comparing the shared pixels S to the non-shared user data pixels Y to identify all shared pixels that are non-diagonally adjacent to at least one non-shared user data pixel, to form a fifth matrix Z defining attached pixels; and
thensubtracting the pre-printed form pixels F, except for the attached pixels Z, from the composite image pixels X to recover at least a useful approximation of the user data pixels. - View Dependent Claims (12, 13)
-
Specification