Batched character image processing
First Claim
1. For use in a process for recognizing characters on a batch of documents wherein at least a portion of a document is scanned to produce and store scan data signals reflecting the presence of character elements at particular positions on the document;
- the characters represented by said stored scan data signals being initially analyzed by recognition procedures including that of processing the stored character scan data signals by a recognition algorithm to establish initial identifications of the characters;
the method of verifying the identity of characters initially identified by said procedures comprising the following steps;
(1) storing character images with purposely inserted bogus identities to be presented together with initially recognized characters to develop verification accuracy statistics;
(2) sorting the stored initially recognized characters and characters with bogus identities into groups wherein the characters of each group have a common identity characterization;
(3) simultaneously presenting images of a number of characters of at least one of said groups;
(4) determining by inspection the presence of a character in at least one of said presented groups failing to have the identity characterization common to the group to which said character belongs;
(5) determining by inspection the correct identity of a character failing to have the identity characterization common to the group to which said character belongs;
(6) developing a corrected identity signal for such correct identity; and
(7) utilizing said corrected identity signal for an initially-incorrectly-identified character to create at least part of an output text for said batch of documents.
1 Assignment
0 Petitions
Accused Products
Abstract
Character recognition processing wherein each of a batch of documents is scanned to produce corresponding scan data signals forming a rectilinear data array of binary bits at the intersections of a rectangular coordinate grid. These signals are stored and processed by a recognition algorithm to produce identity signals for recognized characters. Groups of non-recognized characters are presented simultaneously to permit rapid identification by inspection. The identification of recognized characters is verified at high speed by simultaneously presenting the character images as respective groups sorted to have the same recognized identities.
16 Citations
6 Claims
-
1. For use in a process for recognizing characters on a batch of documents wherein at least a portion of a document is scanned to produce and store scan data signals reflecting the presence of character elements at particular positions on the document;
- the characters represented by said stored scan data signals being initially analyzed by recognition procedures including that of processing the stored character scan data signals by a recognition algorithm to establish initial identifications of the characters;
the method of verifying the identity of characters initially identified by said procedures comprising the following steps; (1) storing character images with purposely inserted bogus identities to be presented together with initially recognized characters to develop verification accuracy statistics; (2) sorting the stored initially recognized characters and characters with bogus identities into groups wherein the characters of each group have a common identity characterization; (3) simultaneously presenting images of a number of characters of at least one of said groups; (4) determining by inspection the presence of a character in at least one of said presented groups failing to have the identity characterization common to the group to which said character belongs; (5) determining by inspection the correct identity of a character failing to have the identity characterization common to the group to which said character belongs; (6) developing a corrected identity signal for such correct identity; and (7) utilizing said corrected identity signal for an initially-incorrectly-identified character to create at least part of an output text for said batch of documents.
- the characters represented by said stored scan data signals being initially analyzed by recognition procedures including that of processing the stored character scan data signals by a recognition algorithm to establish initial identifications of the characters;
-
2. For use in a process for recognizing characters on documents wherein at least a portion of a document is scanned to produce and store scan data signals reflecting the presence of character elements at particular positions on the document and presenting the images of the scanned characters on the document;
- the characters represented by said stored scan data signals being initially analyzed by recognition procedures including that of processing the stored character scan data signals by a recognition algorithm;
the method of verifying the identity of characters initially recognized by said procedures comprising the following steps; (1) storing with said scan data signals additional data signals representing images of characters with deliberately incorrect identities; (2) sorting the initially recognized and said additional image data signals into groups with common identity characterizations; (3) simultaneously displaying images of characters of at least one of said sorted groups; (4) determining by inspection the presence of characters in one of said displayed groups failing to have the identity characterization common to said one displayed group; (5) determining by inspection the correct identity of displayed characters failing to have the identity characterization common to said one displayed group; and (6) developing a corrected identity signal for displayed characters determined in step (5) above.
- the characters represented by said stored scan data signals being initially analyzed by recognition procedures including that of processing the stored character scan data signals by a recognition algorithm;
-
3. In a method for verifying initially determined identities of characters, the steps of:
-
storing data representing the images of a number of characters which have been initially identified; storing data representing the images of characters which are purposely incorrectly identified; sorting said stored image data into groups of characters of the same identification; simultaneously displaying a number of character images from a group of the same identification; finding by inspection the presence of characters in the displayed group failing to have the characterization common to that group; and determining whether characters which were purposely-incorrectly-identified were found in the preceding step. - View Dependent Claims (4, 5, 6)
-
Specification