System and method for correction of optical character recognition with display of image segments according to character data
First Claim
1. A data entry system for generating an electronically stored coded representation of a character sequence from one or more electronically stored document images, comprising:
- an optical character recognition logic means for generating, from a document image and storing in a data base, character data specifying one of a plurality of possible character values for corresponding segments of the document image;
the optical character recognition means generating a high or low confidence factor for each character;
preprocessing means for generating composite images of possible character values for corresponding segments of a plurality of document images and pointers to characters in the data base for updating such character data;
the preprocessing means arranging the segments in one type of the composite images which consists of groups of segments for which the character data specifies the same character value and has a high confidence factor;
an interactive display means for generating and sequentially displaying, one or more types of composite images, each composite image comprising segments of the document image arranged according to like character data;
the interactive display means having a correction mechanism responsive to a user input operation to enable the operator to correct the character data associated with the displayed segments; and
post processing means for updating character data in the data base corrected by the operator using the pointer for the character data in the data base.
1 Assignment
0 Petitions
Accused Products
Abstract
A data entry system generates an electronically stored coded representation of a character sequence from one or more electronically stored document images. The system comprising optical character recognition logic for generating, from the document image or images, character data specifying one of a plurality of possible character values for corresponding segments of the document images. The system also has an interactive display means for generating and sequentially displaying, one or more types of composite image, each composite image comprising segments of the document image or images arranged according to the character data, and a correction mechanism responsive to a user input operation to enable the operator to correct the character data associated with displayed segments.
-
Citations
8 Claims
-
1. A data entry system for generating an electronically stored coded representation of a character sequence from one or more electronically stored document images, comprising:
-
an optical character recognition logic means for generating, from a document image and storing in a data base, character data specifying one of a plurality of possible character values for corresponding segments of the document image; the optical character recognition means generating a high or low confidence factor for each character; preprocessing means for generating composite images of possible character values for corresponding segments of a plurality of document images and pointers to characters in the data base for updating such character data; the preprocessing means arranging the segments in one type of the composite images which consists of groups of segments for which the character data specifies the same character value and has a high confidence factor; an interactive display means for generating and sequentially displaying, one or more types of composite images, each composite image comprising segments of the document image arranged according to like character data; the interactive display means having a correction mechanism responsive to a user input operation to enable the operator to correct the character data associated with the displayed segments; and post processing means for updating character data in the data base corrected by the operator using the pointer for the character data in the data base. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of operating a data entry apparatus to generate an electronically stored coded representation of a character sequence from an electronically stored image of a document, the method comprising the steps:
-
generating in optical character means character data specifying one of a plurality of possible character values for corresponding segments of the document images using optical character recognition logic; the character data comprising a confidence value for each specified character value, the confidence value being indicative of the likelihood that the character code associated therewith is correct, the method further comprised in the steps; storing the character data and generating a pointer to the character data in a data base; generating and sequentially displaying, one or more types of composite images from the character values and the document image in a preprocessing module, each composite image comprising the segments of the document image arranged according to the character data; generating and sequentially displaying a plurality of composite images of a first or exemption data entry type wherein the arrangement of the segments is into groups of segments for which the character data specifies the same character value; selecting segments displayed in the first type of composite image and setting the corresponding confidence value to indicate a high likelihood of the specified character value being correct; generating and sequentially displaying a plurality of composite images of a second or memory span data entry type comprising segments having a confidence value indicating a low likelihood associated therewith; selecting segments of the document image displayed as part of the composite images of the second type; correcting the character data corresponding to the selected segments to generate the coded representation; and updating the character data in the data base using the corrected character data and the pointer to the character data. - View Dependent Claims (8)
-
Specification