Document character reading system
Abstract
A character reading system that reduces the correction and verification work needed for documents whose characters have been correctly recognized. A remote OCR installed at a local station reads whole image data WIMG of a document and recognizes first character data DATA1 from WIMG on the basis of a first character recognition method; both are stored, via a communication network, in a memory component provided at a central station. A second recognition component at the central station recognizes second character data from the image data WIMG read out of the memory component, on the basis of a second character recognition method that is different from the first. A decision component decides whether the first and second character data match. If they match, the data is output to a host computer as correct character data, that is, third character data. If they do not match, a correction component changes the data to correct character data and, if needed, the correction is verified, after which the data is output to the host computer as corrected correct character data.
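The match-or-correct flow described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the names `Decision`, `decide`, and `process_document` are hypothetical, character data is modeled as plain strings, and the `correct` callable merely stands in for the operator-driven correction component.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Decision:
    matched: bool                 # True when both recognition results agree
    first: str                    # first character data (DATA1)
    second: str                   # second character data
    output: Optional[str] = None  # character data to be sent to the host


def decide(first: str, second: str) -> Decision:
    """Hypothetical decision component: compare the two recognition results."""
    if first == second:
        # Match: the data is treated as correct and needs no operator work.
        return Decision(True, first, second, output=first)
    # Mismatch: no output yet; the data must go through correction.
    return Decision(False, first, second)


def process_document(
    first: str,
    second: str,
    correct: Callable[[str, str], str],
) -> Decision:
    """Route a result to the host directly, or via correction on mismatch."""
    decision = decide(first, second)
    if not decision.matched:
        # Assumption: `correct` returns the operator-corrected character data.
        decision.output = correct(decision.first, decision.second)
    return decision


if __name__ == "__main__":
    agreed = process_document("12345", "12345", correct=lambda a, b: a)
    fixed = process_document("1Z345", "12345", correct=lambda a, b: "12345")
    print(agreed.matched, agreed.output)
    print(fixed.matched, fixed.output)
```

Only mismatched results reach the `correct` callable, which mirrors the abstract's stated aim of sparing operators from reviewing documents that were recognized correctly.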
15 Claims
1. A document character reading system, comprising:
a first data reading component that reads image data from a recording medium such as a document in which the characters to be read are stored, recognizes first character data from said image data on the basis of a first character recognition method, and outputs said image data and first character data;
a second data reading component that checks whether said first character data matches second character data recognized from said image data on the basis of a second character recognition method different from said first character recognition method, and outputs said character data as correctly read character data if there is a match, but outputs said first or second character data as incorrect data if there is no match;
a correction component having a display that receives said image data and incorrect data, and displays said image data as an image and said incorrect data in a character font, for correcting said incorrect data into correct character data while the operator compares the displayed image data and incorrect data;
a memory component that readably stores image data and first character data from said first data reading component, said second character data, and said correct character data;
wherein said image data comprises the whole image data of said document and field image data extracted from images within fields where the characters to be read are written; and
wherein said first character data and second character data are recognized from said field image data.
said first recognition component being included in said OCR in said second data reading component.
13. The document character reading system according to claim 5, wherein a gateway is connected to said LAN, and a host computer is connected to said gateway.
14. The document character reading system according to claim 5, further comprising a verification component connected to said LAN for verifying the character data corrected by said correction component through comparison with said image data.
15. The document character reading system according to claim 2, wherein said second data reading component includes a second recognition component that recognizes character data as second character data from image data on the basis of a second character recognition method, and a decision component that checks said first character data and second character data and decides whether the two sets of character data match or not, wherein said memory component, said first recognition component, said second recognition component, said decision component, and said correction component are linked together via a LAN.
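As an illustration of the data the memory component of claim 1 holds, the sketch below models whole image data, field image data extracted from it, and the three kinds of character data. Everything here is an assumption made for illustration: the claims do not specify an image representation or field geometry, so `Image` is a list of pixel rows, `FieldBox` is a hypothetical rectangle, and `DocumentRecord` is a hypothetical storage layout.

```python
from dataclasses import dataclass, field
from typing import Dict, List

Image = List[List[int]]  # assumed representation: rows of pixel values


@dataclass
class FieldBox:
    """Hypothetical rectangle locating one field within the whole image."""
    top: int
    left: int
    height: int
    width: int


def extract_field(whole: Image, box: FieldBox) -> Image:
    """Cut the field image data out of the whole image data."""
    rows = whole[box.top:box.top + box.height]
    return [row[box.left:box.left + box.width] for row in rows]


@dataclass
class DocumentRecord:
    """Hypothetical record kept by the memory component for one document."""
    whole_image: Image
    field_images: Dict[str, Image] = field(default_factory=dict)
    first_data: Dict[str, str] = field(default_factory=dict)    # first method
    second_data: Dict[str, str] = field(default_factory=dict)   # second method
    correct_data: Dict[str, str] = field(default_factory=dict)  # after correction


if __name__ == "__main__":
    whole = [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9, 10, 11]]
    record = DocumentRecord(whole_image=whole)
    record.field_images["amount"] = extract_field(whole, FieldBox(1, 1, 2, 2))
    print(record.field_images["amount"])
```

Keying each kind of character data by field name keeps the first, second, and corrected results for a field alongside the field image they were recognized from.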
Specification