Certifiable optical character recognition
First Claim
1. A method for ensuring the accuracy of lines of reproduced data electronically reproduced from lines of original data comprising:
- first deriving one or more data-dependent first values from said original data;
second deriving one or more data-dependent second values from said reproduced data;
comparing said first and second values and identifying errors and error locations within said lines of reproduced data based on the difference between said first and second values; and
effecting one or more methods of automatic correction of said identified errors on said reproduced data.
0 Assignments
0 Petitions
Accused Products
Abstract
A certificate, calculated on the text of a document to be reproduced, is included in the document in a form which is machine-readable. Upon optical character recognition scanning, or other digitizing reproduction, the certificate is also scanned and stored for comparison. A new certificate is calculated on the contents of the reproduced document and compared to the scanned certificate. If the two certificates are different, it is known that at least one error exists in the reproduced document. More than one certificate can be provided for a document in order to facilitate error location on the document and error correction therein.
78 Citations
18 Claims
-
1. A method for ensuring the accuracy of lines of reproduced data electronically reproduced from lines of original data comprising:
-
first deriving one or more data-dependent first values from said original data; second deriving one or more data-dependent second values from said reproduced data; comparing said first and second values and identifying errors and error locations within said lines of reproduced data based on the difference between said first and second values; and effecting one or more methods of automatic correction of said identified errors on said reproduced data. - View Dependent Claims (2, 3)
-
-
4. A method for electronically reproducing lines of original data of computer-preprocessed printed documents comprising the steps of:
-
deriving one or more first data-dependent values from said original data; printing said document including said one or more first values; creating an electronic document having lines of reproduced data by optically scanning said printed document and said first values and saving said electronic document and first values; deriving one or more second data-dependent values from said reproduced data of said electronic document; comparing said first and second values and identifying errors and error locations within said lines of reproduced data based on the difference between said first and second values; and automatically correcting said identified errors by applying one or more error correction methods to said reproduced data. - View Dependent Claims (5, 6, 7, 8)
-
-
9. A method for providing an electronic reproduction having lines of reproduced data from a printed document having lines of original data using optical character recognition comprising the steps of:
-
creating a first computer version of said document in a first computer; deriving at least one first digital value for said first version of said document from said original data; creating a printed version of said document by outputting said first version and said at least one first digital value to a printer associated with said first computer; creating a second computer version of said document comprising reproduced data by optically scanning said printed version and said at least one first digital value with a scanner associated with a second computer having software for recognizing said optical characters and said digital value; deriving at least one second digital value for said second computer version of said document from said reproduced data; comparing said at least one first digital value to said at least one second digital value and identifying errors and error locations within said lines of reproduced data based on the difference between said at least one first and second digital values; and effecting automatic error correction of said second computer version by applying one or more error correction methods to said reproduced data. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A method for electronically reproducing a computer-preprocessed printed document having a layout comprising a plurality of original blocks comprising at least multiple text blocks of multiple lines of characters comprising the steps of:
-
deriving at least one first layout-dependent value from the spatial relationship of said plurality of original blocks on said document; printing said multiple text blocks of multiple lines and said at least one first layout-dependent value; creating an electronic document with layout comprising a plurality of reproduced blocks comprising at least multiple text blocks of multiple lines of reproduced characters by optically scanning said multiple lines and said at least one first layout-dependent value; deriving at least one second layout-dependent value from the spatial relationship of said plurality of reproduced blocks on said electronic document; comparing said at least one first layout-dependent value to said at least one second layout-dependent value; and detecting and locating layout errors based upon the difference between said at least one first and second layout-dependent values. - View Dependent Claims (16, 17)
-
-
18. A method for insuring the accuracy of a second document comprising reproduced data sequences reproduced by optically scanning a first document, comprising original data sequences and at least one encoded, machine-readable, first data-dependent value assigned to and scannable on the face of said first document, comprising the steps of:
-
decoding said at least one encoded, machine-readable, first data-dependent value; deriving at least one second data-dependent value for said second document from said reproduced data sequences; comparing said at least one first data-dependent value to said at least one second data-dependent value and identifying errors and error locations within said reproduced data sequences based upon the differences between said first and said second data-dependent values; and effecting automatic error correction on said reproduced data sequences when said first and said second data-dependent values are different.
-
Specification