Assist channel coding using a rewrite model
Abstract
A system encodes a separate assist channel that carries only a small amount of additional information in a hardcopy document to compensate for failure of an OCR system to accurately reconstruct a scanned electronic version of the hardcopy document. After the OCR system produces an interpretation of the scanned electronic version of the hardcopy document, a rewrite system that is guided by the assist channel operates independent from the OCR system to correct decoding errors produced by the OCR system.
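As an illustration of the idea in the abstract, the sketch below shows how a few extra bits per text line could flag an OCR decoding error. The checksum scheme, the function names `assist_value` and `is_consistent`, and the constants 31 and 251 are assumptions made for this example; the abstract does not say what the assist channel actually carries.

```python
# Hypothetical assist channel: a small position-sensitive checksum per text line.
MULTIPLIER = 31   # assumed constant, not taken from the patent
MODULUS = 251     # assumed constant; the value fits in one byte


def assist_value(line: str) -> int:
    """Compute the value that would be printed in the machine-readable channel."""
    state = 0
    for ch in line:
        state = (state * MULTIPLIER + ord(ch)) % MODULUS
    return state


def is_consistent(decoded_line: str, assist: int) -> bool:
    """Check an OCR result against the assist value recovered from the page."""
    return assist_value(decoded_line) == assist


# Encoding side: the value is computed from the true text before printing.
assist = assist_value("modern")

# Decoding side: a typical OCR confusion ("rn" read as "m") is detected.
print(is_consistent("modern", assist))  # True
print(is_consistent("modem", assist))   # False
```

A check like this only detects that the decoded line is wrong. Correcting it is the job of the separate rewrite step guided by the assist channel.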
18 Claims
1. A method for decoding image data for a hardcopy document, comprising:
recording a scanned representation of the hardcopy document that includes a primary set of symbol data and a secondary set of encoding data;
the primary set of symbol data providing a first channel of human readable information rendered on the hardcopy document;
the secondary set of encoding data providing a second channel of machine readable information rendered on the hardcopy document;
receiving a decoded form of the scanned representation of the hardcopy document from a decoding module to define a candidate set of symbol data; and
rewriting, independent of the decoding module, the candidate set of symbol data using an event library and the secondary set of encoding data; and
computing a shortest path of a product graph of the candidate set of symbol data and the secondary set of encoding data;
the event library identifying likely failures encountered when the scanned representation of the hardcopy document is decoded;
the event library comprising a rule that represents a transformation.
Dependent claims: 2, 3, 4, 5, 6, 7, 8.
9. An apparatus for decoding image data for a hardcopy document, comprising:
means for recording a scanned representation of the hardcopy document that includes a primary set of symbol data and a secondary set of encoding data;
the primary set of symbol data providing a first channel of human readable information rendered on the hardcopy document;
the secondary set of encoding data providing a second channel of machine readable information rendered on the hardcopy document;
means for receiving a decoded form of the scanned representation of the hardcopy document from a decoding module to define a candidate set of symbol data; and
means for rewriting, independent of the decoding module, the candidate set of symbol data using an event library and the secondary set of encoding data; and
computing a shortest path of a product graph of the candidate set of symbol data and the secondary set of encoding data;
the event library identifying likely failures encountered when the scanned representation of the hardcopy document is decoded;
the event library comprising a rule that represents a transformation.
10. An apparatus for decoding image data for a hardcopy document, comprising:
a scanner for recording a scanned representation of the hardcopy document that includes a primary set of symbol data and a secondary set of encoding data;
the primary set of symbol data providing a first channel of human readable information rendered on the hardcopy document;
the secondary set of encoding data providing a second channel of machine readable information rendered on the hardcopy document;
a decoding module coupled to the scanner for providing a decoded form of the scanned representation of the hardcopy document to define a candidate set of symbol data; and
a rewrite module for rewriting, independent of the decoding module, the candidate set of symbol data using an event library and the secondary set of encoding data; and
computing a shortest path of a product graph of the candidate set of symbol data and the secondary set of encoding data;
the event library identifying likely failures encountered when the scanned representation of the hardcopy document is decoded;
the event library comprising a rule that represents a transformation.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18.
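The rewriting and shortest-path steps recited in the independent claims can be sketched as a search over a product graph. This is one possible reading, not the patented implementation: the event library entries, the rule costs, the rolling checksum standing in for the secondary encoding data, and the names `EVENT_LIBRARY`, `checksum`, and `rewrite` are all hypothetical. Each node pairs a position in the candidate text with a checksum state. Each edge either keeps a decoded character at no cost or applies a rule at its cost. Dijkstra's algorithm then finds the cheapest rewrite whose checksum matches the assist value.

```python
import heapq
from typing import List, Optional, Tuple

MULTIPLIER = 31   # assumed checksum constants, not taken from the patent
MODULUS = 251

# Hypothetical event library: (text as decoded, text as intended, cost).
# Each rule is a transformation that undoes a likely decoding failure.
EVENT_LIBRARY: List[Tuple[str, str, int]] = [
    ("m", "rn", 1),   # "rn" misread as "m"
    ("l", "1", 1),    # digit one misread as lowercase L
    ("O", "0", 1),    # zero misread as capital O
]


def advance(state: int, text: str) -> int:
    """Feed `text` through the position-sensitive checksum automaton."""
    for ch in text:
        state = (state * MULTIPLIER + ord(ch)) % MODULUS
    return state


def checksum(text: str) -> int:
    return advance(0, text)


def rewrite(candidate: str, assist: int) -> Optional[str]:
    """Return the cheapest rewrite of `candidate` whose checksum equals `assist`.

    Nodes of the product graph are (position in candidate, checksum state).
    Returns None when no sequence of library rules satisfies the assist value.
    """
    goal = (len(candidate), assist)
    heap = [(0, 0, 0, "")]            # (cost, position, state, output so far)
    settled = set()
    while heap:
        cost, pos, state, out = heapq.heappop(heap)
        if (pos, state) == goal:
            return out
        if (pos, state) in settled or pos == len(candidate):
            continue
        settled.add((pos, state))
        # Edge 1: keep the decoded character unchanged at zero cost.
        ch = candidate[pos]
        heapq.heappush(heap, (cost, pos + 1, advance(state, ch), out + ch))
        # Edge 2: apply any library rule whose decoded form matches here.
        for decoded, intended, rule_cost in EVENT_LIBRARY:
            if candidate.startswith(decoded, pos):
                heapq.heappush(heap, (cost + rule_cost, pos + len(decoded),
                                      advance(state, intended), out + intended))
    return None


assist = checksum("modern")        # value printed in the machine readable channel
print(rewrite("modem", assist))    # modern
```

The search runs on the decoding module's output alone, which mirrors the claim language that rewriting is independent of the decoding module. A position-sensitive checksum is used because a plain character sum could not tell "modern" from "rnodem".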
Specification