Image capture systems
First Claim
1. An electronic image capture apparatus comprising:
- an image detecting device adapted to capture a set of sub-images or tiles corresponding to different areas of a document at known locations and a processor adapted to receive the set of sub-images produced by the image detecting device an to process the sub-images to form a machine-readable text document equivalent to the portion of the document covered by the set of sub-images;
wherein;
the processor includes an optical character recognition sub-routine which is adapted to produce a first set of processable data files which each comprises a data set of characters corresponding to characters appearing in a respective sub-image in the set and relative location of the characters in that sub-image;
the processor establishes a co-ordinate system which defines a template of the machine readable document whereby any point in the imaged document can be uniquely identified by co-ordinate of that point in the machine readable text document;
the processor establishes a second co-ordinate system for each sub-image;
the processor, after optical character recognition, stores in the processable data files the characters located in each sub-image along with the location of characters in the second co-ordinate system; and
the processor is adapted to stitch together the characters stored in the data tiles to produce a machine readable text document.
2 Assignments
0 Petitions
Accused Products
Abstract
An electronic image capture apparatus is disclosed comprising: an electronic camera having a detector, a lens having a field of view which is adapted to limit the radiation incident upon the detector to that within the field of view, an actuator for moving the field of view across the document, a control means for controlling the actuator to move the camera across the document so as to obtain a set of overlapping sub-images corresponding to different areas of the document, and electronic processing means adapted to receive the set of sub-images produced by the camera and to process the sub-images to form a composite image of the portion of the document covered by the set of sub-images. A set of processable sub-image files are produced which each comprise a data set of characters corresponding to characters appearing in a respective sub-image in the set and the relative location of the characters in that sub-image. The contents of each of the processable sub-image files are stitched into a blank text document by applying logical operators to the data in the files to produce a complete composite text document containing data indicative of the textual content of the scanned document.
-
Citations
13 Claims
-
1. An electronic image capture apparatus comprising:
-
an image detecting device adapted to capture a set of sub-images or tiles corresponding to different areas of a document at known locations and a processor adapted to receive the set of sub-images produced by the image detecting device an to process the sub-images to form a machine-readable text document equivalent to the portion of the document covered by the set of sub-images;
wherein; the processor includes an optical character recognition sub-routine which is adapted to produce a first set of processable data files which each comprises a data set of characters corresponding to characters appearing in a respective sub-image in the set and relative location of the characters in that sub-image;
the processor establishes a co-ordinate system which defines a template of the machine readable document whereby any point in the imaged document can be uniquely identified by co-ordinate of that point in the machine readable text document;
the processor establishes a second co-ordinate system for each sub-image;
the processor, after optical character recognition, stores in the processable data files the characters located in each sub-image along with the location of characters in the second co-ordinate system; and
the processor is adapted to stitch together the characters stored in the data tiles to produce a machine readable text document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
Specification