Method and apparatus for performing optical character recognition (OCR) and text stitching
Abstract
A method of generating an electronic text file from a paper-based document that includes a plurality of characters includes capturing a plurality of partially overlapping digital images of the document. Optical character recognition is performed on each one of the plurality of captured digital images, thereby generating a corresponding plurality of electronic text files. Each one of the electronic text files includes a portion of the plurality of characters in the document. The plurality of electronic text files are compared with one another to identify characters that are in common between the electronic text files. The plurality of electronic text files are combined into a combined text file based on the comparison. The combined text file includes the plurality of characters in the document.
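The compare-and-combine steps described above can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes the OCR step has already produced plain strings, that the fragments arrive in reading order, and that the overlapping text was recognized identically in both images (exact matching, no tolerance for OCR errors). The names `overlap_length`, `stitch_texts`, and `min_overlap` are hypothetical and do not appear in the source.

```python
def overlap_length(left: str, right: str, min_overlap: int = 3) -> int:
    """Return the length of the longest suffix of `left` that is also a
    prefix of `right`, or 0 if it is shorter than `min_overlap`."""
    longest = min(len(left), len(right))
    # Try the longest candidate first so the largest shared run wins.
    for size in range(longest, min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return size
    return 0


def stitch_texts(fragments: list[str], min_overlap: int = 3) -> str:
    """Combine ordered, partially overlapping text fragments into one string.

    Assumption: each fragment overlaps the end of the text stitched so far.
    If no overlap is found, the fragment is appended unchanged.
    """
    if not fragments:
        return ""
    combined = fragments[0]
    for fragment in fragments[1:]:
        size = overlap_length(combined, fragment, min_overlap)
        # Append only the characters not already present in `combined`.
        combined += fragment[size:]
    return combined


# Stand-ins for the text files produced by OCR on three overlapping images.
fragments = [
    "The quick brown fox",
    "brown fox jumps over",
    "jumps over the lazy dog",
]
print(stitch_texts(fragments))  # The quick brown fox jumps over the lazy dog
```

Exact suffix/prefix matching is the simplest possible comparison. A practical system would likely need approximate matching to cope with recognition errors in the overlapping region, which this sketch does not attempt.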
17 Claims
1. A method of generating an electronic text file from a paper-based document that includes a plurality of characters, the method comprising:
capturing a plurality of partially overlapping digital images of the document;
performing optical character recognition on each one of the plurality of captured digital images, and thereby generating a corresponding plurality of electronic text files, each one of the electronic text files including a portion of the plurality of characters in the document;
comparing the plurality of electronic text files with one another to identify characters that are in common between the electronic text files; and
combining the plurality of electronic text files into a combined text file based on the comparison, wherein the combined text file includes the plurality of characters in the document.
7. A digital camera comprising:
a lens;
an image sensor for generating a plurality of partially overlapping digital images based on optical images directed onto the image sensor by the lens; and
a controller coupled to the image sensor and configured to perform optical character recognition on the plurality of digital images, and thereby generate an electronic text file for each one of the plurality of digital images, the electronic text file for each digital image including text appearing in the digital image, the controller configured to identify overlapping text between electronic text files and stitch the text in the plurality of text files together based on the identified overlapping text.
12. An electronic device including a digital camera, the electronic device comprising:
a display screen for displaying images captured with the digital camera;
an input device for inputting information into the electronic device; and
a processor configured to perform optical character recognition on digital images captured with the digital camera and generate corresponding electronic text files, the electronic text file for each digital image including text appearing in the digital image, the controller configured to stitch the text from the electronic text files together.
Specification