Multiple image input for optical character recognition processing systems and methods
First Claim
1. A computer-readable medium having computer-executable instructions stored thereon for performing the computer-implementable method, the method comprising:
receiving a digital version of an image;
processing the digital version of the image through at least two binarization processes to thereby create a first binarization and a second binarization;
processing the first binarization through a first optical character recognition process to thereby create a first OCR output file, wherein processing the first binarization through a first optical character recognition process comprises compiling first metrics associated with the first OCR output file;
processing the second binarization through the first optical character recognition process to thereby create a second OCR output file, wherein processing the second binarization through the first optical character recognition process comprises compiling second metrics associated with the second OCR output file; and
using the first metrics and the second metrics, at least in part, to select a final OCR output file from among the OCR output files;
further comprising:
processing the first binarization through a second optical character recognition process to thereby create a third OCR output file, wherein processing the first binarization through a second optical character recognition process comprises compiling third metrics associated with the third OCR output file; and
processing the second binarization through the second optical character recognition process to thereby create a fourth OCR output file, wherein processing the second binarization through the second optical character recognition process comprises compiling fourth metrics associated with the fourth OCR output file.
Abstract
A method of processing an image includes receiving a digital version of the image, processing the digital version of the image through at least two binarization processes to thereby create a first binarization and a second binarization, and processing the first binarization through a first optical character recognition process to thereby create a first OCR output file. Processing the first binarization through a first optical character recognition process includes compiling first metrics associated with the first OCR output file. The method also includes processing the second binarization through the first optical character recognition process to thereby create a second OCR output file. Processing the second binarization through the first optical character recognition process includes compiling second metrics associated with the second OCR output file. The method also includes using the metrics, at least in part, to select a final OCR output file from among the OCR output files.
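The abstract does not name the binarization processes. As a minimal sketch only, the following assumes two common choices, a fixed global threshold and Otsu's method, applied to a grayscale image held as rows of 0-255 integers. The function names and the sample image are illustrative assumptions, not taken from the patent.

```python
from typing import List

Gray = List[List[int]]     # rows of 0-255 intensities
Bitonal = List[List[int]]  # rows of 0 (black) / 1 (white)


def binarize_fixed(image: Gray, threshold: int = 128) -> Bitonal:
    """Global fixed threshold: pixels at or above the threshold become white."""
    return [[1 if px >= threshold else 0 for px in row] for row in image]


def otsu_threshold(image: Gray) -> int:
    """Return the level t that maximizes between-class variance (Otsu's method).

    Pixels with intensity <= t form the dark class, the rest the light class.
    """
    hist = [0] * 256
    for row in image:
        for px in row:
            hist[px] += 1
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_dark, sum_dark = 0, 0
    for t in range(256):
        w_dark += hist[t]
        sum_dark += t * hist[t]
        if w_dark == 0:
            continue          # no dark pixels yet
        w_light = total - w_dark
        if w_light == 0:
            break             # every pixel is already in the dark class
        mean_dark = sum_dark / w_dark
        mean_light = (sum_all - sum_dark) / w_light
        # Between-class variance, up to a constant factor of 1 / total**2.
        var = w_dark * w_light * (mean_dark - mean_light) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def binarize_otsu(image: Gray) -> Bitonal:
    """Threshold chosen per image, so faded scans still separate ink from paper."""
    t = otsu_threshold(image)
    return [[1 if px > t else 0 for px in row] for row in image]


# A dark, low-contrast sample: the fixed threshold loses everything,
# while the per-image threshold keeps the lighter pixels.
SAMPLE = [[10, 20, 90], [15, 100, 110]]
first_binarization = binarize_fixed(SAMPLE)
second_binarization = binarize_otsu(SAMPLE)
```

On the sample, the two processes disagree, which is the situation in which feeding both binarizations to the OCR processes and comparing metrics can pay off.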
12 Claims
7. A method of optically recognizing characters in an image, comprising:
providing a computer system, wherein the computer system includes a computer readable medium, the computer readable medium having stored instructions for:
creating multiple binarizations of the image using different binarization techniques;
presenting each of the multiple binarizations to multiple optical character recognition (OCR) engines to produce multiple OCR output files for each binarization;
developing metrics relating to each OCR output file; and
using the metrics, at least in part, to select a final OCR output file from among the OCR output files;
wherein presenting each binarization to an OCR engine comprises presenting each binarization to a different OCR engine.
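The steps of this claim (every binarization presented to every OCR engine, metrics developed per output, one output selected) can be sketched as below. Everything concrete here is an assumption for illustration: the engines are stand-in functions returning canned text rather than real OCR software, the images are one-pixel placeholders, and the single metric used for selection (share of words found in a small word list) is only one plausible choice, since the claim does not specify the metrics.

```python
from itertools import product
from typing import Callable, Dict, List, Tuple

Bitonal = List[List[int]]
OcrEngine = Callable[[Bitonal], str]
Result = Tuple[str, Dict[str, float]]  # (recognized text, metrics)

WORDLIST = {"the", "quick", "brown", "fox"}  # stand-in dictionary (assumption)


def compile_metrics(text: str) -> Dict[str, float]:
    """Develop simple metrics for one OCR output."""
    words = text.lower().split()
    hits = sum(1 for w in words if w in WORDLIST)
    return {
        "word_count": float(len(words)),
        "dictionary_ratio": hits / len(words) if words else 0.0,
    }


def run_grid(
    binarizations: Dict[str, Bitonal], engines: Dict[str, OcrEngine]
) -> Dict[Tuple[str, str], Result]:
    """Present each binarization to each engine and record text plus metrics."""
    results: Dict[Tuple[str, str], Result] = {}
    for (b_name, image), (e_name, engine) in product(
        binarizations.items(), engines.items()
    ):
        text = engine(image)
        results[(b_name, e_name)] = (text, compile_metrics(text))
    return results


def select_final(results: Dict[Tuple[str, str], Result]):
    """Pick the output with the highest dictionary ratio."""
    return max(results.items(), key=lambda kv: kv[1][1]["dictionary_ratio"])


# Hypothetical stand-ins: real engines would read the pixels.
def engine_a(image: Bitonal) -> str:
    return "the quick brown fox" if image[0][0] else "tbe qu1ck brown f0x"


def engine_b(image: Bitonal) -> str:
    return "the quick brovvn fox" if image[0][0] else "the qu1ck brown fox"


DEMO_BINARIZATIONS = {"fixed": [[0]], "adaptive": [[1]]}
DEMO_ENGINES = {"engine_a": engine_a, "engine_b": engine_b}
DEMO_RESULTS = run_grid(DEMO_BINARIZATIONS, DEMO_ENGINES)
DEMO_KEY, (DEMO_TEXT, DEMO_METRICS) = select_final(DEMO_RESULTS)
```

With two binarizations and two engines the grid holds four outputs, and the selected key records which binarization and engine pair produced the final text.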
10. An optical character recognition system, comprising:
at least two binarization processes configured to convert a grayscale image to bitonal images;
at least two different optical character recognition processes configured to process at least two bitonal images derived from the grayscale image into final output files having characters therein;
wherein presenting each binarization to an OCR engine comprises presenting each binarization to a different OCR engine;
further comprising a metrics generation process configured to analyze output files and produce metrics associated therewith;
a voting process configured to select a final output file from among the output files based on the metrics;
a storage arrangement configured to store final output files and serve the information therein to users and at least one processor programmed to execute the at least one optical character recognition processes, the at least one optical character recognition process, metrics generation process, and the voting process.
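The claim does not say how the voting process weighs the metrics. One possible reading, sketched here purely as an assumption, is a plurality vote: each metric casts one ballot for the output file it rates best, and the file with the most ballots becomes the final output. The metric names, candidate labels, and values below are hypothetical.

```python
from collections import Counter
from typing import Dict

Metrics = Dict[str, float]


def vote(candidates: Dict[str, Metrics], higher_is_better: Dict[str, bool]) -> str:
    """Return the candidate favored by the most metrics.

    Each metric votes for its best-scoring candidate. Ties, both within a
    metric and in the final count, go to the candidate listed first.
    """
    if not candidates or not higher_is_better:
        raise ValueError("need at least one candidate and one metric")

    ballots: Counter = Counter()
    for metric, higher in higher_is_better.items():
        chooser = max if higher else min
        pick = chooser(candidates, key=lambda name: candidates[name][metric])
        ballots[pick] += 1

    top = max(ballots.values())
    for name in candidates:  # insertion order gives a deterministic tie-break
        if ballots[name] == top:
            return name
    raise AssertionError("unreachable: some candidate holds the top count")


# Hypothetical metrics for four output files (binarization + engine).
CANDIDATES = {
    "fixed+engine_a": {"dictionary_ratio": 0.25, "mean_confidence": 0.60, "suspect_chars": 3},
    "otsu+engine_a": {"dictionary_ratio": 1.00, "mean_confidence": 0.80, "suspect_chars": 0},
    "fixed+engine_b": {"dictionary_ratio": 0.75, "mean_confidence": 0.85, "suspect_chars": 1},
    "otsu+engine_b": {"dictionary_ratio": 0.75, "mean_confidence": 0.70, "suspect_chars": 1},
}
DIRECTIONS = {"dictionary_ratio": True, "mean_confidence": True, "suspect_chars": False}
FINAL = vote(CANDIDATES, DIRECTIONS)
```

A plurality vote keeps any single noisy metric from deciding the outcome alone; a weighted sum of normalized metrics would be an equally reasonable alternative under the claim language.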
Specification