Method and system for detecting and recognizing text in images
First Claim
1. A computer-implemented method for detecting and recognizing text in an image, the method comprising:
- under the control of one or more computer systems configured with executable instructions,obtaining an output image that differs from an input image in at least one visual aspect, the output image comprising one or more text regions;
separately processing the input image to create at least one binary chip, each binary chip corresponding to a text region of the output image;
generating first output by at least recognizing the text in each binary chip from the text region corresponding to the binary chip using an optical character recognizer;
generating second output by at least separately and independently recognizing the text from the one or more text regions of the output image using the optical character recognizer; and
analyzing at least the generated first output and the generated second output to form consensus output.
0 Assignments
0 Petitions
Accused Products
Abstract
Various embodiments of the present invention relate to a method, system and computer program product for detecting and recognizing text in the images captured by cameras and scanners. First, a series of image-processing techniques is applied to detect text regions in the image. Subsequently, the detected text regions pass through different processing stages that reduce blurring and the negative effects of variable lighting. This results in the creation of multiple images that are versions of the same text region. Some of these multiple versions are sent to a character-recognition system. The resulting texts from each of the versions of the image sent to the character-recognition system are then combined to a single result, wherein the single result is detected text.
-
Citations
20 Claims
-
1. A computer-implemented method for detecting and recognizing text in an image, the method comprising:
under the control of one or more computer systems configured with executable instructions, obtaining an output image that differs from an input image in at least one visual aspect, the output image comprising one or more text regions; separately processing the input image to create at least one binary chip, each binary chip corresponding to a text region of the output image; generating first output by at least recognizing the text in each binary chip from the text region corresponding to the binary chip using an optical character recognizer; generating second output by at least separately and independently recognizing the text from the one or more text regions of the output image using the optical character recognizer; and analyzing at least the generated first output and the generated second output to form consensus output. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
8. A system for detecting and recognizing text in an image, the system comprising:
-
one or more processors; and memory, including instructions that, when collectively executed by the one or more processors, cause the system to at least; generate first output by using an optical character recognizer to recognize the text in at least one binary chip formed at least in part by processing an input image; generate second output by using the optical character recognizer to separately and independently recognize the text in an output image formed at least in part by processing the input image; and analyze at least the generated first output and the generated second output to form consensus output. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer program product for use with a computer, the computer program product comprising a computer usable medium having computer-readable program code embodied therein for detecting and recognizing text in an image, the computer program product performing:
-
obtain an output image formed at least in part by processing an input image, the output region comprising at least one detected text region; obtain at least one binary chip formed at least in part by separately processing the input image; obtain first output by at least recognizing the text in each of the at least one binary chip; obtain second output by at least separately and independently recognizing the text from the text regions of the output image; and analyze at least the generated first output and the generated second output to form consensus output. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification