Text Image Quality Based Feedback For Improving OCR
First Claim
1. A method to improve text recognition by using multiple images of identical text, the method comprising:
- capturing a plurality of images of a scene of real world at a plurality of zoom levels, said scene of real world containing text of one or more sizes;
extracting from each of the plurality of images, one or more text regions;
analyzing an attribute that is relevant to OCR in one or more versions of a first text region as extracted from one or more of said plurality of images; and
when the attribute has a value that meets a limit of optical character recognition (OCR) in a version of the first text region, providing the version of the first text region as input to OCR.
1 Assignment
0 Petitions
Accused Products
Abstract
An electronic device and method capture multiple images of a scene of real world at a several zoom levels, the scene of real world containing text of one or more sizes. Then the electronic device and method extract from each of the multiple images, one or more text regions, followed by analyzing an attribute that is relevant to OCR in one or more versions of a first text region as extracted from one or more of the multiple images. When an attribute has a value that meets a limit of optical character recognition (OCR) in a version of the first text region, the version of the first text region is provided as input to OCR.
-
Citations
28 Claims
-
1. A method to improve text recognition by using multiple images of identical text, the method comprising:
-
capturing a plurality of images of a scene of real world at a plurality of zoom levels, said scene of real world containing text of one or more sizes; extracting from each of the plurality of images, one or more text regions; analyzing an attribute that is relevant to OCR in one or more versions of a first text region as extracted from one or more of said plurality of images; and when the attribute has a value that meets a limit of optical character recognition (OCR) in a version of the first text region, providing the version of the first text region as input to OCR. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. At least one non-transitory computer readable storage media comprising a plurality of instructions to be executed by at least one processor to correct skew in an image of a scene of real world, the plurality of instructions comprising:
-
first instructions to capture a plurality of images of a scene of real world at a plurality of zoom levels, said scene of real world containing text of one or more sizes; second instructions to extract from each of the plurality of images, one or more text regions; third instructions to analyze an attribute that is relevant to OCR in one or more versions of a first text region as extracted from one or more of said plurality of images; and fourth instructions to provide the version of the first text region as input to OCR, when the attribute has a value that meets a limit of optical character recognition (OCR) in a version of the first text region. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21)
-
-
22. A mobile device to decode text in real world images, the mobile device comprising:
-
a camera; a memory operatively connected to the camera to receive at least an image therefrom, the image comprising one or more text regions; at least one processor operatively connected to the memory to execute a plurality of instructions stored in the memory; wherein the plurality of instructions cause the at least one processor to; capture a plurality of images of a scene of real world at a plurality of zoom levels, said scene of real world containing text of one or more sizes; extract from each of the plurality of images, one or more text regions; analyze an attribute that is relevant to OCR in one or more versions of a first text region as extracted from one or more of said plurality of images; and when the attribute has a value that meets a limit of optical character recognition (OCR) in a version of the first text region, provide the version of the first text region as input to OCR. - View Dependent Claims (23, 24, 25, 26, 27)
-
-
28. A mobile device comprising:
-
a camera configured to capture a plurality of images of a scene of real world at a plurality of zoom levels, said scene of real world containing text of one or more sizes; a memory coupled to the camera for storing the plurality of images; means, coupled to the memory, for extracting from each of the plurality of images, one or more text regions; means for analyzing an attribute that is relevant to OCR in one or more versions of a first text region as extracted from one or more of said plurality of images; and responsive to the attribute having a value that meets a limit of optical character recognition (OCR) in a version of the first text region, means for providing the version of the first text region as input to OCR.
-
Specification