Method and system for preparing text images for optical-character recognition
First Claim
1. An image-processing subsystem included within a device, appliance, or system that acquires an image containing text with curved text lines and inputs the acquired, text-containing image to the image-processing subsystem to generate a corresponding corrected image in which the text lines are straightened and have a rectilinear organization, the image-processing subsystem comprising:
- a separate hardware layer or a hardware layer shared with the device, appliance, or system that includes one or more processors, one or more memories; and
computer instructions stored in one or more physical data-storage devices that, when executed by one or more of the one or more processors, control the image-processing subsystem to receive the input text-containing image,identify a page sub-image within the text-containing image,correct a vertical perspective of the page sub-image,generate a text-line-curvature model for the page sub-image that associates inclination angles with pixels in the page sub-image,generate local displacements, using the text-line-curvature model, for pixels in the page sub-image from the text-line-curvature model, andtransfer pixels from the page sub-image to a corrected page-sub-image using pixel displacements generated from the local displacements to construct a corrected page sub-image in which the text lines are straightened and in which the text characters and symbols have a rectilinear arrangement.
3 Assignments
0 Petitions
Accused Products
Abstract
The current document is directed to methods and systems that straighten in the text lines of text-containing digital images. Initial processing of a text-containing image identifies the outline of a text-containing page. Next, aggregations of symbols, including words and word fragments, are identified within the outlined page image. The centroids and inclination angles of the symbol aggregations are determined, allowing each symbol aggregation to be circumscribed by a closest-fitting rectangle oriented in conformance with the inclination angle determined for the circumscribed symbol aggregation. A model is constructed for the text-line curvature within the text image based on the circumscribed symbol aggregations and is refined using additional information extracted from the text image. The model, essentially an inclination-angle map, allows for assigning local displacements to pixels within the page image which are then used to straighten the text lines in the text image.
53 Citations
22 Claims
-
1. An image-processing subsystem included within a device, appliance, or system that acquires an image containing text with curved text lines and inputs the acquired, text-containing image to the image-processing subsystem to generate a corresponding corrected image in which the text lines are straightened and have a rectilinear organization, the image-processing subsystem comprising:
-
a separate hardware layer or a hardware layer shared with the device, appliance, or system that includes one or more processors, one or more memories; and computer instructions stored in one or more physical data-storage devices that, when executed by one or more of the one or more processors, control the image-processing subsystem to receive the input text-containing image, identify a page sub-image within the text-containing image, correct a vertical perspective of the page sub-image, generate a text-line-curvature model for the page sub-image that associates inclination angles with pixels in the page sub-image, generate local displacements, using the text-line-curvature model, for pixels in the page sub-image from the text-line-curvature model, and transfer pixels from the page sub-image to a corrected page-sub-image using pixel displacements generated from the local displacements to construct a corrected page sub-image in which the text lines are straightened and in which the text characters and symbols have a rectilinear arrangement. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method carried out by an image-processing subsystem included within a device, appliance, or system that acquires an image containing text with curved text lines and inputs the acquired, text-containing image to the image-processing subsystem to generate a corresponding corrected image in which the text lines are straightened and have a rectilinear organization, the method comprising:
-
receiving the input text-containing image and storing the input text-containing image in a memory; identifying a page sub-image within the text-containing image; correcting a vertical perspective of the page sub-image; generating a text-line-curvature model for the page sub-image that associates inclination angles with pixels in the page sub-image; generating local displacements, using the text-line-curvature model, for pixels in the page sub-image from the text-line-curvature model; and transferring pixels from the page sub-image to a corrected page-sub-image using pixel displacements generated from the local displacements to construct, in a memory, the corrected page sub-image in which the text lines are straightened and in which the text characters and symbols have a rectilinear arrangement.
-
-
22. A non-transitory computer-readable storage medium comprising instructions that, when executed by one or more processors of a hardware layer of an image-processing subsystem that includes one or more processors and one or more memories and that is included within a device, appliance, or system that acquires an image containing text with curved text lines to receive the acquired, text-containing image from the device, appliance, or system and generates a corresponding corrected image in which the text lines are straightened and have a rectilinear organization, causes the one or more processors to perform operations comprising:
-
receiving the input text-containing image and storing the input text-containing image in a memory; identifying a page sub-image within the text-containing image; correcting a vertical perspective of the page sub-image; generating a text-line-curvature model for the page sub-image that associates inclination angles with pixels in the page sub-image; generating local displacements, using the text-line-curvature model, for pixels in the page sub-image from the text-line-curvature model; and transferring pixels from the page sub-image to a corrected page-sub-image using pixel displacements generated from the local displacements to construct, in a memory, the corrected page sub-image in which the text lines are straightened and in which the text characters and symbols have a rectilinear arrangement.
-
Specification