Extracting documents from a natural scene image
First Claim
Patent Images
1. A computer-implemented method comprising:
- receiving an input image;
automatically identifying, by one or more computing devices, a non-rectangular, quadrilateral-shaped region within the input image;
mapping, by the one or more computing devices, the non-rectangular, quadrilateral-shaped region of the image to a rectangular-shaped output image; and
providing the output image for processing by an optical character recognition engine.
2 Assignments
0 Petitions
Accused Products
Abstract
The present technology proposes techniques for extracting forms and other types of documents from images taken with a mobile client device. By calculating and making adjustments along a document'"'"'s detected borders, an input image can be transformed such that the document within the image may be properly aligned and background clutter completely removed. The resulting text fields of the extracted document are thus upright, aligned and locatable at predictable points.
27 Citations
18 Claims
-
1. A computer-implemented method comprising:
-
receiving an input image; automatically identifying, by one or more computing devices, a non-rectangular, quadrilateral-shaped region within the input image; mapping, by the one or more computing devices, the non-rectangular, quadrilateral-shaped region of the image to a rectangular-shaped output image; and providing the output image for processing by an optical character recognition engine. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
-
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving an input image; automatically detecting one or more edges of a non-rectangular, quadrilateral-shaped region in the input image; identifying the non-rectangular, quadrilateral-shaped region based on the detected one or more edges; mapping the non-rectangular, quadrilateral-shaped region of the image to a rectangular-shaped output image; and providing the output image for processing by an optical character recognition engine. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving an input image; detecting one or more changes in brightness of one or more portions of the input image; detecting one or more edges of a non-rectangular, quadrilateral-shaped region in the input image based on the detected one or more changes in brightness of one or more portions of the input image; identifying the non-rectangular, quadrilateral-shaped region based on the detected one or more edges; identifying a non-rectangular, quadrilateral-shaped region within the input image; mapping the identified non-rectangular, quadrilateral-shaped region of the image to a rectangular-shaped output image; and providing the output image for processing by an optical character recognition engine. - View Dependent Claims (16, 17, 18)
-
Specification