Method and device for capturing a document
First Claim
1. A method, comprising:
- performing, by a mobile device having at least one processor, a display, and a camera, edge detection within a two-dimensional image of a document to identify edges of the document, the two-dimensional image of the document obtained by the mobile device via the camera and displayed to a user of the mobile device via the display;
determining, by the mobile device, angles between detected edges of the document;
determining, by the mobile device based on the detected edges and the angles determined, a three-dimensional position of the document relative to a position of the mobile device;
determining, by the mobile device, correction information to correct, by relative movement, the three-dimensional position of the document relative to the position of the mobile device;
determining, by the mobile device, guidance information from the correction information; and
providing, by the mobile device, the guidance information to the user of the mobile device, the guidance information guiding the user to perform the relative movement such that when the three-dimensional position of the document is correct relative to the position of the mobile device, the mobile device automatically captures the document.
4 Assignments
0 Petitions
Accused Products
Abstract
A method and device for capturing a positionally corrected image of a document is disclosed. The method comprises the steps of: obtaining a two-dimensional image of the document with a mobile terminal apparatus; performing edge detection within the two-dimensional image to identify edges of the document; determining angles between detected edges; calculating, based on the detected edges and the angles determined, a three-dimensional position of the document relative to a position of the mobile terminal apparatus; calculating correction information to correct, by relative movement, the position of the document relative to the position of the mobile terminal apparatus; providing first guidance information derived from the correction information to a user of the mobile terminal apparatus, guiding the user to perform the relative movement; and capturing a positionally corrected image of the document. Thereby, the document can be captured with a quality sufficient to permit Optical Character Recognition (OCR).
16 Citations
20 Claims
-
1. A method, comprising:
-
performing, by a mobile device having at least one processor, a display, and a camera, edge detection within a two-dimensional image of a document to identify edges of the document, the two-dimensional image of the document obtained by the mobile device via the camera and displayed to a user of the mobile device via the display; determining, by the mobile device, angles between detected edges of the document; determining, by the mobile device based on the detected edges and the angles determined, a three-dimensional position of the document relative to a position of the mobile device; determining, by the mobile device, correction information to correct, by relative movement, the three-dimensional position of the document relative to the position of the mobile device; determining, by the mobile device, guidance information from the correction information; and providing, by the mobile device, the guidance information to the user of the mobile device, the guidance information guiding the user to perform the relative movement such that when the three-dimensional position of the document is correct relative to the position of the mobile device, the mobile device automatically captures the document. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An apparatus, comprising:
-
at least one processor; a display; a camera; non-transitory computer memory; and stored instructions translatable by the at least one processor to perform; identifying edges of a document within a two-dimensional image of the document, the two-dimensional image of the document obtained by the apparatus via the camera and displayed to a user of the apparatus via the display; determining angles between detected edges of the document; determining, based on the detected edges and the angles determined, a three-dimensional position of the document relative to a position of the apparatus; determining correction information to correct, by relative movement, the three-dimensional position of the document relative to the position of the apparatus; determining guidance information from the correction information; and providing the guidance information to the user of the apparatus, the guidance information guiding the user to perform the relative movement such that when the three-dimensional position of the document is correct relative to the position of the apparatus, the apparatus automatically captures the document. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer program product comprising at least one non-transitory computer-readable storage medium including instructions translatable by at least one processor of an apparatus to perform:
-
identifying edges of a document within a two-dimensional image of the document, the two-dimensional image of the document obtained by the apparatus via a camera and displayed to a user of the apparatus via a display; determining angles between detected edges of the document; determining, based on the detected edges and the angles determined, a three-dimensional position of the document relative to a position of the apparatus; determining correction information to correct, by relative movement, the three-dimensional position of the document relative to the position of the apparatus; determining guidance information from the correction information; and providing the guidance information to the user of the apparatus, the guidance information guiding the user to perform the relative movement such that when the three-dimensional position of the document is correct relative to the position of the apparatus, the apparatus automatically captures the document. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification