Methods for mobile image capture and processing of documents
First Claim
1. A method for mobile image capture and processing for documents, comprising:
capturing a first image of a document using the mobile communication device;
automatically detecting the document within the first image using a processor of the communication device, wherein detecting the document within the first image further comprises:
converting the first image to a color icon image, wherein the first image is in color;
reducing color within the color icon image, resulting in a gray-scale icon image; and
determining a plurality of corners belonging to the document within the gray-scale icon image;
geometrically correcting the first image using the processor of the communication device, resulting in a second image;
transmitting the second image to a server;
binarizing the second image using a processor of the server, resulting in a third image;
correcting the orientation of the third image using the processor of the server, resulting in a fourth image;
outputting the fourth image of the document using the processor of the server.
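The detection sub-steps of claim 1 (shrink the color image to an icon, then reduce it to gray-scale) can be illustrated with a minimal sketch. The claim does not say how the icon is produced or how color is reduced, so the block averaging in `to_icon` and the ITU-R BT.601 luma weights in `to_gray` are assumptions, and both function names are hypothetical. Corner detection is not shown.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]          # (R, G, B), each 0..255
ColorImage = List[List[Pixel]]        # rows of pixels
GrayImage = List[List[int]]           # rows of intensities, 0..255


def to_icon(image: ColorImage, factor: int) -> ColorImage:
    """Shrink a color image by averaging each factor x factor block.

    Assumption: a box filter stands in for whatever downscaling the
    patent uses. Trailing rows/columns that do not fill a whole block
    are dropped.
    """
    height, width = len(image), len(image[0])
    icon: ColorImage = []
    for top in range(0, height - factor + 1, factor):
        row = []
        for left in range(0, width - factor + 1, factor):
            block = [image[y][x]
                     for y in range(top, top + factor)
                     for x in range(left, left + factor)]
            n = len(block)
            row.append(tuple(sum(p[c] for p in block) // n for c in range(3)))
        icon.append(row)
    return icon


def to_gray(image: ColorImage) -> GrayImage:
    """Reduce color to gray-scale.

    Assumption: BT.601 luma weights (0.299, 0.587, 0.114) in integer
    arithmetic; the claim only says "reducing color".
    """
    return [[(299 * r + 587 * g + 114 * b) // 1000 for (r, g, b) in row]
            for row in image]
```

Working on the small icon rather than the full image keeps the corner search cheap on a handset; the corners found there would then be scaled back to full-resolution coordinates before the geometric correction step.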
Abstract
The present invention relates to automated document processing and more particularly, to methods and systems for document image capture and processing using mobile devices. In accordance with various embodiments, methods and systems for document image capture on a mobile communication device are provided such that the image is optimized and enhanced for data extraction from the document as depicted. These methods and systems may comprise capturing an image of a document using a mobile communication device; transmitting the image to a server; and processing the image to create a bi-tonal image of the document for data extraction. Additionally, these methods and systems may comprise capturing a first image of a document using the mobile communication device; automatically detecting the document within the image; geometrically correcting the image; binarizing the image; correcting the orientation of the image; correcting the size of the image; and outputting the resulting image of the document.
18 Claims
8. A computer-implemented method for mobile image capture and processing for documents, wherein one or more processors are programmed to perform the steps comprising:
receiving a color image of a document from a mobile device, the color image being captured using an image capture device of the mobile device;
detecting the financial document in the color image;
geometrically correcting the color image and generating a gray-scale image;
binarizing the gray-scale image to produce a bi-tonal image;
correcting the orientation of the bi-tonal image;
correcting the size of the bi-tonal image using the processor of the server, resulting in a size-corrected bi-tonal image, wherein correcting the size of the bi-tonal image comprises:
reading a relevant object of a known position on the document within the bi-tonal image;
computing an average width of the relevant object;
computing a scaling factor based on the average width of the relevant object;
using the scaling factor to determine whether the bi-tonal image needs a size correction;
applying a size correction to the color image, resulting in a resized image;
geometrically correcting the resized image, resulting in a corrected image; and
outputting the corrected image of the document.
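The size-correction sub-steps of claim 8 reduce to a small calculation, sketched below. The claim does not define the scaling factor or the decision rule, so this sketch assumes the factor is the ratio of an expected width to the measured average width, and that a correction is needed when the factor differs from 1 by more than a tolerance. The names `expected_width` and `tolerance` and all three functions are hypothetical.

```python
from typing import Sequence


def average_width(widths: Sequence[float]) -> float:
    """Average measured width of the relevant object's elements, in pixels."""
    if not widths:
        raise ValueError("no widths measured")
    return sum(widths) / len(widths)


def scaling_factor(widths: Sequence[float], expected_width: float) -> float:
    """Assumed definition: expected width divided by measured average width."""
    return expected_width / average_width(widths)


def needs_size_correction(factor: float, tolerance: float = 0.05) -> bool:
    """Assumed rule: correct only when the factor is not close to 1."""
    return abs(factor - 1.0) > tolerance
```

For example, measured widths of 20, 22 and 18 pixels average to 20; with an expected width of 25 pixels the factor is 1.25, which exceeds the assumed 5% tolerance, so the color image would be resized by that factor before being geometrically corrected again.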
11. A computer program product comprising a non-transitory computer readable medium having computer program code embodied therein for enabling a computing device to perform the operations of:
receiving a color image of a document from a mobile device, the color image being captured using an image capture device of the mobile device;
detecting the financial document in the color image;
geometrically correcting the color image and generating a gray-scale image;
binarizing the gray-scale image to produce a bi-tonal image;
correcting the orientation of the bi-tonal image;
correcting the size of the bi-tonal image using the processor of the server, resulting in a size-corrected bi-tonal image, wherein correcting the size of the bi-tonal image comprises:
reading a relevant object of a known position on the document within the bi-tonal image;
computing an average width of the relevant object of a known position;
computing a scaling factor based on the average width of the relevant object of a known position;
using the scaling factor to determine whether the fourth image needs a size correction;
applying a size correction to the color image, resulting in a resized image;
geometrically correcting the resized image, resulting in the geometrically corrected resized image;
binarizing the geometrically corrected resized image, resulting in the size-corrected bi-tonal image; and
outputting the size-corrected bi-tonal image.
12. A computer-implemented method for image capture and processing of financial documents, wherein one or more processors are programmed to perform steps comprising:
receiving a color image of a financial document from a mobile device, the color image being captured using an image capture device of the mobile device;
detecting the financial document in the color image;
geometrically correcting the color image and generating a gray-scale image, geometrically correcting the color image and generating a geometrically corrected gray-scale image by:
reducing the color depth of the color image, wherein reducing the color depth of the color image further comprises eliminating over-sharpening in the color image, eliminating small dark objects in the color image, and suppressing high local contrast objects in the color image;
generating a gray-scale image corresponding to the color image; and
applying perspective transformation to the interior of a quadrangle comprising found document corner positions included in the gray-scale image,
binarizing the gray-scale image to produce a bi-tonal image,
correcting the orientation of the bi-tonal image, and
correcting the size of the bi-tonal image.
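The perspective transformation step in claim 12 maps the interior of the quadrangle spanned by the four found corners onto an upright rectangle. The sketch below is one standard way to do this (a projective transform solved from four point pairs, with nearest-neighbor sampling); the claim does not specify the method, so the approach and every name here (`solve_linear`, `homography`, `apply_homography`, `rectify`) are assumptions.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def solve_linear(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a*x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate quadrangle")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= f * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def homography(src: Sequence[Point], dst: Sequence[Point]) -> List[float]:
    """Eight coefficients of the projective map sending src[i] to dst[i]."""
    a, b = [], []
    for (u, v), (x, y) in zip(src, dst):
        a.append([u, v, 1, 0, 0, 0, -u * x, -v * x]); b.append(x)
        a.append([0, 0, 0, u, v, 1, -u * y, -v * y]); b.append(y)
    return solve_linear(a, b)


def apply_homography(h: Sequence[float], u: float, v: float) -> Point:
    w = h[6] * u + h[7] * v + 1.0
    return ((h[0] * u + h[1] * v + h[2]) / w,
            (h[3] * u + h[4] * v + h[5]) / w)


def rectify(gray: List[List[int]], corners: Sequence[Point],
            out_w: int, out_h: int) -> List[List[int]]:
    """Fill an out_w x out_h image from the quadrangle's interior.

    corners are ordered top-left, top-right, bottom-right, bottom-left.
    Each output pixel is mapped back into the source and sampled with
    nearest-neighbor lookup (clamped to the image bounds).
    """
    rect = [(0, 0), (out_w - 1, 0), (out_w - 1, out_h - 1), (0, out_h - 1)]
    h = homography(rect, corners)
    max_x, max_y = len(gray[0]) - 1, len(gray) - 1
    out = []
    for v in range(out_h):
        row = []
        for u in range(out_w):
            x, y = apply_homography(h, u, v)
            xi = min(max(int(round(x)), 0), max_x)
            yi = min(max(int(round(y)), 0), max_y)
            row.append(gray[yi][xi])
        out.append(row)
    return out
```

Mapping output pixels back into the source (rather than pushing source pixels forward) guarantees that every pixel of the corrected image receives a value, at the cost of some aliasing that a production implementation would likely reduce with interpolation.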
13. A computer-implemented method for image capture and processing of financial documents, wherein one or more processors are programmed to perform steps comprising:
receiving a color image of a financial document from a mobile device, the color image being captured using an image capture device of the mobile device;
detecting the financial document in the color image;
geometrically correcting the color image and generating a gray-scale image;
binarizing the gray-scale image to produce a bi-tonal image, wherein binarizing the geometrically corrected image to produce a bi-tonal image further comprises:
choosing a pixel on the gray-scale image;
computing an average value and standard deviation for the chosen pixel;
determining whether the standard deviation is too small, and if it is too small then converting the chosen pixel to white, and if it is not too small then converting the chosen pixel to black or white based on the intensity; and
choosing another pixel and repeating the computing and determining steps until there are no more pixels to choose;
correcting the orientation of the bi-tonal image; and
correcting the size of the bi-tonal image.
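The per-pixel binarization loop in claim 13 can be sketched as follows. The claim does not say over what region the average and standard deviation are computed, what counts as "too small", or how intensity decides black versus white. This sketch assumes a square window around the pixel, a fixed `min_std` threshold, and a comparison of the pixel against the window mean; `binarize`, `radius`, and `min_std` are hypothetical names.

```python
from math import sqrt
from typing import List


def binarize(gray: List[List[int]], radius: int = 1,
             min_std: float = 8.0) -> List[List[int]]:
    """Return a bi-tonal image (0 = black, 255 = white).

    For each pixel, the mean and standard deviation of the surrounding
    (2*radius + 1)-wide window are computed (window clipped at borders).
    Low-variation neighborhoods are treated as background and set to
    white; otherwise the pixel is black if darker than the local mean.
    """
    height, width = len(gray), len(gray[0])
    out = []
    for y in range(height):
        row = []
        for x in range(width):
            window = [gray[j][i]
                      for j in range(max(0, y - radius), min(height, y + radius + 1))
                      for i in range(max(0, x - radius), min(width, x + radius + 1))]
            mean = sum(window) / len(window)
            std = sqrt(sum((p - mean) ** 2 for p in window) / len(window))
            if std < min_std:
                row.append(255)                      # flat region: white
            else:
                row.append(0 if gray[y][x] < mean else 255)
        out.append(row)
    return out
```

Treating flat regions as white regardless of their brightness is what lets this kind of local thresholding suppress shadows and uneven lighting: only neighborhoods that actually contain contrast, such as printed text, can produce black pixels.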
14. A computer-implemented method for image capture and processing of financial documents, wherein one or more processors are programmed to perform steps comprising:
receiving a color image of a financial document from a mobile device, the color image being captured using an image capture device of the mobile device;
detecting the financial document in the color image, wherein detecting the financial document in the color image further comprises:
converting the color image into a smaller color image;
performing color depth reduction on the smaller color image to generate a small gray-scale image; and
determining document corner positions for the financial document in the small gray-scale image;
geometrically correcting the color image and generating a gray-scale image,
binarizing the gray-scale image to produce a bi-tonal image,
correcting the orientation of the bi-tonal image, and
correcting the size of the bi-tonal image.
Specification