Systems and methods for mobile image capture and processing
First Claim
1. A computer program product, comprising a non-transitory computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to perform operations comprising:
- capturing one or more of image data depicting a digital representation of a document and audio data relating to the digital representation of the document;
defining a plurality of candidate edge points within the image data;
removing one or more outlier candidate edge points from the plurality of candidate edge points;
defining a second plurality of candidate edge points excluding the one or more outlier candidate edge points; and
defining four sides of a tetragon based on one or more of the plurality of candidate edge points and the second plurality of candidate edge points, wherein defining the plurality of candidate edge points comprises, for each of a plurality of portions of the image data;
calculating one or more statistics corresponding to the portion;
estimating one or more distributions of statistics corresponding to the portion;
determining whether a statistically significant difference exists between one or more of the statistics calculated for the portion and the distribution of statistics estimated for the portion; and
designating a point corresponding to the statistically significant difference as a candidate edge point in response to determining the statistically significant difference exists;
wherein each side of the tetragon corresponds to a different side of the document;
wherein an area of the tetragon comprises at least a threshold percentage of a total area of the digital image;
wherein the tetragon bounds the digital representation of the document;
wherein each portion is characterized by a single center pixel; and
wherein the center pixel is designated as a candidate edge point upon determining the statistically significant difference exists.
6 Assignments
0 Petitions
Accused Products
Abstract
In several embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. The techniques include detecting medical documents and/or documents relevant to an insurance claim by defining candidate edge points based on the captured image data and defining four sides of a tetragon based on at least some of the candidate edge points. In the case of an insurance claim process, the techniques also include determining whether the document is relevant to an insurance claim; and in response to determining the document is relevant to the insurance claim, submitting the image data, information extracted from the image data, or both to a remote server for claims processing. The image capture and processing techniques further facilitate processing of medical documents and/or insurance claims with a plurality of additional features that may be used individually or in combination in various embodiments.
720 Citations
16 Claims
-
1. A computer program product, comprising a non-transitory computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to perform operations comprising:
-
capturing one or more of image data depicting a digital representation of a document and audio data relating to the digital representation of the document; defining a plurality of candidate edge points within the image data; removing one or more outlier candidate edge points from the plurality of candidate edge points; defining a second plurality of candidate edge points excluding the one or more outlier candidate edge points; and defining four sides of a tetragon based on one or more of the plurality of candidate edge points and the second plurality of candidate edge points, wherein defining the plurality of candidate edge points comprises, for each of a plurality of portions of the image data; calculating one or more statistics corresponding to the portion; estimating one or more distributions of statistics corresponding to the portion; determining whether a statistically significant difference exists between one or more of the statistics calculated for the portion and the distribution of statistics estimated for the portion; and designating a point corresponding to the statistically significant difference as a candidate edge point in response to determining the statistically significant difference exists; wherein each side of the tetragon corresponds to a different side of the document; wherein an area of the tetragon comprises at least a threshold percentage of a total area of the digital image; wherein the tetragon bounds the digital representation of the document; wherein each portion is characterized by a single center pixel; and wherein the center pixel is designated as a candidate edge point upon determining the statistically significant difference exists. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system, comprising a processor and logic executable by the processor to cause the processor to perform operations comprising:
-
capturing one or more of image data depicting a digital representation of a document and audio data relating to the digital representation of the document; defining a plurality of candidate edge points within the image data; removing one or more outlier candidate edge points from the plurality of candidate edge points; defining a second plurality of candidate edge points excluding the one or more outlier candidate edge points; defining four sides of a tetragon based on one or more of the plurality of candidate edge points and the second plurality of candidate edge points, wherein defining the plurality of candidate edge points comprises, for each of a plurality of portions of the image data; calculating one or more statistics corresponding to the portion; estimating one or more distributions of statistics corresponding to the portion; determining whether a statistically significant difference exists between one or more of the statistics calculated for the portion and the distribution of statistics estimated for the portion; and designating a point corresponding to the statistically significant difference as a candidate edge point in response to determining the statistically significant difference exists; determining whether the tetragon satisfies one or more quality control metrics; and rejecting the tetragon upon determining the tetragon does not satisfy one or more of the quality control metrics; wherein the quality control metrics comprise one or more of; a Least Mean Squares (LMS) support metric, a minimum tetragon area metric; and a tetragon diagonal intersection location metric; wherein each side of the tetragon corresponds to a different side of the document; wherein the tetragon bounds the digital representation of the document; wherein each portion is characterized by a single center pixel; and wherein the center pixel is designated as a candidate edge point upon determining the statistically significant difference exists.
-
-
16. A computer program product, comprising a non-transitory computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to perform operations comprising:
-
capturing one or more of image data depicting a digital representation of a document and audio data relating to the digital representation of the document; defining a plurality of candidate edge points within the image data; removing one or more outlier candidate edge points from the plurality of candidate edge points; defining a second plurality of candidate edge points excluding the one or more outlier candidate edge points; and defining four sides of a tetragon based on one or more of the plurality of candidate edge points and the second plurality of candidate edge points, wherein each side of the tetragon corresponds to a different side of the document, and wherein the tetragon bounds the digital representation of the document, wherein defining the plurality of candidate edge points comprises, for each of a plurality of portions of the image data; calculating one or more statistics corresponding to the portion; estimating one or more distributions of statistics corresponding to the portion; determining whether a statistically significant difference exists between one or more of the statistics calculated for the portion and the distribution of statistics estimated for the portion; and designating a point corresponding to the statistically significant difference as a candidate edge point in response to determining the statistically significant difference exists; wherein each portion is characterized by a single center pixel; and wherein the center pixel is designated as a candidate edge point upon determining the statistically significant difference exists.
-
Specification