Systems and methods for mobile image capture and processing
First Claim
1. A method, comprising:
capturing one or more of image data depicting a digital representation of a document and audio data relating to the digital representation of the document;
defining a plurality of candidate edge points within the image data;
removing one or more outlier candidate edge points from the plurality of candidate edge points;
defining a second plurality of candidate edge points excluding the one or more outlier candidate edge points; and
defining four sides of a tetragon based on one or more of the plurality of candidate edge points and the second plurality of candidate edge points, wherein each side of the tetragon corresponds to a different side of the document, and wherein the tetragon bounds the digital representation of the document.
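The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. It assumes several things the claim does not state: that the candidate edge points have already been grouped by document side, that each side is modeled as a least-squares line, and that outliers are points whose distance from that line exceeds a threshold based on the median absolute deviation. All function names and parameters (`fit_line`, `remove_outliers`, `define_tetragon`, `k`, `min_scale`) are hypothetical.

```python
from math import hypot
from statistics import median


def fit_line(points, horizontal):
    """Least-squares line through points, returned as (a, b, c) with a*x + b*y = c.

    Roughly horizontal sides regress y on x; roughly vertical sides regress
    x on y, which avoids infinite slopes. (Assumed model, not from the claim.)
    """
    us = [p[0] if horizontal else p[1] for p in points]
    vs = [p[1] if horizontal else p[0] for p in points]
    n = len(points)
    mu, mv = sum(us) / n, sum(vs) / n
    var = sum((u - mu) ** 2 for u in us)
    cov = sum((u - mu) * (v - mv) for u, v in zip(us, vs))
    slope = cov / var if var else 0.0
    intercept = mv - slope * mu
    if horizontal:  # y = slope*x + intercept  ->  slope*x - y = -intercept
        return (slope, -1.0, -intercept)
    return (-1.0, slope, -intercept)  # x = slope*y + intercept


def residual(line, point):
    """Perpendicular distance from point to the line a*x + b*y = c."""
    a, b, c = line
    return abs(a * point[0] + b * point[1] - c) / hypot(a, b)


def remove_outliers(points, horizontal, k=3.0, min_scale=0.5):
    """Return the 'second plurality': points with outlier candidates excluded.

    A point is treated as an outlier if its residual exceeds
    median + k * MAD; min_scale (in pixels) keeps the threshold from
    collapsing when most residuals are identical. Both are assumptions.
    """
    line = fit_line(points, horizontal)
    res = [residual(line, p) for p in points]
    med = median(res)
    mad = median([abs(r - med) for r in res])
    threshold = med + k * max(mad, min_scale)
    return [p for p, r in zip(points, res) if r <= threshold]


def intersect(l1, l2):
    """Intersection of two lines in a*x + b*y = c form (Cramer's rule)."""
    a1, b1, c1 = l1
    a2, b2, c2 = l2
    det = a1 * b2 - a2 * b1
    if abs(det) < 1e-12:
        raise ValueError("lines are parallel")
    return ((c1 * b2 - c2 * b1) / det, (a1 * c2 - a2 * c1) / det)


def define_tetragon(top, bottom, left, right):
    """Fit one line per side from the filtered points and return four corners
    in the order top-left, top-right, bottom-right, bottom-left."""
    lines = {}
    for name, pts, horiz in (("top", top, True), ("bottom", bottom, True),
                             ("left", left, False), ("right", right, False)):
        kept = remove_outliers(pts, horiz)  # second plurality of points
        lines[name] = fit_line(kept, horiz)
    return [intersect(lines["top"], lines["left"]),
            intersect(lines["top"], lines["right"]),
            intersect(lines["bottom"], lines["right"]),
            intersect(lines["bottom"], lines["left"])]
```

Called with points sampled along the four sides of a page, `define_tetragon` returns the four corner coordinates of the bounding tetragon. Refitting each side only after the outliers are dropped is what keeps a single stray point (for example, one detected on background clutter) from skewing that side.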
Abstract
In various embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. Myriad features enable and/or facilitate processing of such digital images using a mobile device that would otherwise be technically impossible or impractical, and furthermore address unique challenges presented by images captured using a camera rather than a traditional flat-bed scanner, paper-feed scanner, or multifunction peripheral. Particularly advantageous features include robustly detecting edges of one or more documents depicted in the digital image data, and defining/locating document pages at least partially on this basis. The statistical approaches employed enable robust yet computationally efficient techniques to accomplish page detection, and associated functions, using hardware typically included in mobile devices and within practical (especially temporal) limits imposed by device manufacturers, users, and associated and/or downstream computational and/or business processes.
19 Claims
1. A method, comprising:
capturing one or more of image data depicting a digital representation of a document and audio data relating to the digital representation of the document;
defining a plurality of candidate edge points within the image data;
removing one or more outlier candidate edge points from the plurality of candidate edge points;
defining a second plurality of candidate edge points excluding the one or more outlier candidate edge points; and
defining four sides of a tetragon based on one or more of the plurality of candidate edge points and the second plurality of candidate edge points, wherein each side of the tetragon corresponds to a different side of the document, and wherein the tetragon bounds the digital representation of the document.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17
18. A system, comprising:
a processor configured to execute logic; and
logic configured to cause the processor executing the logic to:
capture one or more of image data depicting a digital representation of a document and audio data relating to the digital representation of the document;
define a plurality of candidate edge points within the image data;
remove one or more outlier candidate edge points from the plurality of candidate edge points;
define a second plurality of candidate edge points excluding the one or more outlier candidate edge points; and
define four sides of a tetragon based on one or more of the plurality of candidate edge points and the second plurality of candidate edge points, wherein each side of the tetragon corresponds to a different side of the document; and
output the digital representation of the document and the tetragon to a display of a mobile device, wherein the tetragon bounds the digital representation of the document.
19. A computer program product comprising a computer readable storage medium having computer readable program code stored thereon, the computer readable program code comprising:
computer readable program code configured to cause a processor to:
capture one or more of image data depicting a digital representation of a document and audio data relating to the digital representation of the document;
define a plurality of candidate edge points within the image data;
remove one or more outlier candidate edge points from the plurality of candidate edge points;
define a second plurality of candidate edge points excluding the one or more outlier candidate edge points; and
define four sides of a tetragon based on one or more of the plurality of candidate edge points and the second plurality of candidate edge points, wherein each side of the tetragon corresponds to a different side of the document; and
output the digital representation of the document and the tetragon to a display of a mobile device, wherein the tetragon bounds the digital representation of the document.
Specification