Systems and methods for generating composite images of long documents using mobile video data
Abstract
Techniques for capturing long document images and generating composite images therefrom include: detecting a document depicted in image data; tracking a position of the detected document within the image data; selecting a plurality of images, wherein the selection is based at least in part on the tracked position of the detected document; and generating a composite image based on at least one of the selected plurality of images. The tracking and selection are optionally but preferably based in whole or in part on motion vectors estimated at least partially based on analyzing image data such as test and reference frames within the captured video data/images. Corresponding systems and computer program products are also disclosed.
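As a rough illustration of estimating a motion vector between a reference frame and a test frame, the sketch below uses exhaustive block matching on small grayscale frames stored as nested lists. Block matching is swapped in here purely for illustration; the abstract does not say which estimator is used. The names `shift_frame`, `estimate_motion_vector`, and `max_shift` are hypothetical.

```python
def shift_frame(frame, dy, dx, fill=0):
    """Return a copy of frame moved down by dy rows and right by dx columns."""
    height, width = len(frame), len(frame[0])
    shifted = [[fill] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            ty, tx = y + dy, x + dx
            if 0 <= ty < height and 0 <= tx < width:
                shifted[ty][tx] = frame[y][x]
    return shifted


def estimate_motion_vector(reference, test, max_shift=2):
    """Find the (dy, dx) shift that best maps reference onto test.

    Assumption: every candidate shift in [-max_shift, max_shift] is tried,
    and the one with the lowest mean absolute difference over the
    overlapping pixels wins.
    """
    height, width = len(reference), len(reference[0])
    best_shift, best_cost = (0, 0), float("inf")
    for dy in range(-max_shift, max_shift + 1):
        for dx in range(-max_shift, max_shift + 1):
            total, count = 0, 0
            for y in range(height):
                for x in range(width):
                    ty, tx = y + dy, x + dx
                    if 0 <= ty < height and 0 <= tx < width:
                        total += abs(reference[y][x] - test[ty][tx])
                        count += 1
            if count == 0:
                continue  # no overlap at this shift
            cost = total / count
            if cost < best_cost:
                best_cost, best_shift = cost, (dy, dx)
    return best_shift


# Synthetic example: an 8x8 ramp image moved down 1 row and right 2 columns.
reference = [[y * 8 + x for x in range(8)] for y in range(8)]
test = shift_frame(reference, 1, 2)
print(estimate_motion_vector(reference, test))  # prints (1, 2)
```

The recovered vector could then feed position tracking and frame selection, for example by accumulating per-frame shifts into an offset along the document.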
689 Citations
20 Claims
1. A computer program product comprising a non-transitory computer readable medium having stored thereon instructions executable by a processor of a mobile device, the instructions being configured to cause the processor, upon execution thereof, to generate a composite image of a long document with sufficient resolution for downstream processing by:
detecting a long document depicted in image data;
tracking a position of the detected long document within the image data;
selecting a plurality of images, wherein the selection is based at least in part on the tracked position of the detected long document; and
generating a composite image of the long document based on at least two of the selected plurality of images, wherein the composite image of the long document is characterized by a resolution greater than a resolution of any of the selected plurality of images, wherein the resolution of the composite image is at least about 200 dots per inch (DPI) or at least about 200 pixels per inch (PPI).
Dependent claims: 2 to 18.

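The two resolution conditions recited in claim 1 (a composite finer than any selected image, and at least about 200 DPI or PPI) can be expressed as a simple numeric check. The sketch below is only an illustration and not a legal reading of the claim: the function names are hypothetical, and modeling "about" as a 5% tolerance is an assumption made for the example.

```python
def estimate_dpi(pixel_extent, physical_inches):
    """Pixels per inch along one axis, given the document's physical size."""
    if physical_inches <= 0:
        raise ValueError("physical_inches must be positive")
    return pixel_extent / physical_inches


def satisfies_claimed_resolution(composite_dpi, frame_dpis,
                                 minimum_dpi=200.0, tolerance=0.05):
    """Check both resolution conditions for a composite image.

    Assumption: "at least about 200" is modeled as
    minimum_dpi * (1 - tolerance); the 5% tolerance is illustrative only.
    """
    exceeds_frames = all(composite_dpi > dpi for dpi in frame_dpis)
    meets_floor = composite_dpi >= minimum_dpi * (1.0 - tolerance)
    return exceeds_frames and meets_floor


# A 3-inch-wide receipt whose composite is 720 pixels wide.
composite = estimate_dpi(720, 3.0)
print(composite, satisfies_claimed_resolution(composite, [150.0, 160.0]))
```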
19. A mobile device having logic embodied therewith, the logic being configured to cause the mobile device, upon execution thereof, to generate an image of a long document sufficient for downstream processing by:
detecting, using a processor of the mobile device, a long document depicted in image data;
tracking, using the processor of the mobile device, a position of the detected long document within the image data;
selecting, using the processor of the mobile device, a plurality of images, wherein the selection is based at least in part on the tracked position of the detected long document; and
generating, using the processor of the mobile device, a composite image of the long document based on at least two of the selected plurality of images wherein the composite image of the long document is characterized by a resolution greater than a resolution of any of the selected plurality of images, wherein the resolution of the composite image is at least about 200 dots per inch (DPI) or at least about 200 pixels per inch (PPI).

20. A computer-implemented method for generating a composite image of a long document suitable for downstream processing, the method comprising:
tracking, using a processor of a mobile device, a long document within a plurality of frames of image data;
selecting, using the processor, a subset of the plurality of frames of the image data based on the tracking;
generating, using the processor, alignment hypotheses between at least some of the selected subset of frames of image data, wherein the alignment hypotheses are generated based on matching sampled features of one or more reference frames of the image data with sampled features of one or more test frames of the image data;
storing at least some of the selected frames of the image data to a memory of the mobile device; and
generating, using the processor, a composite image of the long document by stitching together at least two of the selected subset of frames;
wherein the at least two of the selected subset of frames are characterized by an overlap greater than a predefined overlap threshold; and
wherein the composite image is characterized by a resolution of at least about 200 dots per inch (DPI) or a resolution of at least about 200 pixels per inch (PPI).
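To make the overlap condition in claim 20 concrete, the following sketch picks a subset of tracked frames so that each consecutive pair overlaps by more than a threshold. It is a minimal sketch under stated assumptions, not the claimed method: frames are assumed to have a fixed length along the scan direction, the tracked offsets are assumed to increase monotonically, and the greedy strategy, the function names, and the 0.3 default threshold are all hypothetical.

```python
def overlap_fraction(offset_a, offset_b, frame_length):
    """Fraction of a frame shared by two frames at the given offsets."""
    shared = max(0, frame_length - abs(offset_b - offset_a))
    return shared / frame_length


def select_frames(offsets, frame_length, min_overlap=0.3):
    """Greedily choose frame indices for stitching.

    Starting from the first frame, repeatedly jump to the farthest later
    frame whose overlap with the current one still exceeds min_overlap.
    Assumes offsets are sorted in increasing order.
    """
    if not offsets:
        return []
    selected = [0]
    current = 0
    while current < len(offsets) - 1:
        candidate = None
        for j in range(current + 1, len(offsets)):
            if overlap_fraction(offsets[current], offsets[j],
                                frame_length) > min_overlap:
                candidate = j
            else:
                break  # later frames overlap even less
        if candidate is None:
            raise ValueError("gap in coverage: no frame overlaps enough")
        selected.append(candidate)
        current = candidate
    return selected


# Frames are 1000 pixels long and were tracked every 200 pixels.
print(select_frames([0, 200, 400, 600, 800, 1000], frame_length=1000))
# prints [0, 3, 5]
```

Choosing the farthest acceptable frame keeps the number of stored frames small while still leaving enough shared content for alignment; a real pipeline would likely also weigh frame sharpness or other quality measures.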
Specification