Three-dimensional model construction using unstructured pattern
Abstract
Systems and methods using an unstructured pattern for three-dimensional model construction, such as may be used for processing images of documents, are disclosed. The method generally includes projecting an unstructured infrared pattern onto a target object; capturing a pair of stereoscopic images of the pattern projected onto the target object, such as with infrared cameras; performing signal processing on the pair of stereoscopic images to cross-correlate portions of the stereoscopic images in order to identify corresponding portions in the stereoscopic images; and constructing a three-dimensional model of the target object. The three-dimensional model can be utilized to rectify, e.g., de-warp, an image of the target object, e.g., a page or both facing pages of a bound document.
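As a rough illustration of the last step, constructing a three-dimensional model from the identified correspondences, the sketch below converts horizontal disparities between rectified stereo images into depths with the standard pinhole triangulation relation Z = f * B / d. The abstract does not say how the model is computed, so the function name, the focal length, and the baseline are assumptions made only for this example.

```python
import numpy as np

def disparity_to_depth(disparity_px, focal_px, baseline_m):
    """Depth Z = f * B / d for a rectified stereo pair (NaN where d <= 0)."""
    d = np.asarray(disparity_px, dtype=float)
    depth = np.full(d.shape, np.nan)
    valid = d > 0  # zero or negative disparity carries no usable depth
    depth[valid] = focal_px * baseline_m / d[valid]
    return depth

# Hypothetical numbers: 1000 px focal length, 0.2 m camera baseline.
depths = disparity_to_depth([[100.0, 80.0], [0.0, 50.0]], 1000.0, 0.2)
```

Larger disparities map to smaller depths, so the parts of a curved page that bulge toward the cameras show up as regions of higher disparity.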
25 Claims
1. A method, comprising:
projecting an unstructured pattern onto a target object using a projector, wherein the target object is a pair of pages of a multi-page document;
capturing a pair of stereoscopic images of the unstructured pattern projected onto the target object, wherein one or more stereoscopic cameras capture the pair of stereoscopic images;
generating respective lines in each of the stereoscopic images;
vertically rectifying the pair of stereoscopic images relative to each other to align the respective lines;
performing signal processing on the pair of stereoscopic images to horizontally cross-correlate portions of the pair of stereoscopic images in order to identify corresponding portions in the pair of stereoscopic images, wherein the signal processing is performed utilizing the image data from the stereoscopic images without specification of the pattern;
capturing a pair of readable images each conveying the content of one of the pair of pages, wherein one or more cameras that are different from the one or more stereoscopic cameras capture the pair of readable images; and
compensating for distortion in the pair of readable images by transforming the pair of readable images with an output of the signal processing, wherein performing signal processing includes dividing a first of the two stereoscopic images into regions, a plurality of regions forming each line, each region having a width that is greater than its height and wherein the performing signal processing is performed for each region of the first stereoscopic image.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
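The region-wise horizontal cross-correlation recited above can be sketched as follows. This is a minimal illustration rather than the patented implementation: it assumes the two images are already vertically rectified, uses normalized cross-correlation as the similarity score, and picks arbitrary region dimensions (8 rows by 32 columns) and search range. The function name and all parameters are hypothetical. Note that the matching uses only the image data; the projected pattern itself is never supplied.

```python
import numpy as np

def region_disparities(left, right, region_h=8, region_w=32, max_shift=16):
    """For each wide region of `left`, find the horizontal shift into `right`
    with the highest normalized cross-correlation (rows assumed aligned)."""
    rows, cols = left.shape
    out = np.zeros((rows // region_h, cols // region_w), dtype=int)
    for i in range(rows // region_h):          # one band of regions per line
        for j in range(cols // region_w):      # several regions form each line
            y, x = i * region_h, j * region_w
            patch = left[y:y + region_h, x:x + region_w]
            p = patch - patch.mean()
            best_score, best_shift = -np.inf, 0
            for s in range(-max_shift, max_shift + 1):
                if x + s < 0 or x + s + region_w > cols:
                    continue  # candidate window would leave the image
                cand = right[y:y + region_h, x + s:x + s + region_w]
                c = cand - cand.mean()
                denom = np.sqrt((p * p).sum() * (c * c).sum())
                score = (p * c).sum() / denom if denom > 0 else -np.inf
                if score > best_score:
                    best_score, best_shift = score, s
            out[i, j] = best_shift
    return out

# Synthetic check: a random (unstructured) pattern shifted 5 px to the right.
rng = np.random.default_rng(0)
left = rng.random((32, 128))
right = np.roll(left, 5, axis=1)
shifts = region_disparities(left, right)
```

In this synthetic case every region except those in the last column recovers the 5-pixel shift; the last column cannot test positive shifts because the candidate window would fall outside the image. Because the search is purely horizontal, each region only needs to be compared against windows in the same band of rows, which is what vertical rectification makes possible.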
12. A computer readable storage medium embedded with a computer program product, the computer program product including instructions that, when executed by a processor, cause the processor to perform actions comprising:
receiving a pair of stereoscopic images of an unstructured pattern projected onto a target object, wherein the pair of stereoscopic images are captured from a first pair of perspectives, respectively, and wherein the target object is a curved surface of a document;
generating respective lines in each of the stereoscopic images;
vertically rectifying the pair of stereoscopic images relative to each other to align the respective lines;
performing signal processing on the pair of stereoscopic images to horizontally cross-correlate portions of the pair of stereoscopic images in order to identify corresponding portions in the pair of stereoscopic images, wherein the signal processing is performed utilizing the image data from the stereoscopic images without specification of the unstructured pattern;
receiving an image of the curved surface of the document from a third perspective that is different from either of the first pair of perspectives, wherein the image of the curved surface of the document conveys the content of the document; and
transforming the image of the curved surface of the document into an image of the document as it would appear were it not curved based on a result of the signal processing, wherein performing signal processing includes dividing a first of the two stereoscopic images into regions, a plurality of regions forming each line, each region having a width that is greater than its height and wherein the performing signal processing is performed for each region of the first stereoscopic image.
Dependent claims: 13, 14, 15, 16, 17, 18, 19
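One simple way to picture the final transforming step is arc-length unrolling. The sketch below is an assumption-laden illustration, not the claimed method: it supposes an orthographic top-down view, ignores the mapping from the third camera's perspective, and assumes a page whose height profile varies only across columns (a cylinder-like curl near the binding). The function name and the profile input are hypothetical; in practice the profile would come from the three-dimensional result of the signal processing.

```python
import numpy as np

def flatten_rows(image, height_profile):
    """Resample each row at uniform arc length along the page surface.

    `height_profile[x]` is the surface height (in pixel units) at column x,
    assumed identical for every row (a cylinder-like page)."""
    z = np.asarray(height_profile, dtype=float)
    x = np.arange(z.size, dtype=float)
    # Cumulative arc length along the profile: ds = sqrt(dx^2 + dz^2), dx = 1.
    seg = np.sqrt(1.0 + np.diff(z) ** 2)
    arc = np.concatenate(([0.0], np.cumsum(seg)))
    # Uniformly spaced positions on the unrolled page, one per output pixel.
    n_out = int(np.floor(arc[-1])) + 1
    s_out = np.arange(n_out, dtype=float)
    # Source column for each output pixel (inverse of the arc-length map).
    src_x = np.interp(s_out, arc, x)
    rows = np.asarray(image, dtype=float)
    return np.stack([np.interp(src_x, x, row) for row in rows])

img = np.arange(20, dtype=float).reshape(2, 10)
same = flatten_rows(img, np.zeros(10))             # flat page: unchanged
sloped = flatten_rows(img, 1.0 * np.arange(10))    # 45-degree slope: longer page
```

A flat profile leaves the image untouched, while a sloped or curved profile stretches the foreshortened columns back out to the width they would occupy on a flat page.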
20. A system, comprising:
an infrared projector configured to project an infrared pattern onto a target object, wherein the target object is a multi-page document;
a support configured to position the target object in front of the infrared projector;
a pair of stereoscopic infrared cameras configured to capture stereoscopic images of the infrared pattern projected onto the target object;
a signal processor configured to vertically rectify a pair of stereoscopic images relative to each other to align respective lines of the stereoscopic images, and further configured to perform signal processing on the pair of stereoscopic images to horizontally cross-correlate portions of the pair of stereoscopic images in order to identify corresponding portions in the pair of stereoscopic images and to construct a three-dimensional model of the target object; and
another camera configured to capture an image of the semantic content of a page of the target object, wherein the signal processor is further configured to transform the image of the semantic content of the page based on the three-dimensional model, wherein the signal processor is further configured to divide a first of the two stereoscopic images into regions, a plurality of regions forming each line, each region having a width that is greater than its height, and wherein the signal processor is further configured to perform the signal processing for each region of the first stereoscopic image.
Dependent claims: 21, 22, 23, 24, 25
Specification