Combining narrow-baseline and wide-baseline stereo for three-dimensional modeling
First Claim
1. A method for determining a three-dimensional model from a plurality of images taken of a geographic region by one or more cameras from different perspectives, comprising:
- (a) determining, using a first stereo reconstruction technique;
(i) a plurality of three-dimensional candidate surface points from the plurality of images, and (ii) which of the images in the plurality of images view each of the plurality of candidate surface points;
(b) identifying an empty space between each of the plurality of candidate surface points and each camera model for the respective images determined in (a) to view the candidate surface point;
(c) for each of a plurality of pairs of images from the plurality of images, determining, using a second stereo reconstruction technique, a surface estimate for the pair of images, the determining a surface estimate (c) for the pair of images comprising;
(i) determining a likelihood of each point in a volume of interest being a surface point based on an empty space identified in (b) such that the likelihood of the point being a surface point is less when the point is inside the empty space than when the point is outside the empty space; and
(ii) determining a surface estimate for the pair of images based on the likelihoods determined in (c)(i),wherein a distance between positions of cameras used to generate the pair of images for the second stereo reconstruction technique is less than a distance between positions of cameras used to generate a pair of images used in the first stereo reconstruction technique; and
(d) merging the surface estimates determined in (c) to identify a final surface.
2 Assignments
0 Petitions
Accused Products
Abstract
In an embodiment, a method determines a three-dimensional model from a plurality of images taken of a geographic region by one or more cameras from different perspectives. The method includes determining, using a first stereo reconstruction technique: (i) a plurality of three-dimensional candidate surface points from the plurality of images, and (ii) which of the images in the plurality of images view each of the plurality of candidate surface points. The method also includes identifying an empty space between each of the plurality of candidate surface points and each camera model for the respective images determined to view the candidate surface point. The method further includes for each of a plurality of pairs of images from the plurality of images, determining, using a second stereo reconstruction technique, a surface estimate for the pair of images. The method also includes merging the surface estimates to identify a final surface.
31 Citations
21 Claims
-
1. A method for determining a three-dimensional model from a plurality of images taken of a geographic region by one or more cameras from different perspectives, comprising:
-
(a) determining, using a first stereo reconstruction technique;
(i) a plurality of three-dimensional candidate surface points from the plurality of images, and (ii) which of the images in the plurality of images view each of the plurality of candidate surface points;(b) identifying an empty space between each of the plurality of candidate surface points and each camera model for the respective images determined in (a) to view the candidate surface point; (c) for each of a plurality of pairs of images from the plurality of images, determining, using a second stereo reconstruction technique, a surface estimate for the pair of images, the determining a surface estimate (c) for the pair of images comprising; (i) determining a likelihood of each point in a volume of interest being a surface point based on an empty space identified in (b) such that the likelihood of the point being a surface point is less when the point is inside the empty space than when the point is outside the empty space; and (ii) determining a surface estimate for the pair of images based on the likelihoods determined in (c)(i), wherein a distance between positions of cameras used to generate the pair of images for the second stereo reconstruction technique is less than a distance between positions of cameras used to generate a pair of images used in the first stereo reconstruction technique; and (d) merging the surface estimates determined in (c) to identify a final surface. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system for determining a three-dimensional model from a plurality of images taken of a geographic region by one or more cameras from different perspectives, comprising:
-
at least one computing device; a wide-baseline stereo engine, implemented on the at least one computing device, configured to determine;
(i) a plurality of three-dimensional candidate surface points from the plurality of images, and (ii) which of the images in the plurality of images view each of the plurality of candidate surface points;an empty space identifier, implemented on the at least one computing device, configured to identify an empty space between each of the plurality of candidate surface points and each camera model for the respective images determined by the wide-baseline stereo engine to view the candidate surface point; a volume of interest identifier, implemented on the at least one computing device, configured to, for each of a plurality of pairs of images from the plurality of images, identify a volume of interest in the pair of images; a voxel grid, implemented on the at least one computing device, configured to store photo-consistency scores for each point in the volume of interest; a narrow-baseline stereo engine, implemented on the at least one computing device, configured to, for each of a plurality of pairs of images from the plurality of images;
(i) determine a plane in the volume of interest, (ii) project images determined to view the point onto the plane, (iii) compute the photo-consistency score for each point in the plane based on the projection, wherein a photo-consistency score of a first point inside the empty space is less than a photo-consistency score of a second point outside the empty space, and (iv) determine a surface estimate based on the photo-consistency scores in the voxel grid, wherein the surface estimate includes photo-consistency scores that satisfy a threshold; anda merging engine, implemented on the at least one computing device, configured to merge the surface estimates determined by the narrow-baseline stereo engine to determine a final surface. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A computer readable storage medium encoded with a computer program, the program comprising instructions that when executed by one or more processors cause the one or more processors to perform operations to:
-
(a) determine, using a first stereo reconstruction technique;
(i) a plurality of three-dimensional candidate surface points from the plurality of images, and (ii) which of the images in the plurality of images view each of the plurality of candidate surface points;(b) identify an empty space between each of the plurality of candidate surface points and each camera model for the respective images determined in (a) to view the candidate surface point; (c) for each of a plurality of pairs of images from the plurality of images, determine, using a second stereo reconstruction technique, a surface estimate for the pair of images, wherein the program comprising instructions that when executed by one or more processors cause the one or more processors to perform operations to determine, using a second stereo reconstruction technique, a surface estimate for the pair of images further cause the one or more processors to perform operations to; (i) determine a likelihood of each point in a volume of interest being a surface point based on an empty space identified in (b) such that the likelihood of the point being a surface point is less when the point is inside the empty space than when the point is outside the empty space; and (ii) determine a surface estimate for the pair of images based on the likelihoods determined in (c)(i), wherein a distance between positions of cameras used to generate the pair of images for the second stereo reconstruction technique is less than a distance between positions of cameras used to generate a pair of images used in the first stereo reconstruction technique; and (d) merge the surface estimates determined in (c) to identify a final surface. - View Dependent Claims (20)
-
-
21. A method for determining a three-dimensional model from a plurality of images taken of a geographic region by one or more cameras from different perspectives, comprising:
-
(a) determining, using wide-baseline stereo, a plurality of three-dimensional candidate surface points from the plurality of images; (b) identifying, using wide-baseline stereo, an empty space between each of the plurality of candidate surface points and each camera model for the respective images that view the candidate surface point; (c) for each of a plurality of pairs of images from the plurality of images, determining, using narrow-baseline stereo, a surface estimate for the pair of images based on the plurality of candidate surface points and the empty space; and (d) merging the surface estimates determined in (c) to identify a final surface.
-
Specification