Method of pose estimation and model refinement for video representation of a three dimensional scene
First Claim
1. A method for accurately estimating a pose of a camera within a scene using a three dimension model of the scene, comprising the steps of:
- (a) generating an initial estimate of the pose;
(b) selecting a set of relevant features of the three dimensional model based on the initial estimate of the pose, the relevant features representing discontinuities in at least one of a surface normal or material properties in the scene;
(c) creating a virtual projection of the set of relevant features responsive to the initial estimate of the pose;
(d) matching a plurality of features of an image received from the camera to the virtual projection of the set of relevant features and measuring a plurality of matching errors; and
(e) updating the estimate of the pose to reduce the plurality of matching errors.
3 Assignments
0 Petitions
Accused Products
Abstract
The present invention is embodied in a video flashlight method. This method creates virtual images of a scene using a dynamically updated three-dimensional model of the scene and at least one video sequence of images. An estimate of the camera pose is generated by comparing a present image to the three-dimensional model. Next, relevant features of the model are selected based on the estimated pose. The relevant features are then virtually projected onto the estimated pose and matched to features of the image. Matching errors are measured between the relevant features of the virtual projection and the features of the image. The estimated pose is then updated to reduce these matching errors. The model is also refined with updated information from the image. Meanwhile, a viewpoint for a virtual image is selected. The virtual image is then created by projecting the dynamically updated three-dimensional model onto the selected virtual viewpoint.
150 Citations
23 Claims
-
1. A method for accurately estimating a pose of a camera within a scene using a three dimension model of the scene, comprising the steps of:
-
(a) generating an initial estimate of the pose; (b) selecting a set of relevant features of the three dimensional model based on the initial estimate of the pose, the relevant features representing discontinuities in at least one of a surface normal or material properties in the scene; (c) creating a virtual projection of the set of relevant features responsive to the initial estimate of the pose; (d) matching a plurality of features of an image received from the camera to the virtual projection of the set of relevant features and measuring a plurality of matching errors; and (e) updating the estimate of the pose to reduce the plurality of matching errors. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method for accurately estimating a pose of a camera within a scene using a three dimension model of the scene, comprising the steps of:
-
(a) generating an initial estimate of the pose; (b) selecting a set of relevant features of the three dimensional model based on the initial estimate of the pose including the steps of (b1) creating a set of features of the three dimensional model including a plurality of edges of at least one object represented in the three dimensional model, each edge having a dihedral angle of greater than a predetermined angle; and (b2) selecting the set of relevant features from the set of features of the three dimensional model. (c) creating a virtual projection of the set of relevant features responsive to the initial estimate of the pose; (d) matching a plurality of features of an image received from the camera to the virtual projection of the set of relevant features and measuring a plurality of matching errors; and (e) updating the estimate of the pose to reduce the plurality of matching errors.
-
-
8. A method for accurately estimating a pose of a camera within a scene using a three dimension model of the scene, comprising the steps of:
-
(a) generating an initial estimate of the pose; (b) selecting a set of relevant features of the three dimensional model based on the initial estimate of the pose; (c) creating a virtual projection of the set of relevant features responsive to the initial estimate of the pose including the step of removing a model feature from the set of relevant features if the model feature is less than a predetermined distance from a remaining relevant feature in the virtual projection. (d) matching a plurality of features of an image received from the camera to the virtual projection of the set of relevant features and measuring a plurality of matching errors; and (e) updating the estimate of the pose to reduce the plurality of matching errors.
-
-
9. A method for accurately estimating a pose of a camera within a scene using a three dimension model of the scene, comprising the steps of:
-
(a) generating an initial estimate of the pose; (b) selecting a set of relevant features of the three dimensional model based on the initial estimate of the pose; (c) creating a virtual projection of the set of relevant features responsive to the initial estimate of the pose; (d) matching a plurality of features of an image received from the camera to the virtual projection of the set of relevant features and measuring a plurality of matching errors; (e) updating the estimate of the pose to reduce the plurality of matching errors; and (f) perturbing the estimate of the pose and repeating to steps (c), (d), and (e) using the perturbed estimate of the pose until the plurality of matching errors are less than predetermined matching criteria.
-
-
10. A method for accurately estimating a pose of a camera within a scene using a three dimension model of the scene, comprising the steps of:
-
(a) generating an initial estimate of the pose; (b) selecting a set of relevant features of the three dimensional model based on the initial estimate of the pose, the relevant features representing discontinuities in at least one of a surface normal or material properties in the scene; (c) creating a virtual projection of the set of relevant features responsive to the initial estimate of the pose; (d) matching a plurality of features of an image received from the camera to the virtual projection of the set of relevant features and measuring a plurality of matching errors including the steps of; (d1) computing an oriented energy image of the image received from the camera; and (d2) integrating the oriented energy image along the virtual projection of the set of relevant features and measuring the plurality of matching errors; and (e) updating the estimate of the pose to reduce the plurality of matching errors.
-
-
11. A method for accurately estimating a pose of a camera within a scene using a three dimension model of the scene, comprising the steps of:
-
(a) generating an initial estimate of the pose; (b) selecting a set of relevant features of the three dimensional model based on the initial estimate of the pose; (c) creating a virtual projection of the set of relevant features responsive to the initial estimate of the pose; (d) matching a plurality of features of an image received from the camera to the virtual projection of the set of relevant features and measuring a plurality of matching errors including the steps of; (d1) computing an oriented energy image of the image received from the camera; (d2) generating a set of scaled oriented energy images; (d3) selecting a scaled oriented energy image from the set of scaled oriented energy images; and (d4) integrating the selected scaled oriented energy image along the virtual projection of the set of relevant features and measuring the plurality of matching errors. (e) updating the estimate of the pose to reduce the plurality of matching errors.
-
-
12. A method for refining a three dimensional model of a scene using an image of the scene taken by a camera having an unknown pose, comprising the steps of:
-
(a) comparing the image to a set of relevant features of the three dimension model of the scene to generate an estimate of the pose, the relevant features representing discontinuities in at least one of the surface normal or the material properties in the scene; and (b) updating the three dimensional model of the scene based on data from the image and the estimate of the pose. - View Dependent Claims (13)
-
-
14. A method for refining a three dimension model of a scene containing an object using a plurality of images of the scene, each image including the object, comprising the steps of:
-
(a) comparing a first image of the plurality of images to the three dimension model of the scene to generate an estimate of a first viewpoint corresponding to the first image; (b) comparing a second image of the plurality of images to the three dimension model of the scene to generate an estimate of a second viewpoint corresponding to the second image; (c) selecting a first set of relevant features of the three dimensional model based on the first viewpoint; (d) matching a plurality of first features of the first image to the first set of relevant features and measuring a plurality of first matching errors; (e) selecting a second set of relevant features of the three dimensional model based on the second viewpoint; (f) matching a plurality of second features of the second image to the second set of relevant features and measuring a plurality of second matching errors; and (g) updating a position estimate of the object within the three dimensional model of the scene based on the plurality of first matching errors and the plurality of second matching errors. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A method for refining a three dimension model of a scene containing an object using a plurality of images of the scene, each image including the object, comprising the steps of:
-
(a) selecting a subset of images from the plurality of images of the scene, the subset of frames containing at least two of the images; (b) determining a plurality of approximate relative viewpoints of the subset of images; (c) comparing each image in the subset of images to the three dimensional model to generate a subset of estimated viewpoints corresponding to the subset of images, the subset of estimated viewpoints constrained by the plurality of approximate relative viewpoints; (d) selecting a set of relevant features of the three dimensional model corresponding to each estimated viewpoint; (e) matching a plurality of features of the each image in the subset of images to the corresponding set of relevant features and measuring a plurality of matching errors; and (f) updating a position estimate of the object within the three dimensional model of the scene based on the plurality of matching errors. - View Dependent Claims (20, 21)
-
-
22. A computer readable medium adapted to instruct a general purpose computer to update a three dimensional model of a scene using the three dimensional model of the scene, an image received from a camera having an unknown pose, the method comprising the steps of:
-
(a) generating an estimate of the pose; (b) selecting a set of relevant features of the three dimensional model based on the estimate of the pose, the relevant features representing discontinuities in at least one of a surface normal or material properties in the scene; (c) creating a virtual projection of the set of relevant features responsive to the estimate of the pose; (d) matching a plurality of features of an image received from the camera to the virtual projection of the set of relevant features, and measuring a plurality of matching errors; (e) updating the estimate of the pose to reduce the plurality of matching errors; and (f) updating the three dimensional model of the scene based on data from the image and the estimate of the pose.
-
-
23. An automatic three-dimensional model updating apparatus for accurately estimating a point of view of an image of a scene, relative to a three-dimensional model of the scene, and updating the three-dimensional model comprising:
-
(a) estimating means for providing an estimate of the point of view of the image; (b) relevant feature selecting means for selecting a set of relevant features of the three dimensional model based on the estimate of the point of view, the relevant features representing discontinuities in at least one of a surface normal or material properties in the scene; (c) virtual projection means for creating a virtual projection of the set of relevant features responsive to the estimate of the point of view; (d) matching means for matching a plurality of features of the image to the virtual projection of the set of relevant features; (e) measurement means for measuring a plurality of matching errors; (f) point of view refinement means for updating the estimate of the point of view to reduce the plurality of marching errors; and (g) model refinement means, responsive to the estimated point of view and to the image, for updating the three-dimensional model.
-
Specification