Method and apparatus for recovering depth using multi-plane stereo and spatial propagation
First Claim
1. A method of recovering depth information for pixels of a base image representing a view of a scene, the method comprising the steps of:
- detecting a plurality of pixels in a base image that represents a first view of a scene, wherein the first view is one of at least three relative views of a scene;
determining 3-D depth of the plurality of pixels in the base image by contemporaneously matching image features between three or more relative images of a scene by matching correspondence to a plurality of pixels in three or more relative images representing the at least three relative views of the scene, wherein the three or more images are captured relative to each other and are of the same scene, and wherein each of the three or more images includes at least an overlapping portion of the scene relative to each other, wherein the at least three relative views of the scene are situated in a non-linear arrangement relative to each other and are further oriented in non-parallel planes relative to each other; and
tracing pixels in a virtual piecewise continuous depth surface by spatial propagation starting from the detected pixels in the base image by using the matching and corresponding plurality of pixels in the three or more relative images to create the virtual piecewise continuous depth surface viewed from the base image, each successfully traced pixel being associated with a depth in the scene viewed from the base image.
1 Assignment
0 Petitions
Accused Products
Abstract
An image processing system recovers 3-D depth information for pixels of a base image representing a view of a scene. The system detects a plurality of pixels in a base image that represents a first view of a scene. The system the determines 3-D depth of the plurality of pixels in the base image by matching correspondence to a plurality of pixels in a plurality of images representing a plurality of views of the scene. The system then traces pixels in a virtual piecewise continuous depth surface by spatial propagation starting from the detected pixels in the base image by using the matching and corresponding plurality of pixels in the plurality of images to create the virtual piecewise continuous depth surface viewed from the base image, each successfully traced pixel being associated with a depth in the scene viewed from the base image.
-
Citations
29 Claims
-
1. A method of recovering depth information for pixels of a base image representing a view of a scene, the method comprising the steps of:
-
detecting a plurality of pixels in a base image that represents a first view of a scene, wherein the first view is one of at least three relative views of a scene; determining 3-D depth of the plurality of pixels in the base image by contemporaneously matching image features between three or more relative images of a scene by matching correspondence to a plurality of pixels in three or more relative images representing the at least three relative views of the scene, wherein the three or more images are captured relative to each other and are of the same scene, and wherein each of the three or more images includes at least an overlapping portion of the scene relative to each other, wherein the at least three relative views of the scene are situated in a non-linear arrangement relative to each other and are further oriented in non-parallel planes relative to each other; and tracing pixels in a virtual piecewise continuous depth surface by spatial propagation starting from the detected pixels in the base image by using the matching and corresponding plurality of pixels in the three or more relative images to create the virtual piecewise continuous depth surface viewed from the base image, each successfully traced pixel being associated with a depth in the scene viewed from the base image. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. An image processing system comprising:
-
a memory; a controller/processor electrically coupled to the memory; an image matching module, electrically coupled to the controller/processor and to the memory, for providing a plurality of seed pixels that represent 3-D depth of the plurality of pixels in the base image view of a scene by contemporaneously matching image features between three or more relative images of a scene by matching correspondence to a plurality of pixels in three or more relative images representing a plurality of relative views of the scene, wherein the three or more relative images are captured relative to each other and are of the same scene, and wherein each of the three or more images includes at least an overlapping portion of the scene relative to each other, wherein the base image view is one of at least three relative views of the scene, the at least three relative views of the scene being situated in a non-linear arrangement and are further oriented in non-parallel planes relative to each other; and a propagation module, electrically coupled to the controller/processor and to the memory, for tracing pixels in a virtual piecewise continuous depth surface by spatial propagation starting from the provided plurality of seed pixels in the base image by using the matching and corresponding plurality of pixels in the three or more relative images to create the virtual piecewise continuous depth surface viewed from the base image, each successfully traced pixel being associated with a depth in the scene viewed from the base image. - View Dependent Claims (13, 14, 15, 16, 17)
-
-
18. A non-transistory computer readable medium including computer instructions for a 3-D image reconstruction computer system, the computer instructions comprising instructions for:
-
detecting a plurality of pixels in a base image that represents a first view of a scene, wherein the first view is one of at least three relative views of a scene; determining 3-D depth of the plurality of pixels in the base image by contemporaneously matching image features between three or more relative images of a scene by matching correspondence to a plurality of pixels in three or more relative images representing the at least three relative views of the scene, wherein the three or more images are captured relative to each other and are of the same scene, and wherein each of the three or more images includes at least an overlapping portion of the scene relative to each other, wherein the at least three relative views of the scene are situated in a non-linear arrangement relative to each other and are further oriented in non-parallel planes relative to each other; and tracing pixels in a virtual piecewise continuous depth surface by spatial propagation starting from the detected pixels in the base image by using the matching and corresponding plurality of pixels in the three or more relative images to create the virtual piecewise continuous depth surface viewed from the base image, each successfully traced pixel being associated with a depth in the scene viewed from the base image. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
-
29. A method of recovering depth information for pixels of a base image representing a view of a scene, the method comprising the steps of:
-
capturing a base image representing a view of a scene; tracing at least one parameter surface associated with the base image, each of the at least one parameter surface traced starting from at least one predetermined seed pixel point associated with the base image; calculating a derivative of function E(g) with respect to parameter g by using finite difference to minimize the following equation where is a normalized cross-correlation between Ii(ū
,v ) and I0(uj, vj);
Ii(ū
,v ) is a first window of size ω
×
ω
centered at pixel (ū
,v ) in Ii,
I0(uj, vj) is a second window of size ω
×
ω
centered at pixel (uj, vj) in I0,
Ii is a reference image,
I0 is a base image,
(ū
,v ) is a pixel point closest to Pi(C0+gL(uj, vj));
C0+gL(uj, vj) is a 3-D point that projects to (uj, vj) in the base view, and to Pi(C0+gL(uj, vj)) in the ith reference view;
C0 is the base image camera'"'"'s center of projection;
L(uj, vj) is the unit vector from C0 to the point on the image plane that corresponds the pixel (uj, vj);
g is a depth parameter; andstoring the calculated derivative of the function E(g) in a memory; and generating a three-dimensional image of at least a portion of the scene using at least the calculated derivative of the function E(g) that has been stored in memory.
-
Specification