REAL-TIME THREE-DIMENSIONAL RECONSTRUCTION OF A SCENE FROM A SINGLE CAMERA
First Claim
1. A device comprising:
- a camera tracking circuit having an input that receives an image from a sequence of images and an output providing a pose for the image, and an output storing the image as a key frame based on the pose of the image and a pose of other stored key frames;
a depth map estimation circuit having an input the receives the image from the sequence of images and one or more key frames from among the stored key frames selected for the image and having an output providing a depth map for the image; and
a volumetric fusion circuit having an input receiving the depth map for the image and an output providing a three-dimensional model as a fusion of depth maps received for the sequence of images.
3 Assignments
0 Petitions
Accused Products
Abstract
A combination of three computational components may provide memory and computational efficiency while producing results with little latency, e.g., output can begin with the second frame of video being processed. Memory usage may be reduced by maintaining key frames of video and pose information for each frame of video. Additionally, only one global volumetric structure may be maintained for the frames of video being processed. To be computationally efficient, only depth information may be computed from each frame. Through fusion of multiple depth maps from different frames into a single volumetric structure, errors may average out over several frames, leading to a final output with high quality.
-
Citations
20 Claims
-
1. A device comprising:
-
a camera tracking circuit having an input that receives an image from a sequence of images and an output providing a pose for the image, and an output storing the image as a key frame based on the pose of the image and a pose of other stored key frames; a depth map estimation circuit having an input the receives the image from the sequence of images and one or more key frames from among the stored key frames selected for the image and having an output providing a depth map for the image; and a volumetric fusion circuit having an input receiving the depth map for the image and an output providing a three-dimensional model as a fusion of depth maps received for the sequence of images. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A process for generating a three-dimensional model from a sequence of images from a camera, the process comprising:
-
receiving an image from the sequence of images; selecting one or multiple key frames from the sequence of images corresponding to the received image; determining a depth map for the received image using the received image and the selected key frame(s); merging the depth map into a volumetric representation of a scene represented in the image; and repeating the receiving, selecting, determining and merging for successive images in the sequence of images. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer program product, comprising:
-
a computer storage device; computer program instructions stored in the computer storage device that when read from the storage device and processed by a processor of a computer instruct the computer to perform a process for generating a three-dimensional model from a sequence of images from a camera, the process comprising; receiving an image from the sequence of images; selecting a key frame from the sequence of images corresponding to the received image; determining a depth map for the received image using the received image and the selected key frame; merging the depth map into a volumetric representation of a scene represented in the image; and repeating the receiving, selecting, determining and merging for successive images in the sequence of images. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification