Measurement of depth image considering time delay
First Claim
1. A depth image measurement apparatus for acquiring depth information of a scene, comprising:
- image input means for inputting an image of the scene at a first viewpoint;
depth image generation means for generating a first depth image from the scene image inputted at the first viewpoint by said image input means;
position/posture estimation means for estimating, based on information relating to displacement of the first viewpoint, a position and posture information at a second viewpoint viewed from a position and posture of the first viewpoint; and
warping means for warping the first depth image generated by said depth image generation means to a second depth image at the second viewpoint on the basis of the position and posture information at the second viewpoint estimated by said position/posture estimation means.
2 Assignments
0 Petitions
Accused Products
Abstract
An augmented reality presentation system that generates and presents a virtual image free from any latency from a real space. This system has a position/posture sensor for time-sequentially inputting viewpoint position/posture information, stereo cameras for inputting a continuous time sequence of a plurality of images, and an image processing apparatus. The image processing apparatus detects a continuous time sequence of depth images ID from the continuous time sequence of input stereo images, estimates the viewpoint position/posture of the observer at a future time at which a three-dimensional image will be presented to the observer, on the basis of changes in previous viewpoint position/posture input from the position/posture sensor, continuously warps the continuously obtained depth images to those at the estimated future viewpoint position/posture, and presents three-dimensional grayscale (or color) images generated according to the warped depth images to the observer.
200 Citations
49 Claims
-
1. A depth image measurement apparatus for acquiring depth information of a scene, comprising:
-
image input means for inputting an image of the scene at a first viewpoint;
depth image generation means for generating a first depth image from the scene image inputted at the first viewpoint by said image input means;
position/posture estimation means for estimating, based on information relating to displacement of the first viewpoint, a position and posture information at a second viewpoint viewed from a position and posture of the first viewpoint; and
warping means for warping the first depth image generated by said depth image generation means to a second depth image at the second viewpoint on the basis of the position and posture information at the second viewpoint estimated by said position/posture estimation means. - View Dependent Claims (13, 47)
said depth image generation means generates a first depth image in correspondence with each respective first viewpoint, said position/posture estimation means estimates viewpoint position/posture information of the second viewpoint viewed from each first viewpoint, and said warping means warps each of the first depth image on the basis of each viewpoint position/posture information, and outputs the second depth image. -
47. An apparatus according to claim 1, wherein said warping means generates at least one of the second sequence of depth images from a respective one of the first sequence of depth images by approximation method.
-
-
2. A depth image measurement apparatus for continuously acquiring depth information of a scene, comprising:
-
image input means for inputting a sequence of images of the scene at a first sequence of viewpoints;
depth image generation means for generating a first sequence of depth images from the scene images sequentially input at said first sequence of viewpoints by said image input means;
position/posture estimation means for estimating, based on information relating to displacement of the first viewpoint, a sequence of viewpoint position/posture information of a second sequence of viewpoints viewed from the first sequence of viewpoints; and
warping means for sequentially warping the first sequence of depth images generated by said depth image generation means to a second sequence of depth images at the second sequence of viewpoints on the basis of the viewpoint position/posture information of the second sequence of viewpoints estimated by said position/posture estimation means. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 45, 46, 48)
recovery means for recovering a time-sequential locus of a viewpoint on the basis of the images sequentially input by said image input means, and said position/posture estimation means estimates the sequence of viewpoint position/posture information on the basis of the locus.
-
-
20. The apparatus according to claim 19, wherein said recovery means recovers the locus of the viewpoint on the basis of time-sequential information of images input by said image input means.
-
21. The apparatus according to claim 19, wherein said recovery means recovers the locus of the viewpoint on the basis of information of each image input by said image input means.
-
22. The apparatus according to claim 18, wherein said position/posture estimation means estimates the image coordinate positions of the feature points at the second viewpoint position on the basis of images sequentially input by said image input means, and
estimates the sequence of viewpoint position/posture information on the basis of the estimated image coordinate positions. -
23. The apparatus according to claim 17, wherein said position/posture estimation means estimates a viewpoint position of said image input means on the basis of the image, and
outputs a viewpoint position/posture information obtained by adding an offset to the viewpoint position and posture of said image input means. -
24. The apparatus according to claim 2, wherein said position/posture estimation means receives a position/posture information from a three-dimensional position/posture sensor attached to said image input means, and
estimates the viewpoint position/posture information on the basis of a signal from said three-dimensional position/posture sensor. -
25. The apparatus according to claim 24, wherein said position/posture estimation means recovers a time-sequential locus of a viewpoint on the basis of the signals sequentially inputted from said three-dimensional position/posture sensor, and
estimates viewpoint on the basis of the locus. -
26. The apparatus according to claim 24, wherein said position/posture estimation means estimates a position and posture of a viewpoint of said three-dimensional position/posture sensor itself on the basis of an output from said three-dimensional position/posture sensor, and
outputs a viewpoint information obtained by adding an offset to the position and posture of the viewpoint of said three-dimensional position/posture sensor. -
27. The apparatus according to claim 19, wherein said position/posture estimation means specifies a function having a shape similar to a locus of the viewpoint, and
outputs as the viewpoint position/posture information a value of the function at the second time. -
28. The apparatus according to claim 17, wherein said position/posture estimation means outputs as the viewpoint position/posture information a coordinate transformation matrix from each first viewpoint to a respective second viewpoint.
-
29. The apparatus according to claim 2, wherein said warping means includes calculation means for calculating a coordinate value and depth value of a point on the second depth image, which corresponds to a point on the first depth image, by three-dimensional coordinate transformation on the basis of the viewpoint position/posture information.
-
30. The apparatus according to claim 29, wherein said warping means further includes compensation means for compensating a depth value of each point on the second depth image, which cannot be calculated by said calculation means.
-
45. An image processing apparatus using an image measurement apparatus of claim 2.
-
46. An image processing method using an image measurement apparatus of claim 2.
-
48. An apparatus according to claim 2, wherein said warping means generates at least one of the second sequence of depth images from a respective one of the first sequence of depth images by approximation method.
-
31. An augmented reality presentation system comprising:
-
a depth image measurement apparatus for continuously acquiring depth information of a scene, comprising;
image input means for inputting a sequence of images of the scene at a first sequence of viewpoints;
depth image generation means for generating a first sequence of depth images from the scene images sequentially input at said first sequence of viewpoints by said image input means;
position/posture estimation means for estimating, based on information relating to displacement of the first sequence of viewpoints, a sequence of viewpoint position/posture information of a second sequence of viewpoints viewed from the first sequence of viewpoints; and
warping means for sequentially warping the first sequence of depth images generated by said depth image generation means to a second sequence of depth images at the second sequence of viewpoints on the basis of the viewpoint position/posture information of the second sequence of viewpoints estimated by said position/posture estimation means; and
a head mount display comprising a plurality of video cameras for inputting images in front of an observer, and a display for displaying a three-dimensional grayscale image or color image, wherein a three-dimensional grayscale or color image generated according to the second depth image is presented to the observer. - View Dependent Claims (32, 33, 34, 35, 36, 37, 38, 39, 40, 49)
an optical see-through HMD;
rendering means for rendering a computer graphics figure; and
masking means for masking the computer graphics figure on the basis of the second depth image, and wherein the computer graphics figure masked by said masking means is output to said display.
-
-
33. The system according to claim 31, further comprising:
-
a video see-through HMD;
rendering means for rendering a computer graphics figure; and
merging means for merging the computer graphics figure to an image input by said image input means on the basis of the second depth image, and wherein the computer graphics figure merged by said merging means is output to said display.
-
-
34. The system according to claim 32, wherein said rendering means generates a grayscale image or color image and a depth image of the computer graphics figure, and
said masking means masks the computer graphics figure on the basis of depth ordering of the depth image of the computer graphics figure and the second depth image in units of pixels. -
35. The system according to claim 33, wherein said rendering means generates a grayscale image or color image and a depth image of the computer graphics figure, and
said merging means executes occlusion processing between the computer graphics figure and an image input by said image input means on the basis of depth ordering of the depth image of the computer graphics figure and the second depth image in units of pixels. -
36. The system according to claim 32, wherein the second sequence of viewpoints are a viewpoint of an observer, and
an offset is an offset from a viewpoint of one of said video cameras to a viewpoint of the observer. -
37. The system according to claim 36, wherein the viewpoint of the observer includes right and left viewpoints of the observer.
-
38. The system according to claim 36, wherein the second time is a time at which the graphics figure is displayed on said display.
-
39. The system according to claim 33, wherein the second sequence of viewpoints include a viewpoint position of the video cameras corresponding to right and left viewpoints of the observer at the second time.
-
40. The system according to claim 39, wherein the second time is a time at which the image to be merged with the graphics figure was sensed.
-
49. A system according to claim 31, wherein said warping means generates at least one of the second sequence of depth images from a respective one of the first sequence of depth images by approximation method.
-
41. A depth image measurement method for continuously acquiring depth information of a scene, comprising:
-
the image input step of inputting a sequence of images of the scene from a first sequence of viewpoints;
the depth image generation step of generating a first sequence of depth images from the scene images inputted in the image input step;
the position/posture estimation step of estimating, based on information relating to displacement of the first sequence of viewpoints, a sequence of viewpoint position/posture information for a second sequence of viewpoints viewed from the first sequence of viewpoints; and
the warping step of continuously warping the first sequence of depth images generated in the depth image generation step to a second sequence of depth images at the second sequence of viewpoints on the basis of the viewpoint position/posture information of the second sequence of viewpoints estimated in the position/posture estimation step.
-
-
42. An augmented reality presentation method comprising:
-
the image input step of inputting a sequence of images of a scene from a first sequence of viewpoints, using a stereo camera for outputting a stereo image in front of an observer;
the depth image generation step of generating a first sequence of depth images from the scene images continuously input in the image input step;
the position/posture estimation step of estimating, based on information relating to displacement of the first sequence of viewpoints, a sequence of viewpoint position/posture information at a second sequence of viewpoints, when viewed from the first sequence of viewpoints;
the warping step of warping the first sequence of depth images continuously generated in the depth image generation step to a second sequence of depth images at the second sequence of viewpoints on the basis of the sequence of viewpoint position/posture information of the second sequence of viewpoints estimated in the position/posture estimation step;
the step of discriminating depth ordering of a virtual three-dimensional grayscale image and a real world on the-basis of the second depth image; and
the step of displaying the virtual three-dimensional grayscale image onto a head mount display to merge the grayscale image with the real world.
-
-
43. A storage medium that stores an image processing program, which is implemented on a computer and continuously presents three-dimensional images to an observer, storing:
-
an image input program code of inputting a sequence of images from a first sequence of viewpoints;
a depth image generation program code of generating a first sequence of depth images from the continuously input images;
a position/posture information estimation program code of estimating, based on information relating to displacement of the first sequence of viewpoints, a sequence of viewpoint position/posture information of a second sequence of viewpoints, when viewed from the first sequence of viewpoints;
a warping program code of continuously warping the continuously generated first sequence of depth images into second sequence of depth images at the second sequence of viewpoints on the basis of the viewpoint position/posture information; and
a program code of presenting to the observer three-dimensional grayscale images or color images generated according to the second depth images. - View Dependent Claims (44)
-
Specification