Method and apparatus for determining the distance between an image and an object
First Claim
1. An apparatus for estimating the distance of an object from a video image comprising an array of pixels, said apparatus comprising:
- imaging means having an image plane for acquiring in said image plane at a plurality of successive imaging positions, greater than two, a respective plurality of video images of a region of the object having a distance component in a given direction from said image plane, which distance component is to be determined, each image of said plurality of images having at least one pixel corresponding to a point on said object within said region;
means for automatically, without any action by an operator, predicting the location of a pixel window of a predetermined size in which said at least one pixel will be located in a next succeeding one of said plurality of video images;
means for estimating the value of the disparity of said at least one pixel in each pixel window, said disparity corresponding to the difference in position of said at least one pixel in a current image as compared to the position of said at least one pixel in an immediately preceding one of said plurality of images; and
means for determining the value of said distance component from the value of a last estimated disparity.
2 Assignments
0 Petitions
Accused Products
Abstract
A video camera is displaced to successive positions a displacement distance that differs from each preceding position by a factor of two wherein the camera image plane is at a constant distance from an object whose distance from the camera is to be estimated. The further that the camera is moved the more accurate the estimated distance. The camera is displaced a relatively small increment initially to provide an accurate estimate of the corresponding pixel of interest in the next camera position. The successive displacements increase rapidly to relatively large values, to which the estimated pixel disparity corresponds to give a relatively accurate camera to object distance estimate. The window size and position of each successive camera position for each pixel of the image can be accurately estimated to reduce potential error in pixel disparity estimation.
127 Citations
17 Claims
-
1. An apparatus for estimating the distance of an object from a video image comprising an array of pixels, said apparatus comprising:
-
imaging means having an image plane for acquiring in said image plane at a plurality of successive imaging positions, greater than two, a respective plurality of video images of a region of the object having a distance component in a given direction from said image plane, which distance component is to be determined, each image of said plurality of images having at least one pixel corresponding to a point on said object within said region; means for automatically, without any action by an operator, predicting the location of a pixel window of a predetermined size in which said at least one pixel will be located in a next succeeding one of said plurality of video images; means for estimating the value of the disparity of said at least one pixel in each pixel window, said disparity corresponding to the difference in position of said at least one pixel in a current image as compared to the position of said at least one pixel in an immediately preceding one of said plurality of images; and means for determining the value of said distance component from the value of a last estimated disparity. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An apparatus for estimating the distance of an object from a video image comprising an array of pixels, said apparatus comprising:
-
first imaging means having an image plane for creating at a first imaging means position a first video image of a region of the object, said first imaging means position having a distance component in a given direction from said image plane, which distance component is to be determined; second imaging means having a corresponding image plane for creating at a second imaging means position at a first distance value from the first position a second video image of said region, said second imaging means position having said distance component in said given direction from said corresponding image plane; means for automatically, without any action by an operator, predicting the location of a pixel window of a determined size at said second position in which at least one pixel of the first image is predicted to be positioned, said at least one pixel corresponding to a point on said object within said region; means for estimating the value of a disparity of said at least one pixel in said window, said disparity corresponding to the difference in position of said at least one pixel in said second image as compared to the position of said at least one pixel in said first video image; means for causing said second imaging means to create successive images at successive positions, which successive positions are at respective first distance values, for causing said means for predicting and estimating to successively respectively predict the pixel window location of each successive position and estimate the disparity value of said at least one pixel for each successive position; and means for determining the value of said distance component from the value of the last estimated disparity.
-
-
12. A system for manipulating an object comprising:
-
a robot; computer means for operating the robot; imaging means coupled to the computer means for imaging said object, said imaging means having an image plane for creating at a plurality of imaging means positions, greater than two, a corresponding plurality of video images of a region of the object, said images each comprising a matrix of pixels, said region having a distance component to be determined, which distance component is in a given direction from said image plane; and distance determining means coupled to the computer means and imaging means for estimating the distance component and therefrom, the distance between said robot and object, said distance determining means including;
means for placing imaging means at each said position for creating images of said object at each position such that all said image planes at said positions are coplanar, means for automatically, without any action by an operator, predicting a pixel window in each successively created image in which a given pixel of a preceding image is expected to occur and means for estimating a disparity of the given pixel in each said image, wherein the position of the imaging means at each successive position differs from the preceding position by a factor greater than one.
-
-
13. A method of estimating the distance of an object from a video image comprising an array of pixels, said method comprising:
-
acquiring in an image plane at a plurality, greater than two, of successive imaging positions a respective plurality of video images of a region of the object having a distance component in a given direction from said image plane, each image of said acquired plurality of images having at least one pixel corresponding to a point on the object within said region; automatically, without any action by an operator, predicting the location of a pixel window of a predetermined magnitude in the image acquired at each of said successive positions in which said at least one pixel is located; estimating the value of a disparity of said at least one pixel in each pixel window, said disparity corresponding to the difference in position of said at least one pixel in each successive image of said plurality of images as compared to the position of said at least one pixel in the immediately preceding image; and determining the value of said distance component from the value of the last estimated disparity. - View Dependent Claims (14, 15)
-
-
16. An apparatus for grasping and manipulating an object using a video image comprising an array of pixels to estimate the distance of the object, said apparatus comprising:
-
imaging means having an image plane for acquiring in said image plane at a plurality of successive imaging positions, greater than two, a respective plurality of video images of a region of the object having a distance component in a given direction from said image plane, which distance component is to be determined, each image of said plurality of images comprising an array of pixels including at least one pixel corresponding to a point on said object within said region; means for automatically, without any action by an operator, predicting the location of a pixel window of a predetermined size in which said at least one pixel will be located in a next succeeding one of said plurality of video images; means for estimating the value of the disparity of said at least one pixel in each pixel window, said disparity corresponding to the difference in position of said at least one pixel in a current image as compared to the position of said at least one pixel in an immediately preceding one of said plurality of images; means for determining the value of said distance component from the value of a last estimated disparity; and robot means constructed to grasp and manipulate said object; and control means responsive to said determined value of said distance component for causing said robot means to grasp said object. - View Dependent Claims (17)
-
Specification