Method and apparatus for predicting the direction of movement in machine vision
First Claim
1. A machine vision system for determining movement in a 2-dimensional field of view comprising:
- a) video camera means for viewing the field of view and producing 2-dimensional binary representations of the pattern thereof at consecutive times t1 and t2 ;
b) image enhancement means for receiving said 2-dimensional binary representations of the field of view and for producing enhanced 2-dimensional binary representations thereof;
c) computational means for producing smoothed versions of said enhanced 2-dimensional binary representations, said computational means including means for filtering binary data comprising said enhanced 2-dimensional binary representations to provided the spatial gradients of components thereof comprising a background frame of reference and components identified with objects moving against said background frame of reference;
d) means for producing the temporal gradients of said enhanced 2-dimensional binary representations; and
,e) sensing means for comparing said spatial and temporal gradients at said consecutive times to one another to determine any motion parallax existing in the field of view, whereby movement of objects in the field of view is determined.
2 Assignments
0 Petitions
Accused Products
Abstract
A computer-simulated cortical network computes the visibility of shifts in the direction of movement and computes: 1) the magnitude of the position difference between the test and background patterns, 2) localized contrast differences at different spatial scales analyzed by computing temporal gradients of the difference and sum of the outputs of paired even- and odd-symmetric bandpass filters convolved with the input pattern and 3) using global processes that pool the output from paired even- and odd-symmetric simple and complex cells across the spatial extent of the background frame of reference to determine the direction a test pattern moved relative to a textured background. The direction of movement of an object in the field of view of a robotic vision system is detected in accordance with non-linear Gabor function algorithms. The movement of objects relative to their background is used to infer the 3-dimensional structure and motion of object surfaces.
297 Citations
27 Claims
-
1. A machine vision system for determining movement in a 2-dimensional field of view comprising:
-
a) video camera means for viewing the field of view and producing 2-dimensional binary representations of the pattern thereof at consecutive times t1 and t2 ; b) image enhancement means for receiving said 2-dimensional binary representations of the field of view and for producing enhanced 2-dimensional binary representations thereof; c) computational means for producing smoothed versions of said enhanced 2-dimensional binary representations, said computational means including means for filtering binary data comprising said enhanced 2-dimensional binary representations to provided the spatial gradients of components thereof comprising a background frame of reference and components identified with objects moving against said background frame of reference; d) means for producing the temporal gradients of said enhanced 2-dimensional binary representations; and
,e) sensing means for comparing said spatial and temporal gradients at said consecutive times to one another to determine any motion parallax existing in the field of view, whereby movement of objects in the field of view is determined. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. In a machine vision system including a video camera having an input lens for viewing an area of interest and producing an analog signal at an output thereof reflecting a 2-dimensional area of interest being viewed by the input lens, an analog to digital converter connected to the output of the video camera for converting the analog signal into a binary signal, a digital buffer for storing the binary signal, and logic for analyzing the binary signal at times t1 and t2 to provide information about the contents of the area of interest, the method for determining movement in the area of interest comprising the steps of:
-
a) filtering the binary signal from the digital buffer at times t1 and t2 to produce enhanced 2-dimensional binary representations of the area of interest at the times; b) producing smoothed versions of the enhanced 2-dimensional binary representations; c) processing the enhanced 2-dimensional binary representations to provided the spatial gradients of components thereof comprising a background frame of reference and components identified with objects moving against the background frame of reference; d) producing the temporal gradients of the enhanced 2-dimensional binary representations; and
,e) comparing the spatial and temporal gradients at times t1 and t2 to one another to determine any motion parallax existing in the area of interest, whereby movement of objects in the area of interest is determined. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. In a machine vision system including a video camera having an input lens for viewing an area of interest and producing an analog signal at an output thereof reflecting a 2-dimensional area of interest being viewed by the input lens, an analog to digital converter connected to the output of the video camera for converting the analog signal into a binary signal, a digital buffer for storing the binary signal, and logic for analyzing the binary signal at times t1 and t2 to provide information about the contents of the area of interest, the method for determining movement in the area of interest by establishing the 2-dimensional velocity of objects and converting them into a 3-dimensional position and depth map comprising the steps of:
-
a) horizontally scanning the area of interest to create a sequence of video images taken at small distance intervals so as to measure motion parallax or translational movement taking place in the area of interest; and
,b) constructing the depth map by scene analysis using paired even- and odd-symmetric filters that are normalized Gabor filters, that comprise a cosine times a Gaussian and a sine times a Gaussian, the even- and odd-symmetric filters being oriented filters that compute both the smoothed contrast and the corresponding spatial gradient at each spatial position to provide two independent measures of the contrast differences in the area of interest, the paired filters being computed at several orientations and resolutions using a coarse-to-fine matching procedure; wherein said step of scanning the area of interest includes the step of; dividing the area of interest into rectangular windows of attention determined by different object widths wherein closer occluding objects determine a maximum window of attention that can be used to view more distant object; wherein; a) within each window of attention, the location that produces the maximum output of the paired even- and odd-symmetric filters at low resolutions that is above a pre-established threshold is used to determine the width of objects that are used to partition the area of interest into different depth regions with those objects being verified at higher resolution spatial scales; b) constructing the depth map using regions of constant velocity to construct each moving object'"'"'s height and width and changes in the direction of movement relative to the video camera and other objects in the scene to infer the position of the objects in a three-dimensional scene such that the closer an object is to the video camera, the larger the distance transversed by the object in each video image of the scene; and
,c) updating the depth map by minimizing deviations from expected changes in object boundaries at different depths over time. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27)
-
Specification