System and method for object recognition
First Claim
1. A method for recognizing an object in an image comprising the steps of:
- (a) acquiring in electronic memory an image of the model object;
(b) transforming the image of the model object into a multi-level representation consistent with a recursive subdivision of the search space, said multi-level representation including at least the original image;
(c) generating at least one precomputed model of the object for each level of discretization of the search space, said precomputed model consisting of a plurality of points with corresponding direction vectors, said points and direction vectors being generated by an image processing operation that returns a direction vector for at least each model point;
(d) acquiring in electronic memory a current image;
(e) transforming the current image into a multi-level representation consistent with a recursive subdivision of the search space, said multi-level representation including at least the original image;
(f) performing an image processing operation on each transformed image of the multi-level representation that returns a direction vector for a subset of points within said image that corresponds to the range of translations for which the model should be searched;
(g) computing a match metric that uses the direction information of the model and the transformed image for all possible poses of the model in the coarsest discretization level of the search space;
(h) determining those model poses whose match metric exceeds a user-selectable threshold and whose match metric is locally maximal, and generating a list of instances of the model in the coarsest discretization level of the search space from said poses and said match metrics;
(i) tracking said instances of the model in the coarsest discretization level of the search space through the recursive subdivision of the search space until the finest level of discretization is reached; and
(j) providing the pose of the instances of the objects on the finest level of discretization.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for recognizing a user-defined model object within an image is provided, which is invariant to occlusion (i.e., the object to be found is only partially visible), clutter (i.e., there may be other objects in the image, even within the model object), non-linear illumination changes, and global or local contrast reversals. The object to be found may have been distorted when compared to the user-defined model by geometric transformations of a certain class, e.g., translations, rigid transformations (translation and rotation), similarity transformations (translation, rotation, and uniform scaling), or arbitrary affine transformations.
-
Citations
21 Claims
-
1. A method for recognizing an object in an image comprising the steps of:
-
(a) acquiring in electronic memory an image of the model object;
(b) transforming the image of the model object into a multi-level representation consistent with a recursive subdivision of the search space, said multi-level representation including at least the original image;
(c) generating at least one precomputed model of the object for each level of discretization of the search space, said precomputed model consisting of a plurality of points with corresponding direction vectors, said points and direction vectors being generated by an image processing operation that returns a direction vector for at least each model point;
(d) acquiring in electronic memory a current image;
(e) transforming the current image into a multi-level representation consistent with a recursive subdivision of the search space, said multi-level representation including at least the original image;
(f) performing an image processing operation on each transformed image of the multi-level representation that returns a direction vector for a subset of points within said image that corresponds to the range of translations for which the model should be searched;
(g) computing a match metric that uses the direction information of the model and the transformed image for all possible poses of the model in the coarsest discretization level of the search space;
(h) determining those model poses whose match metric exceeds a user-selectable threshold and whose match metric is locally maximal, and generating a list of instances of the model in the coarsest discretization level of the search space from said poses and said match metrics;
(i) tracking said instances of the model in the coarsest discretization level of the search space through the recursive subdivision of the search space until the finest level of discretization is reached; and
(j) providing the pose of the instances of the objects on the finest level of discretization. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 21)
-
-
19. A system for recognizing an object in an image comprising the steps of:
-
(a) means for acquiring in electronic memory an image of the model object;
(b) means for transforming the image of the model object into a multi-level representation consistent with a recursive subdivision of the search space, said multi-level representation including at least the original image;
(c) means for generating at least one precomputed model of the object for each level of discretization of the search space, said precomputed model consisting of a plurality of points with corresponding direction vectors, said points and direction vectors being generated by an image processing operation that returns a direction vector for at least each model point;
(d) means for acquiring in electronic memory a current image;
(e) means for transforming the current image into a multi-level representation consistent with a recursive subdivision of the search space, said multi-level representation including at least the original image;
(f) means for performing an image processing operation on each transformed image of the multi-level representation that returns a direction vector for a subset of points within said image that corresponds to the range of translations for which the model should be searched;
(g) means for computing a match metric that uses the direction information of the model and the transformed image for all possible poses of the model in the coarsest discretization level of the search space;
(h) means for determining those model poses whose match metric exceeds a user-selectable threshold and whose match metric is locally maximal, and generating a list of instances of the model in the coarsest discretization level of the search space from said poses and said match metrics;
(i) means for tracking said instances of the model in the coarsest discretization level of the search space through the recursive subdivision of the search space until the finest level of discretization is reached; and
(j) providing the pose of the instances of the objects on the finest level of discretization.
-
Specification