Apparatus for visual recognition
First Claim
1. A visual recognition apparatus, comprising:
- basic image extraction means for extracting a basic image of objects from each pixel in a two-dimensional gray image of said objects;
line image extraction means for extracting a line image obtained through a spatial differentiation of said extracted basic image;
geometrical feature extraction means for extracting geometrical elements representing geometry of said objects from said extracted line image;
search means for searching said extracted geometrical elements which match a geometrical model composed of model elements representing an object to be recognized, and for getting a candidate position/orientation of said object to be recognized, based on a relative position of said geometrical elements and said model elements; and
verification means for verifying whether or not said candidate position/orientation of object to be recognized obtained by said search means accurately matches said object to be recognized;
the verification means including;
network input-value arithmetic operation means for mapping mesh cells obtained by dividing a region fixed to said geometrical model into some small regions on said basic image or said line image based on said candidate position/orientation, matching positions/orientations of said object to be recognized and the geometrical model to each other and determining a value representing each of said mesh cells in reference to a basic image or a line image in each of said mesh cells; and
a neutral network having input elements, each of said input elements one to one corresponding to each cell in said mesh cells and inputting an output-value of each cell obtained by said network input-value arithmetic operation means, having an output element for outputting a true or false value, and calculating an output-value of said output element from input-values to said input elements.
1 Assignment
0 Petitions
Accused Products
Abstract
A basic image of objects is extracted from a two-dimensional image of objects. Geometrical elements of the objects are extracted from the extracted basic image. The objects to be recognized are identified by searching a combination of the geometrical elements which match a geometrical model and then utilizing candidate position/orientation of the objects to be recognized, said candidate position/orientation being determined from a relationship in relative position between the combination of geometrical elements and the geometrical model. Mesh cells fixed to the geometrical model are mapped on the basic image based on the candidate position/orientation. In addition, verification is made as to whether an image of the geometrical model mapped by the candidate position/orientation is accurately matched with an image of one of the objects to be recognized, through a neural network to which values got from the basic image included in the individual mesh cells are to be applied as input values. Combination weight factors employed in the neural network are learned according to the verified results. It is also possible to recognize the multi-purpose objects according to how to learn the combination weight factors.
105 Citations
8 Claims
-
1. A visual recognition apparatus, comprising:
-
basic image extraction means for extracting a basic image of objects from each pixel in a two-dimensional gray image of said objects; line image extraction means for extracting a line image obtained through a spatial differentiation of said extracted basic image; geometrical feature extraction means for extracting geometrical elements representing geometry of said objects from said extracted line image; search means for searching said extracted geometrical elements which match a geometrical model composed of model elements representing an object to be recognized, and for getting a candidate position/orientation of said object to be recognized, based on a relative position of said geometrical elements and said model elements; and verification means for verifying whether or not said candidate position/orientation of object to be recognized obtained by said search means accurately matches said object to be recognized; the verification means including; network input-value arithmetic operation means for mapping mesh cells obtained by dividing a region fixed to said geometrical model into some small regions on said basic image or said line image based on said candidate position/orientation, matching positions/orientations of said object to be recognized and the geometrical model to each other and determining a value representing each of said mesh cells in reference to a basic image or a line image in each of said mesh cells; and a neutral network having input elements, each of said input elements one to one corresponding to each cell in said mesh cells and inputting an output-value of each cell obtained by said network input-value arithmetic operation means, having an output element for outputting a true or false value, and calculating an output-value of said output element from input-values to said input elements. - View Dependent Claims (4, 6, 7)
-
-
2. A visual recognition apparatus, comprising:
-
basic image extraction means for extracting a basic image of objects from each pixel in a two-dimensional gray image of said objects; line image extraction means for extracting a line image obtained through a spatial differentiation of said extracting basic image; geometrical feature extraction means for extracting geometrical elements representing geometry of said objects from said extracted line image; search means for searching said extracted geometrical elements which match a geometrical model composed of model elements representing an object to be recognized, and for getting a candidate position/orientation of said object to be recognized, based on a relative position of said geometrical elements and said model elements; verification means for verifying whether or not said candidate position/orientation of object to be recognized obtained by said search means accurately matches said object to be recognized; the verification means including; network input-value arithmetic operation means for mapping mesh cells obtained by dividing a region fixed to said geometrical model into some small regions on said basic image or said line image based on said candidate position/orientation, for matching position/orientations of said object to be recognized and the geometrical model to each other and determining a value representing each of said mesh cells in reference to a basic image or a line image in each of said mesh cells; and a neural network having input elements, each of said input elements, one to one corresponding to each cell in said mesh cells and inputting an output-value of each cell obtained by said network input-value arithmetic operation means, having an output element for outputting a true of false value, and calculating an output-value of said output element from input-values to said input elements; and learning means for making an output of each cell of said mesh cells as an Input signal, modifying combination weight factors of said neural network based on said input signal, a verified result of said verification means corresponding to said input signal and a teacher signal for providing an evaluation value representing truth of said verified result and outputting modified combination weight factors to said verification means.
-
-
3. A visual recognition apparatus, comprising:
-
basic image extraction means for extracting a basic image of objects from each pixel in a two-dimensional gray image of said objects; line image extraction means for extracting a line image obtained through a spatial differentiation of said extracted basic image; geometrical modeling means for producing a geometrical model composed of model elements constituted of lines specifying geometry of a model representing an object to be recognized; geometrical feature extraction means for extracting geometrical elements representing geometry of said objects from said extracted line image; search means for searching said extracted geometrical elements which match a geometrical model composed of model elements representing an object to be recognized, and for getting a candidate position/orientation of said object to be recognized, based on a relative position of said geometrical elements and said model elements; verification means for verifying whether or not said candidate position/orientation of object to be recognized obtained by said search means accurately matches said object to be recognized; the verification means including; network input-value arithmetic operation means for mapping mesh cells obtained by dividing a region fixed to said geometrical model into some small regions on said basic image or said line image based on said candidate position/orientation, matching positions/orientations of said object to be recognized and the geometrical model to each other and determining a value representing each of said mesh cells in reference to a basic image or a line image in each of said mesh calls; and a neural network having input elements, each of said input elements one to one corresponding to each cell in said mesh cells and inputting an output-value of each cell obtained by said network input-value arithmetic operation means, having an output element for outputting a true or false value, and calculating an output-value of said output element from input-values to said input elements; and learning means for making an output of each cell of said mesh cells as an input signal, modifying combination weight factors of said neural network based on said input signal, a verified result of said verification means corresponding to said input signal and a teacher signal for providing an evaluation value representing truth of said verified result and outputting modified combination weight factors to said verification means, - View Dependent Claims (5)
-
-
8. A visual recognition apparatus, comprising:
-
basic image extraction means for extracting a basic image of objects from each pixel in a two-dimensional gray image of said objects; geometrical feature extraction means for extracting geometrical elements representing geometry of said objects from said extracted basic image; search means for searching said extracted geometrical elements which match a geometrical model composed of model elements representing an object to be recognized, and for getting a candidate position/orientation of said object to be recognized, based on a relative position of said geometrical elements and said model elements; and verification means for verifying whether or not said candidate position/orientation of object to be recognized obtained by said search means accurately matches said object to be recognized; the verification means including; network input-value arithmetic operation means for mapping mesh cells obtained by dividing a region fixed to said geometrical model into some small regions on said basic image based on said candidate position/orientation, matching positions/orientations of said object to be recognized and the geometrical model to each other and determining a value representing each of said mesh cells in reference to a basic image in each of the said mesh cells; and a neural network having input elements, each of said input elements one to one corresponding to each cell in said mesh cells and inputting an output-value of each cell obtained by said network input-value arithmetic operation means, having an output element for outputting a true or false value, and calculation an output-value of said output element from input-values to said input elements.
-
Specification