Image recognition method and image recognition apparatus
First Claim
1. An apparatus comprising:
- a first obtaining unit configured to obtain a target image including a target object which belongs to at least one category of a plurality of categories;
a second obtaining unit configured to obtain a plurality of partial target images from the obtained target image;
a holding unit configured to hold a dictionary in which, for each of a plurality of partial learning images each of which is a part of a learning image for recognizing the target object, a category of the partial learning image and position information of the partial learning image are registered, the category and the position information of the partial learning image being classified based on a comparison result of pixel values obtained from a plurality of pixels in the partial learning image, and the position information representing information on a relative position between the target object and the partial learning image in the learning image;
an acquiring unit configured to acquire a plurality of pixel values, from each of the plurality of partial target images obtained by the second obtaining unit, of positions corresponding to the positions where the plurality of pixel values are obtained from each of the partial learning images;
a comparing unit configured to compare the plurality of pixel values acquired by the acquiring unit to each other;
a third obtaining unit configured to obtain from the dictionary, for each of the plurality of partial target images, a corresponding category and position information of the partial learning image, based on a result of a comparison by the comparing unit;
a voting unit configured to vote, for each of the plurality of partial target images, the result obtained by the third obtaining unit at a position indicated by the position information obtained by the third obtaining unit in a voting surface corresponding to the obtained category of a plurality voting surfaces corresponding to the plurality of categories, anda recognizing unit configured to recognize a category and a position of the target object included in the target image by collecting a result of voting by the voting unit.
1 Assignment
0 Petitions
Accused Products
Abstract
An image recognition apparatus is provided which comprises a first extracting means for extracting, from every registration image previously registered, a set of registration partial images of a predetermined size, and a second extracting means for extracting, from an input new image, a set of new partial images of a predetermined size. The apparatus further comprises a discriminating means for discriminating an attribute of the new partial image based on a rule formed by dividing the set of the registration partial images extracted by the first extracting means, and a collecting means for deriving a final recognition result of the new image by collecting discrimination results by the discriminating means at the time when the new partial images as elements of the set of the new partial images are input.
-
Citations
23 Claims
-
1. An apparatus comprising:
-
a first obtaining unit configured to obtain a target image including a target object which belongs to at least one category of a plurality of categories; a second obtaining unit configured to obtain a plurality of partial target images from the obtained target image; a holding unit configured to hold a dictionary in which, for each of a plurality of partial learning images each of which is a part of a learning image for recognizing the target object, a category of the partial learning image and position information of the partial learning image are registered, the category and the position information of the partial learning image being classified based on a comparison result of pixel values obtained from a plurality of pixels in the partial learning image, and the position information representing information on a relative position between the target object and the partial learning image in the learning image; an acquiring unit configured to acquire a plurality of pixel values, from each of the plurality of partial target images obtained by the second obtaining unit, of positions corresponding to the positions where the plurality of pixel values are obtained from each of the partial learning images; a comparing unit configured to compare the plurality of pixel values acquired by the acquiring unit to each other; a third obtaining unit configured to obtain from the dictionary, for each of the plurality of partial target images, a corresponding category and position information of the partial learning image, based on a result of a comparison by the comparing unit; a voting unit configured to vote, for each of the plurality of partial target images, the result obtained by the third obtaining unit at a position indicated by the position information obtained by the third obtaining unit in a voting surface corresponding to the obtained category of a plurality voting surfaces corresponding to the plurality of categories, and a recognizing unit configured to recognize a category and a position of the target object included in the target image by collecting a result of voting by the voting unit. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method for recognizing a category and a position of a target object included in an image, comprising:
-
holding a dictionary in which, for each of a plurality of partial learning images each of which is part of a learning image for recognizing the target object, a category of the partial learning image and position information of the partial learning image are registered, the category and the position information of the partial learning image being classified based on a comparison result of pixel values obtained from a plurality of pixels in the partial learning image and the position information representing information on a relative position between the target object and the partial learning image in the learning image; obtaining the target image including the target object which belongs to at least one category of a plurality of categories; obtaining a plurality of partial target images from the obtained target image; acquiring a plurality of pixel values, from each of the obtained plurality of partial target images, of positions corresponding to the positions where the plurality of pixel values are obtained from each of the partial learning images; comparing the plurality of acquired pixel values to each other; obtaining from the dictionary, for each of the plurality of partial target images, a corresponding category and the position information of the partial learning image, based on a result of the comparison; voting, for each of the plurality of partial target images, the obtained result at a position indicated by the obtained position information by the in a voting surface corresponding to the obtained category of a plurality voting surfaces corresponding to the plurality of categories, and recognizing a category and a position of the target object included in the target image by collecting a result of voting. - View Dependent Claims (22)
-
-
23. A non-transitory computer-readable storage medium storing a computer program for causing a computer to execute a method comprising:
-
obtaining a target image including a target object which belongs to at least one category of a plurality of categories; obtaining a plurality of partial target images from the obtained target image; holding a dictionary in which, for each of a plurality of partial learning images each of which is a part of a learning image for recognizing the target object, a category of the partial learning image and position information of the partial learning image are registered, the category and the position information of the partial learning image being classified based on a comparison result of pixel values obtained from a plurality of pixels in the partial learning image, and the position information representing information on a relative position between the target object and the partial learning image in the learning image; acquiring a plurality of pixel values, from each of the obtained plurality of partial target images, of positions corresponding to the positions where the plurality of pixel values are obtained from each of the partial learning images; comparing the plurality of acquired pixel values to each other; obtaining from the dictionary, for each of the plurality of partial target images, a corresponding category and position information of the partial learning image, based on a result of the comparing; voting, for each of the plurality of partial target images, the obtained result at a position indicated by the obtained position information in a voting surface corresponding to the obtained category of a plurality voting surfaces corresponding to the plurality of categories, and recognizing a category and a position of the target object included in the target image by collecting a result of voting.
-
Specification