Image recognition method and image recognition apparatus

US 9,852,159 B2
Filed: 06/15/2010
Issued: 12/26/2017
Est. Priority Date: 06/18/2009
Status: Active Grant

Abstract

An image recognition apparatus is provided which comprises a first extracting means for extracting, from every registration image previously registered, a set of registration partial images of a predetermined size, and a second extracting means for extracting, from an input new image, a set of new partial images of a predetermined size. The apparatus further comprises a discriminating means for discriminating an attribute of the new partial image based on a rule formed by dividing the set of the registration partial images extracted by the first extracting means, and a collecting means for deriving a final recognition result of the new image by collecting discrimination results by the discriminating means at the time when the new partial images as elements of the set of the new partial images are input.
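
As a rough illustration of the two extracting means, the sketch below cuts partial images of a predetermined size out of an image by shifting a window across it. The function name `extract_partial_images`, the `stride` parameter, and the list-of-rows image representation are assumptions made for this example and do not come from the patent.

```python
from typing import List, Tuple

Image = List[List[int]]  # grey-scale image as rows of pixel values (assumed representation)


def extract_partial_images(image: Image, size: int, stride: int = 1) -> List[Tuple[int, int, Image]]:
    """Return (top, left, patch) for every size x size window, shifted by `stride`."""
    height, width = len(image), len(image[0])
    patches = []
    for top in range(0, height - size + 1, stride):
        for left in range(0, width - size + 1, stride):
            # Copy the window so each partial image is independent of the source.
            patch = [row[left:left + size] for row in image[top:top + size]]
            patches.append((top, left, patch))
    return patches


# A 4x4 image yields nine overlapping 2x2 partial images at stride 1.
image = [[r * 4 + c for c in range(4)] for r in range(4)]
patches = extract_partial_images(image, size=2)
```

The same routine could serve for both registration images and a new input image, since the abstract describes both sets as partial images of a predetermined size.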

23 Claims

1. An apparatus comprising:
- a first obtaining unit configured to obtain a target image including a target object which belongs to at least one category of a plurality of categories;
  
  a second obtaining unit configured to obtain a plurality of partial target images from the obtained target image;
  
  a holding unit configured to hold a dictionary in which, for each of a plurality of partial learning images each of which is a part of a learning image for recognizing the target object, a category of the partial learning image and position information of the partial learning image are registered, the category and the position information of the partial learning image being classified based on a comparison result of pixel values obtained from a plurality of pixels in the partial learning image, and the position information representing information on a relative position between the target object and the partial learning image in the learning image;
  
  an acquiring unit configured to acquire a plurality of pixel values, from each of the plurality of partial target images obtained by the second obtaining unit, of positions corresponding to the positions where the plurality of pixel values are obtained from each of the partial learning images;
  
  a comparing unit configured to compare the plurality of pixel values acquired by the acquiring unit to each other;
  
  a third obtaining unit configured to obtain from the dictionary, for each of the plurality of partial target images, a corresponding category and position information of the partial learning image, based on a result of a comparison by the comparing unit;
  
  a voting unit configured to vote, for each of the plurality of partial target images, the result obtained by the third obtaining unit at a position indicated by the position information obtained by the third obtaining unit in a voting surface corresponding to the obtained category of a plurality voting surfaces corresponding to the plurality of categories, and
  
  a recognizing unit configured to recognize a category and a position of the target object included in the target image by collecting a result of voting by the voting unit.
  - 2. The apparatus according to claim 1, wherein the learning image is a computer graphics image.
  - 3. The apparatus according to claim 1, wherein the target image is a grey-scale image.
  - 4. The apparatus according to claim 1, wherein the position information of the partial learning image indicates position information of the partial learning image in the learning image.
  - 5. The apparatus according to claim 1, wherein the dictionary comprises a tree-structured discriminator.
  - 6. The apparatus according to claim 1, wherein the plurality of pixels of which the pixel values are acquired are decided at random.
  - 7. The apparatus according to claim 1, wherein the plurality of partial images overlap each other.
  - 8. The apparatus according to claim 1, wherein the category is an orientation of the target object.
  - 9. The apparatus according to claim 1, wherein the obtained image includes a plurality of the target objects.
  - 10. The apparatus according to claim 1, wherein the plurality of the partial target images are obtained by shifting a predetermined partial image.
  - 11. The apparatus according to claim 1, wherein the recognizing unit recognizes the category and the position of the target object included in the target image based on the distribution of the result of voting by the voting unit.
  - 12. The apparatus according to claim 1, wherein the recognizing unit acquires, for each of the obtained categories, a score of a peak position in the distribution of the result of voting by the voting unit, recognizes one of the obtained categories that corresponds to a voting surface having a peak position with the highest score as the category of the target object, and recognizes the peak position with the highest score as the position of the target object.
  - 13. The apparatus according to claim 1, wherein the voting surface is formed with a multi-dimension table.
  - 14. The apparatus according to claim 13, wherein the voting surface is formed with a two-dimension table having elements corresponding to the respective pixels of the target image.
  - 15. The apparatus according to claim 1, wherein the holding unit holds a plurality of the dictionaries, wherein the third obtaining unit obtains from each of the plurality of dictionaries, for each of the plurality of partial target images, the corresponding category and position information of the partial learning image, and wherein the voting unit votes, for each of the plurality of partial target images, the result obtained by the third obtaining unit at a position indicated by the position information in a voting surface corresponding to the every obtained category.
  - 16. The apparatus according to claim 1, wherein the third obtaining unit obtains the corresponding category and position information of the partial learning image as a set.
  - 17. The apparatus according to claim 1, wherein each of the plurality of pixel values indicates a luminance value.
  - 18. The apparatus according to claim 1, wherein the acquiring unit is configured to acquire, from each of the plurality of partial target images obtained by the second obtaining unit, a plurality of pixel values of positions identical to the positions where the plurality of pixel values are obtained from each of the partial learning images.
  - 19. The apparatus according to claim 1, wherein an acquiring processing by the acquiring unit and a comparing processing by the comparing unit are repeated.
  - 20. The apparatus according to claim 19, wherein the positions where the plurality of pixel values are obtained from each of the partial target images can be changed for each time a process including the acquiring processing and the comparing processing is repeated.
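
The flow of claim 1, together with the peak selection of claim 12 and the two-dimensional voting surface of claim 14, can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the dictionary is a flat lookup table keyed by the comparison results (claim 5 describes a tree-structured discriminator instead), the position information is stored as a hypothetical offset from a partial image's top-left corner to the object centre, and all function names and toy values are invented for the example.

```python
from collections import defaultdict
from typing import List, Tuple

Patch = List[List[int]]
PixelPair = Tuple[Tuple[int, int], Tuple[int, int]]  # two (row, col) positions in a patch


def comparison_code(patch: Patch, pairs: List[PixelPair]) -> Tuple[int, ...]:
    """Compare the pixel values at each pair of positions; one bit per comparison."""
    return tuple(int(patch[r1][c1] > patch[r2][c2]) for (r1, c1), (r2, c2) in pairs)


def build_dictionary(learning_patches, pairs):
    """Register (category, offset) under the comparison code of each learning patch."""
    dictionary = defaultdict(list)
    for patch, category, offset in learning_patches:
        dictionary[comparison_code(patch, pairs)].append((category, offset))
    return dictionary


def recognize(target_patches, dictionary, pairs, height, width, categories):
    """Vote on one 2-D surface per category; return (category, position, score) of the top peak."""
    surfaces = {c: [[0] * width for _ in range(height)] for c in categories}
    for top, left, patch in target_patches:
        for category, (dy, dx) in dictionary.get(comparison_code(patch, pairs), []):
            y, x = top + dy, left + dx  # position indicated by the stored offset (assumed form)
            if 0 <= y < height and 0 <= x < width:
                surfaces[category][y][x] += 1
    best = None
    for category, surface in surfaces.items():
        for y, row in enumerate(surface):
            for x, score in enumerate(row):
                if best is None or score > best[2]:
                    best = (category, (y, x), score)
    return best


# Toy data: the same pixel positions are compared in learning and target patches.
pairs = [((0, 0), (0, 1)), ((0, 0), (1, 0))]
learning = [
    ([[9, 1], [1, 1]], "front", (1, 1)),
    ([[1, 9], [1, 1]], "front", (1, 0)),
    ([[5, 1], [9, 1]], "side", (0, 0)),
]
dictionary = build_dictionary(learning, pairs)
targets = [
    (1, 1, [[9, 1], [1, 1]]),
    (1, 2, [[1, 9], [1, 1]]),
    (0, 0, [[5, 1], [9, 1]]),
]
result = recognize(targets, dictionary, pairs, height=4, width=4, categories=["front", "side"])
```

In this toy run two partial target images vote for the same cell of the "front" surface, so that cell outscores the single "side" vote. Holding several dictionaries built from different pixel positions, as in claim 15, would amount to repeating the lookup per dictionary and accumulating votes on the same surfaces.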

21. A method for recognizing a category and a position of a target object included in an image, comprising:
- holding a dictionary in which, for each of a plurality of partial learning images each of which is part of a learning image for recognizing the target object, a category of the partial learning image and position information of the partial learning image are registered, the category and the position information of the partial learning image being classified based on a comparison result of pixel values obtained from a plurality of pixels in the partial learning image and the position information representing information on a relative position between the target object and the partial learning image in the learning image;
  
  obtaining the target image including the target object which belongs to at least one category of a plurality of categories;
  
  obtaining a plurality of partial target images from the obtained target image;
  
  acquiring a plurality of pixel values, from each of the obtained plurality of partial target images, of positions corresponding to the positions where the plurality of pixel values are obtained from each of the partial learning images;
  
  comparing the plurality of acquired pixel values to each other;
  
  obtaining from the dictionary, for each of the plurality of partial target images, a corresponding category and the position information of the partial learning image, based on a result of the comparison;
  
  voting, for each of the plurality of partial target images, the obtained result at a position indicated by the obtained position information by the in a voting surface corresponding to the obtained category of a plurality voting surfaces corresponding to the plurality of categories, and
  
  recognizing a category and a position of the target object included in the target image by collecting a result of voting.
  - 22. The method according to claim 21, wherein the learning image is a computer graphics image.

23. A non-transitory computer-readable storage medium storing a computer program for causing a computer to execute a method comprising:
- obtaining a target image including a target object which belongs to at least one category of a plurality of categories;
  
  obtaining a plurality of partial target images from the obtained target image;
  
  holding a dictionary in which, for each of a plurality of partial learning images each of which is a part of a learning image for recognizing the target object, a category of the partial learning image and position information of the partial learning image are registered, the category and the position information of the partial learning image being classified based on a comparison result of pixel values obtained from a plurality of pixels in the partial learning image, and the position information representing information on a relative position between the target object and the partial learning image in the learning image;
  
  acquiring a plurality of pixel values, from each of the obtained plurality of partial target images, of positions corresponding to the positions where the plurality of pixel values are obtained from each of the partial learning images;
  
  comparing the plurality of acquired pixel values to each other;
  
  obtaining from the dictionary, for each of the plurality of partial target images, a corresponding category and position information of the partial learning image, based on a result of the comparing;
  
  voting, for each of the plurality of partial target images, the obtained result at a position indicated by the obtained position information in a voting surface corresponding to the obtained category of a plurality voting surfaces corresponding to the plurality of categories, and
  
  recognizing a category and a position of the target object included in the target image by collecting a result of voting.

Current Assignee: Canon Kabushiki Kaisha (Canon Inc.)
Original Assignee: Canon Kabushiki Kaisha (Canon Inc.)
Inventors: Yoshii, Hiroto; Matsugu, Masakazu
Primary Examiner(s): CESE, KENNY A
Application Number: US13/375,448
Publication Number: US 20120076417A1
Time in Patent Office: 2,751 Days
Field of Search: 382/224-228
CPC Class Codes

G06F 16/51   Indexing; Data structures t...

G06F 18/24323   Tree-organised classifiers

G06F 18/254   of classification results, ...

G06F 18/28   Determining representative ...

G06T 2207/20021   Dividing image into blocks,...

G06T 2207/20081   Training; Learning

G06T 7/337   involving reference images ...

G06V 10/50   by performing operations wi...

G06V 10/751   Comparing pixel values or l...

G06V 10/764   using classification, e.g. ...

G06V 10/772   Determining representative ...

G06V 10/809   of classification results, ...
