Hierarchical system for object recognition in images
First Claim
1. A method for identifying objects in images, comprising:
- computing one or more interest points in each of a plurality of training images including one or more objects;
extracting tokens associated with the interest points;
comparing tokens of training image pairs to find matched tokens, and grouping the matched tokens into groups;
computing a group token to represent each group; and
building a model tree using the group tokens, where each node of the tree represents an object model.
1 Assignment
0 Petitions
Accused Products
Abstract
Object recognition techniques are disclosed that provide both accuracy and speed. One embodiment of the present invention is an identification system. The system is capable of locating objects in images by searching for local features of an object. The system can operate in real-time. The system is trained from a set of images of an object or objects. The system computes interest points in the training images, and then extracts local image features (tokens) around these interest points. The set of tokens from the training images is then used to build a hierarchical model structure. During identification/detection, the system, computes interest points from incoming target images. The system matches tokens around these interest points with the tokens in the hierarchical model. Each successfully matched image token votes for an object hypothesis at a certain scale, location, and orientation in the target image. Object hypotheses that receive insufficient votes are rejected.
41 Citations
20 Claims
-
1. A method for identifying objects in images, comprising:
-
computing one or more interest points in each of a plurality of training images including one or more objects;
extracting tokens associated with the interest points;
comparing tokens of training image pairs to find matched tokens, and grouping the matched tokens into groups;
computing a group token to represent each group; and
building a model tree using the group tokens, where each node of the tree represents an object model. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A machine-readable medium encoded with instructions, that when executed by a processor, cause the processor to carry out a process for identifying objects in images, comprising:
-
computing one or more interest points in each of a plurality of training images including one or more objects;
extracting tokens associated with the interest points;
comparing tokens of training image pairs to find matched tokens, and grouping the matched tokens into groups;
computing a group token to represent each group; and
building a model tree using the group tokens, where each node of the tree represents an object model. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A system for identifying objects in images, comprising:
-
an interest point locator module for computing one or more interest points in each of a plurality of training images including one or more objects;
a token extraction module for extracting tokens associated with the interest points;
a token grouping module for comparing tokens of training image pairs to find matched tokens, grouping the matched tokens into groups, and computing a group token to represent each group; and
a model tree builder module for building a model tree using the group tokens, where each node of the tree represents an object model. - View Dependent Claims (16, 17, 18)
-
-
19. A system for identifying objects in images, comprising:
-
a means for computing one or more interest points in each of a plurality of training images including one or more objects;
a means for extracting tokens associated with the interest points;
a means for comparing tokens of training image pairs to find matched tokens, grouping the matched tokens into groups, and computing a group token to represent each group; and
a means for building a model tree using the group tokens, where each node of the tree represents an object model. - View Dependent Claims (20)
-
Specification