USING A MODEL TREE OF GROUP TOKENS TO IDENTIFY AN OBJECT IN AN IMAGE
First Claim
1. A computer-implemented method for identifying a target object in a target image using a plurality of object models organized into a tree data structure, wherein each node of the tree represents an object model, comprising:
- computing one or more interest points in the target image, wherein each interest point represents one pixel, and wherein the one or more interest points in the target image represent a subset of the pixels of the target image;
extracting tokens associated with the interest points, wherein a token associated with an interest point comprises an image feature of an image region surrounding the interest point; and
comparing the extracted tokens with tokens in the tree to identify matches.
0 Assignments
0 Petitions
Accused Products
Abstract
Object recognition techniques are disclosed that provide both accuracy and speed. One embodiment of the present invention is an identification system. The system is capable of locating objects in images by searching for local features of an object. The system can operate in real-time. The system is trained from a set of images of an object or objects. The system computes interest points in the training images, and then extracts local image features (tokens) around these interest points. The set of tokens from the training images is then used to build a hierarchical model structure. During identification/detection, the system computes interest points from incoming target images. The system matches tokens around these interest points with the tokens in the hierarchical model. Each successfully matched image token votes for an object hypothesis at a certain scale, location, and orientation in the target image. Object hypotheses that receive insufficient votes are rejected.
-
Citations
20 Claims
-
1. A computer-implemented method for identifying a target object in a target image using a plurality of object models organized into a tree data structure, wherein each node of the tree represents an object model, comprising:
-
computing one or more interest points in the target image, wherein each interest point represents one pixel, and wherein the one or more interest points in the target image represent a subset of the pixels of the target image; extracting tokens associated with the interest points, wherein a token associated with an interest point comprises an image feature of an image region surrounding the interest point; and comparing the extracted tokens with tokens in the tree to identify matches. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A machine-readable storage medium encoded with instructions that, when executed by a processor, cause the processor to perform a method for identifying a target object in a target image using a plurality of object models organized into a tree data structure, wherein each node of the tree represents an object model, comprising:
-
computing one or more interest points in the target image, wherein each interest point represents one pixel, and wherein the one or more interest points in the target image represent a subset of the pixels of the target image; extracting tokens associated with the interest points, wherein a token associated with an interest point comprises an image feature of an image region surrounding the interest point; and comparing the extracted tokens with tokens in the tree to identify matches. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A system for identifying a target object in a target image using a plurality of object models organized into a tree data structure, wherein each node of the tree represents an object model, comprising:
-
a machine-readable storage medium encoded with machine-readable instructions for performing a method, the method comprising; computing one or more interest points in the target image, wherein each interest point represents one pixel, and wherein the one or more interest points in the target image represent a subset of the pixels of the target image; extracting tokens associated with the interest points, wherein a token associated with an interest point comprises an image feature of an image region surrounding the interest point; and comparing the extracted tokens with tokens in the tree to identify matches; and a processor configured to execute the machine-readable instructions encoded on the machine-readable storage medium. - View Dependent Claims (20)
-
Specification