USING A MODEL TREE OF GROUP TOKENS TO IDENTIFY AN OBJECT IN AN IMAGE

US 20100121794A1
Filed: 01/22/2010
Published: 05/13/2010
Est. Priority Date: 02/02/2006
Status: Active Grant



Abstract

Object recognition techniques are disclosed that provide both accuracy and speed. One embodiment of the present invention is an identification system. The system is capable of locating objects in images by searching for local features of an object. The system can operate in real-time. The system is trained from a set of images of an object or objects. The system computes interest points in the training images, and then extracts local image features (tokens) around these interest points. The set of tokens from the training images is then used to build a hierarchical model structure. During identification/detection, the system computes interest points from incoming target images. The system matches tokens around these interest points with the tokens in the hierarchical model. Each successfully matched image token votes for an object hypothesis at a certain scale, location, and orientation in the target image. Object hypotheses that receive insufficient votes are rejected.
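
The voting step described in the abstract can be illustrated with a minimal sketch. It assumes each matched token has already been converted into a predicted object pose (x, y, scale, orientation). The names `Hypothesis`, `pose_to_hypothesis`, and `surviving_hypotheses`, the bin sizes, and the vote threshold are hypothetical choices made for illustration and are not taken from the patent.

```python
import math
from collections import Counter
from typing import NamedTuple


class Hypothesis(NamedTuple):
    """One quantized object pose (assumed binning, not specified by the source)."""
    x_bin: int
    y_bin: int
    scale_bin: int
    angle_bin: int


def pose_to_hypothesis(x, y, scale, angle_deg, pos_step=16.0, angle_step=30.0):
    """Quantize the pose predicted by one matched token into a voting bin."""
    return Hypothesis(
        int(x // pos_step),                      # location bin along x
        int(y // pos_step),                      # location bin along y
        round(math.log2(scale)),                 # one bin per octave of scale
        int((angle_deg % 360.0) // angle_step),  # orientation bin
    )


def surviving_hypotheses(predicted_poses, min_votes):
    """Count one vote per matched token and drop hypotheses with too few votes."""
    votes = Counter(pose_to_hypothesis(*pose) for pose in predicted_poses)
    return {hyp: n for hyp, n in votes.items() if n >= min_votes}


# Three matched tokens agree on roughly the same pose; one is an outlier.
poses = [(100, 50, 1.0, 10), (105, 52, 1.1, 12), (110, 60, 0.9, 5), (300, 200, 2.0, 90)]
kept = surviving_hypotheses(poses, min_votes=2)
```

With these inputs the three consistent tokens fall into the same bin and survive, while the single outlier vote is rejected.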


20 Claims

1. A computer-implemented method for identifying a target object in a target image using a plurality of object models organized into a tree data structure, wherein each node of the tree represents an object model, comprising:
- computing one or more interest points in the target image, wherein each interest point represents one pixel, and wherein the one or more interest points in the target image represent a subset of the pixels of the target image;
  
  extracting tokens associated with the interest points, wherein a token associated with an interest point comprises an image feature of an image region surrounding the interest point; and
  
  comparing the extracted tokens with tokens in the tree to identify matches.
- - 2. The method of claim 1, further comprising the preliminary steps of:
    - receiving the target image; and
      
      formatting the target image so that the target image can be processed according to the method.
  - 3. The method of claim 1, wherein comparing the extracted tokens with tokens in the tree comprises comparing the extracted tokens with tokens in an object model of a first node of the tree.
  - 4. The method of claim 3, further comprising determining whether a token match threshold is satisfied.
  - 5. The method of claim 4, further comprising:
    - in response to determining that the token match threshold is satisfied, accepting a hypothesis that the target object satisfies the object model of the first node; and
      
      in response to determining that the token match threshold is not satisfied, rejecting a hypothesis that the target object satisfies the object model of the first node.
  - 6. The method of claim 4, further comprising repeating the computing, extracting, comparing, and determining for a plurality of target images.
  - 7. The method of claim 3, further comprising responsive to the extracted tokens matching tokens in the object model of the first node, comparing the extracted tokens with tokens in an object model of a child node of the first node.
  - 8. The method of claim 1, wherein an object model of a first node of the tree is more specific than an object model of a parent node of the first node.
  - 9. The method of claim 1, wherein each object model includes a list of tokens.
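
As a rough illustration of claims 3 through 9, the sketch below compares extracted tokens with the token list of a node, applies a token match threshold, and descends to child nodes only when the parent model is accepted. The class `ModelNode`, the Euclidean `tokens_match` test, and the `match_fraction` threshold are assumptions introduced for this example; the claims do not specify how tokens are compared or how the threshold is defined.

```python
from dataclasses import dataclass, field


@dataclass
class ModelNode:
    name: str
    tokens: list                                   # each object model holds a list of tokens
    children: list = field(default_factory=list)   # more specific object models


def tokens_match(a, b, max_distance=0.5):
    """Hypothetical similarity test: Euclidean distance between feature vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b)) ** 0.5 <= max_distance


def count_matches(extracted, model_tokens):
    """Number of model tokens that match at least one extracted token."""
    return sum(any(tokens_match(m, e) for e in extracted) for m in model_tokens)


def identify(node, extracted, match_fraction=0.6):
    """Return the names of accepted models from this node down, or [] if rejected."""
    needed = match_fraction * len(node.tokens)     # assumed form of the match threshold
    if count_matches(extracted, node.tokens) < needed:
        return []                                  # hypothesis rejected at this node
    best = []
    for child in node.children:                    # test the more specific models
        path = identify(child, extracted, match_fraction)
        if len(path) > len(best):
            best = path
    return [node.name] + best


root = ModelNode(
    "vehicle",
    [(0.0, 0.0), (1.0, 1.0)],
    [
        ModelNode("car", [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)]),
        ModelNode("truck", [(5.0, 5.0), (6.0, 6.0)]),
    ],
)
extracted = [(0.1, 0.0), (1.0, 0.9), (2.1, 2.0)]
result = identify(root, extracted)
```

Here the generic "vehicle" model is accepted first, after which the "car" child is accepted and the "truck" child is rejected. Rejecting a parent prunes its whole subtree, which is one plausible reason to organize the models as a tree.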

10. A machine-readable storage medium encoded with instructions that, when executed by a processor, cause the processor to perform a method for identifying a target object in a target image using a plurality of object models organized into a tree data structure, wherein each node of the tree represents an object model, comprising:
- computing one or more interest points in the target image, wherein each interest point represents one pixel, and wherein the one or more interest points in the target image represent a subset of the pixels of the target image;
  
  extracting tokens associated with the interest points, wherein a token associated with an interest point comprises an image feature of an image region surrounding the interest point; and
  
  comparing the extracted tokens with tokens in the tree to identify matches.
- - 11. The medium of claim 10, wherein the method further comprises the preliminary steps of:
    - receiving the target image; and
      
      formatting the target image so that the target image can be processed according to the method.
  - 12. The medium of claim 10, wherein comparing the extracted tokens with tokens in the tree comprises comparing the extracted tokens with tokens in an object model of a first node of the tree.
  - 13. The medium of claim 12, wherein the method further comprises determining whether a token match threshold is satisfied.
  - 14. The medium of claim 13, wherein the method further comprises:
    - in response to determining that the token match threshold is satisfied, accepting a hypothesis that the target object satisfies the object model of the first node; and
      
      in response to determining that the token match threshold is not satisfied, rejecting a hypothesis that the target object satisfies the object model of the first node.
  - 15. The medium of claim 13, wherein the method further comprises repeating the computing, extracting, comparing, and determining for a plurality of target images.
  - 16. The medium of claim 12, wherein the method further comprises responsive to the extracted tokens matching tokens in the object model of the first node, comparing the extracted tokens with tokens in an object model of a child node of the first node.
  - 17. The medium of claim 10, wherein an object model of a first node of the tree is more specific than an object model of a parent node of the first node.
  - 18. The medium of claim 10, wherein each object model includes a list of tokens.

19. A system for identifying a target object in a target image using a plurality of object models organized into a tree data structure, wherein each node of the tree represents an object model, comprising:
- a machine-readable storage medium encoded with machine-readable instructions for performing a method, the method comprising:
  
  computing one or more interest points in the target image, wherein each interest point represents one pixel, and wherein the one or more interest points in the target image represent a subset of the pixels of the target image;
  
  extracting tokens associated with the interest points, wherein a token associated with an interest point comprises an image feature of an image region surrounding the interest point; and
  
  comparing the extracted tokens with tokens in the tree to identify matches; and
  
  a processor configured to execute the machine-readable instructions encoded on the machine-readable storage medium.
- - 20. The system of claim 19, wherein comparing the extracted tokens with tokens in the tree comprises comparing the extracted tokens with tokens in an object model of a first node of the tree.
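
The computing and extracting steps recited in the independent claims can be sketched as follows. The claims require only that interest points are a subset of the pixels and that a token is an image feature of the region around a point. The gradient-energy detector, the mean-subtracted patch feature, and the function names `interest_points` and `extract_tokens` are assumptions chosen to keep the example self-contained, not the detector or feature used by the inventors.

```python
def interest_points(image, threshold):
    """Return (row, col) pixels whose central-difference gradient energy exceeds threshold."""
    height, width = len(image), len(image[0])
    points = []
    for r in range(1, height - 1):
        for c in range(1, width - 1):
            gx = image[r][c + 1] - image[r][c - 1]
            gy = image[r + 1][c] - image[r - 1][c]
            if gx * gx + gy * gy > threshold:
                points.append((r, c))
    return points


def extract_tokens(image, points, radius=1):
    """Map each interest point to a token: the mean-subtracted patch around it."""
    height, width = len(image), len(image[0])
    tokens = {}
    for r0, c0 in points:
        # Skip points whose surrounding region would fall outside the image.
        if r0 < radius or c0 < radius or r0 + radius >= height or c0 + radius >= width:
            continue
        patch = [
            image[r][c]
            for r in range(r0 - radius, r0 + radius + 1)
            for c in range(c0 - radius, c0 + radius + 1)
        ]
        mean = sum(patch) / len(patch)
        tokens[(r0, c0)] = [value - mean for value in patch]
    return tokens


# A 5x5 image with a single bright pixel in the centre.
image = [[0] * 5 for _ in range(5)]
image[2][2] = 10
points = interest_points(image, threshold=50)
tokens = extract_tokens(image, points)
```

In this toy image only the four neighbours of the bright pixel are selected, so the interest points are a small subset of the 25 pixels, and each token has nine entries describing the 3x3 region around its point.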


Current Assignee
Honda Motor Co., Ltd. (Honda Motor Company)
Original Assignee
Honda Motor Co., Ltd. (Honda Motor Company)
Inventors
Sharon, Yoav; Heisele, Bernd

Granted Patent

US 8,676,733 B2
Field of Search
US Class Current

706/13
CPC Class Codes

G06F 18/231 Hierarchical techniques, i....
