×

Multi-view cognitive swarm for object recognition and 3D tracking

  • US 7,558,762 B2
  • Filed: 03/20/2006
  • Issued: 07/07/2009
  • Est. Priority Date: 08/14/2004
  • Status: Expired due to Fees
First Claim
Patent Images

1. A multi-view object recognition system incorporating swarming domain classifiers, comprising:

  • a processor having a plurality of software agents configured to operate as a cooperative swarm to classify an object in a domain as seen from multiple view points, where each agent is a complete classifier and is assigned an initial velocity vector to explore a solution space for object solutions, where each agent is configured to perform at least one iteration, the iteration being a search in the solution space for a potential solution optima where each agent keeps track of its coordinates in multi-dimensional space that are associated with an observed best solution (pbest) that the agent has identified, and a global best solution (gbest) where the gbest is used to store the best location among all agents, with each velocity vector thereafter changing towards pbest and gbest, allowing the cooperative swarm to concentrate on the vicinity of the object and classify the object when a classification level exceeds a preset threshold;

    wherein the agents are configured to search for the object in three-dimensional (3D) spatial coordinates, such that the object is a 3D object and the 3D object has distinct appearances from each view point in the multiple view points; and

    wherein the distinct appearances of the 3D object from the multiple view points are linked by agents searching for the 3D object in the spatial coordinates, such that each agent has an associated 3D location X and an object height h, and wherein each of the multiple view points is provided as a 2D image from a calibrated camera having a given geometry, such that given the known geometry of the calibrated cameras, a 2D location, [x,y]T

    (X), of an agent'"'"'s projection in each view (2D image) is calculated and used to select an image window that is sent to a classifier having a classifier output that corresponds to the classifier'"'"'s confidence that the image window contains the object, where superscript T denotes transpose.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×