Object recognition using a cognitive swarm vision framework with attention mechanisms
First Claim
1. An object recognition system incorporating swarming domain classifiers, comprising:
- at least one cognitive map stored in memory having a one-to-one relationship with an input image domain, the cognitive map being capable of recording information in the memory that software agents utilize to focus a cooperative swarm'"'"'s attention on regions in the domain most likely to contain objects of interest;
a plurality of software agents executing on a processor configured to operate as a cooperative swarm to classify an object in the domain, where each agent is a complete classifier and is assigned an initial velocity vector to explore a solution space for object solutions, where each agent is configured to perform at least one iteration as influenced by the recorded information of the cognitive map, the iteration being a search in the solution space for a potential solution optima where each agent keeps track of its coordinates in multi-dimensional space that are associated with an observed best solution (pbest) that the agent has identified, and a global best solution (gbest), where the gbest is used to store the best location among all agents, with each velocity vector thereafter changing towards pbest and gbest, allowing the cooperative swarm to concentrate on the vicinity of the object and classify the object when a classification level exceeds a preset threshold;
the cognitive map is a map selected from a group consisting of a ground surface map, an interest map, an object map, and a saliency map; and
the interest map stored in the memory is configured to run on the processor and maintain a sorted list for gbest and pbest, along with the associated FA values, where FA is an objective function and is calculated according to the following;
FA=μ
(Q+−
Q−
)+(1−
μ
)FC,where Q+ denotes an attracting pheromone and Q−
denotes a repelling pheromone, and where m is a nonnegative weighting factor, and FC is an object classifier confidence value; and
the interest map is updated in the memory at each iteration of the swarm and FA is updated for each entry in the sorted list, whereby the swarm is modified by the interest map in such a way as to focus attention on regions of increased saliency.
1 Assignment
0 Petitions
Accused Products
Abstract
An object recognition system is described that incorporates swarming classifiers with attention mechanisms. The object recognition system includes a cognitive map having a one-to-one relationship with an input image domain. The cognitive map records information that software agents utilize to focus a cooperative swarm'"'"'s attention on regions likely to contain objects of interest. Multiple agents operate as a cooperative swarm to classify an object in the domain. Each agent is a classifier and is assigned a velocity vector to explore a solution space for object solutions. Each agent records its coordinates in multi-dimensional space that are an observed best solution that the agent has identified, and a global best solution that is used to store the best location among all agents. Each velocity vector thereafter changes to allow the swarm to concentrate on the vicinity of the object and classify the object when a classification level exceeds a preset threshold.
29 Citations
30 Claims
-
1. An object recognition system incorporating swarming domain classifiers, comprising:
-
at least one cognitive map stored in memory having a one-to-one relationship with an input image domain, the cognitive map being capable of recording information in the memory that software agents utilize to focus a cooperative swarm'"'"'s attention on regions in the domain most likely to contain objects of interest; a plurality of software agents executing on a processor configured to operate as a cooperative swarm to classify an object in the domain, where each agent is a complete classifier and is assigned an initial velocity vector to explore a solution space for object solutions, where each agent is configured to perform at least one iteration as influenced by the recorded information of the cognitive map, the iteration being a search in the solution space for a potential solution optima where each agent keeps track of its coordinates in multi-dimensional space that are associated with an observed best solution (pbest) that the agent has identified, and a global best solution (gbest), where the gbest is used to store the best location among all agents, with each velocity vector thereafter changing towards pbest and gbest, allowing the cooperative swarm to concentrate on the vicinity of the object and classify the object when a classification level exceeds a preset threshold; the cognitive map is a map selected from a group consisting of a ground surface map, an interest map, an object map, and a saliency map; and the interest map stored in the memory is configured to run on the processor and maintain a sorted list for gbest and pbest, along with the associated FA values, where FA is an objective function and is calculated according to the following;
FA=μ
(Q+−
Q−
)+(1−
μ
)FC,where Q+ denotes an attracting pheromone and Q−
denotes a repelling pheromone, and where m is a nonnegative weighting factor, and FC is an object classifier confidence value; andthe interest map is updated in the memory at each iteration of the swarm and FA is updated for each entry in the sorted list, whereby the swarm is modified by the interest map in such a way as to focus attention on regions of increased saliency. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer program product for object recognition, the computer program product comprising computer-readable instruction means encoded on a computer-readable medium for causing a computer to, when executed, perform the operations of:
- configuring at least one cognitive map to have a one-to-one relationship with an input image domain, the cognitive map being capable of recording information that software agents utilize to focus a cooperative swarm'"'"'s attention on regions in the domain most likely to contain objects of interest;
configuring a plurality of software agents to operate as a cooperative swarm to classify an object in the domain, where each agent is a complete classifier and is assigned an initial velocity vector to explore a solution space for object solutions, where each agent is configured to perform at least one iteration as influenced by the recorded information of the cognitive map, the iteration being a search in the solution space for a potential solution optima where each agent keeps track of its coordinates in multi-dimensional space that are associated with an observed best solution (pbest) that the agent has identified, and a global best solution (gbest) where the gbest is used to store the best location among all agents, with each velocity vector thereafter changing towards pbest and gbest, allowing the cooperative swarm to concentrate on the vicinity of the object and classify the object when a classification level exceeds a preset threshold; and further comprising instruction means for causing a computer to; running and maintaining a sorted list for gbest and pbest, along with the associated FA values, where FA is an objective function and is calculated according to the following;
FA=μ
(Q+−
Q−
)+(1−
μ
)FC,where Q+ denotes an attracting pheromone and Q−
denotes a repelling pheromone, and where m is a nonnegative weighting factor, and FC is an object classifier confidence value; andupdating the cognitive map at each iteration of the swarm and update FA for each entry in the sorted list, whereby the swarm is modified by the cognitive map in such a way as to focus attention on regions of increased saliency. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
- configuring at least one cognitive map to have a one-to-one relationship with an input image domain, the cognitive map being capable of recording information that software agents utilize to focus a cooperative swarm'"'"'s attention on regions in the domain most likely to contain objects of interest;
-
23. A computer implemented method for object recognition using swarming domain classifiers, when executed on the computer comprises the acts of:
-
configuring at least one cognitive map to have a one-to-one relationship with an input image domain, the cognitive map being capable of recording information that software agents utilize to focus a cooperative swarm'"'"'s attention on regions in the domain most likely to contain objects of interest; configuring a plurality of software agents to operate as a cooperative swarm to classify an object in the domain, where each agent is a complete classifier and is assigned an initial velocity vector to explore a solution space for object solutions, where each agent is configured to perform at least one iteration as influenced by the recorded information of the cognitive map, the iteration being a search in the solution space for a potential solution optima where each agent keeps track of its coordinates in multi-dimensional space that are associated with an observed best solution (pbest) that the agent has identified, and a global best solution (gbest), where the gbest is used to store the best location among all agents, with each velocity vector thereafter changing towards pbest and gbest, allowing the cooperative swarm to concentrate on the vicinity of the object and classify the object when a classification level exceeds a preset threshold; and running and maintaining a sorted list for gbest and pbest, along with the associated FA values, where FA is an objective function and is calculated according to the following;
FA=μ
(Q+−
Q−
)+(1−
μ
)FC,where Q+ denotes an attracting pheromone and Q−
denotes a repelling pheromone, and where m is a nonnegative weighting factor, and FC is an object classifier confidence value; andupdating the cognitive map at each iteration of the swarm and updating FA for each entry in the sorted list, whereby the swarm is modified by the cognitive map in such a way as to focus attention on regions of increased saliency. - View Dependent Claims (24, 25, 26, 27, 28, 29)
-
-
30. A computer program product for object recognition, the computer program product comprising computer-readable instruction means encoded on a computer-readable medium for causing a computer to, when executed, perform the operations of:
-
receiving a first current input image; initializing a global swarm to search for objects within the input image; assigning local swarms to objects identified by the global swarm; receiving a next input image, where the next input image is deemed the current input image and a previous current input image is deemed the previous input image; initializing the local swarms to search for and identify objects in the current input image; deleting local swarms that lost their identified objects between the current and previous input images; initializing the global swarm to search for new objects in the current input image; assigning local swarms to new objects identified in the current input image; and repeating the operations of receiving a next input image, initializing, deleting, initializing, and assigning for subsequent next images; and further camp rising instruction means for causing a computer to; running and maintaining a sorted list for gbest and pbest, along with the associated FA values, where FA is an objective function and is calculated according to the following;
FA=μ
(Q+−
Q−
)+(1−
μ
)FC,where Q+ denotes an attracting pheromone and Q−
denotes a repelling pheromone, and where m is a nonnegative weighting factor, and FC is an object classifier confidence value; andupdating a cognitive map at each iteration of the swarm and update FA for each entry in the sorted list, whereby the swarm is modified by the cognitive map in such a way as to focus attention on regions of increased saliency.
-
Specification