Visual attention and object recognition system
First Claim
1. A vision system for object recognition, comprising:
- one or more processors and a memory, the memory having instructions encoded thereon to include;
an attention module configured to receive an image representing a scene with an object in the scene and find and extract the object from the image as an extracted object, the attention module also being configured to generate feature vectors corresponding to color, intensity, and orientation information within the extracted object; and
an object recognition module configured to receive the extracted object and the feature vectors and associate a label with the extracted object to classify the object, whereby a user can use the vision system to classify an object in a scene; and
wherein the attention module is further configured to;
receive an image that includes a representation of an object in a scene, the image having color features;
determine light and dark intensity channels from the color features;
create four fully-saturated color channels from the color features;
compute feature opponency maps from the light and dark intensity channels and the four fully-saturated color channels;
compute an edge map for each opponency map;
segment the scene into a series of “
proto-objects”
based on the edge maps, where boundaries of the proto-objects are defined by common features between immediate regions within the image;
compute a saliency of a given proto-object using color and intensity information contained within the image;
rank the proto-objects according to saliency;
designate the proto-object with the highest saliency as the object to be extracted from the image; and
extract the object from the image.
1 Assignment
0 Petitions
Accused Products
Abstract
Described is a bio-inspired vision system for object recognition. The system comprises an attention module, an object recognition module, and an online labeling module. The attention module is configured to receive an image representing a scene and find and extract an object from the image. The attention module is also configured to generate feature vectors corresponding to color, intensity, and orientation information within the extracted object. The object recognition module is configured to receive the extracted object and the feature vectors and associate a label with the extracted object. Finally, the online labeling module is configured to alert a user if the extracted object is an unknown object so that it can be labeled.
84 Citations
9 Claims
-
1. A vision system for object recognition, comprising:
one or more processors and a memory, the memory having instructions encoded thereon to include; an attention module configured to receive an image representing a scene with an object in the scene and find and extract the object from the image as an extracted object, the attention module also being configured to generate feature vectors corresponding to color, intensity, and orientation information within the extracted object; and an object recognition module configured to receive the extracted object and the feature vectors and associate a label with the extracted object to classify the object, whereby a user can use the vision system to classify an object in a scene; and wherein the attention module is further configured to; receive an image that includes a representation of an object in a scene, the image having color features; determine light and dark intensity channels from the color features; create four fully-saturated color channels from the color features; compute feature opponency maps from the light and dark intensity channels and the four fully-saturated color channels; compute an edge map for each opponency map; segment the scene into a series of “
proto-objects”
based on the edge maps, where boundaries of the proto-objects are defined by common features between immediate regions within the image;compute a saliency of a given proto-object using color and intensity information contained within the image; rank the proto-objects according to saliency; designate the proto-object with the highest saliency as the object to be extracted from the image; and extract the object from the image. - View Dependent Claims (2)
-
3. A vision system for object recognition, comprising:
one or more processors and a memory, the memory having instructions encoded thereon to include; an attention module configured to receive an image representing a scene with an object in the scene and find and extract the object from the image as an extracted object, the attention module also being configured to generate feature vectors corresponding to color, intensity, and orientation information within the extracted object; and an object recognition module configured to receive the extracted object and the feature vectors and associate a label with the extracted object to classify the object, whereby a user can use the vision system to classify an object in a scene; wherein the object recognition module is further configured to; rotate and rescale the object to an invariant representation utilizing a filter; extract simple shape features from the image utilizing a Log-Gabor filter; extract high-level features from the simple shape features utilizing a spatial pyramid matching technique; perform a coarse classification utilizing a k-Nearest Neighbor technique; perform a fine classification to generate an object label utilizing a Support Vector Machine; and output the object label.
-
4. A computer program product for recognizing an object, the computer program product comprising computer-readable instruction means stored on a non-transitory computer-readable medium that are executable by a computer for causing the computer to:
-
receive an image representing a scene with an object in the scene; find and extract the object from the image as an extracted object; generate feature vectors corresponding to color intensity, and orientation information within the extracted object; associate a label with the extracted object to classify the object, whereby a user can use the computer to classify an object in a scene; receive an image that includes a representation of an object in a scene, the image having color features; determine light and dark intensity channels from the color features; create four fully-saturated color channels from the color features; compute feature opponency maps from the light and dark intensity channels and the four fully-saturated color channels; compute an edge map for each opponency map; segment the scene into a series of “
proto-objects”
based on the edge maps, where boundaries of the prow-objects are defined by common features between immediate regions within the image;compute a saliency of a given proto-object using color and intensity information contained within the image; rank the proto-objects according to saliency; designate the proto-object with the highest saliency as the object to be extracted from the image; and extract the object from the image. - View Dependent Claims (5)
-
-
6. A computer program product for recognizing an object, the computer program product comprising computer-readable instruction means stored on a non-transitory computer-readable medium that are executable by a computer for causing the computer to:
-
receive an image representing a scene with an object in the scene; find and extract the object from the image as an extracted object; generate feature vectors corresponding to color, intensity, and orientation information within the extracted object; associate a label with the extracted object to classify the object, whereby a user can use the computer to classify an object in a scene; rotate and rescale the object to an invariant representation utilizing a filter; extract simple shape features from the image utilizing a Log-Gabor filter; extract high-level features from the simple shape features utilizing a spatial pyramid matching technique; perform a coarse classification utilizing a k-Nearest Neighbor technique; perform a fine classification to generate an object label utilizing a Support Vector Machine; and output the object label.
-
-
7. A method for recognizing an object, the method comprising acts of:
-
receiving an image representing a scene with an object in the scene; finding and extracting the object from the image as an extracted object; generating feature vectors corresponding to color, intensity, and orientation information within the extracted object; and associating a label with the extracted object to classify the object, whereby a user can use the computer to classify an object in a scene; receiving an image that includes a representation of an object in a scene, the image having color features; determining light and dark intensity channels from the color features; creating four fully-saturated color channels from the color features; computing feature opponency maps from the light and dark intensity channels and the four fully-saturated color channels; computing an edge map for each opponency map; segmenting the scene into a series of “
proto-objects”
based on the edge maps, where boundaries of the proto-objects are defined by common features between immediate regions within the image;computing a saliency of a given proto-object using color and intensity information contained within the image; ranking the proto-objects according to saliency; designating the proto-object with the highest saliency as the object to be extracted from the image; and extracting the object from the image. - View Dependent Claims (8)
-
-
9. A method for recognizing an object, the method comprising acts of:
-
receiving an image representing a scene with an object in the scene; finding and extracting the object from the image as an extracted object; generating feature vectors corresponding to color, intensity, and orientation information within the extracted object; and associating a label with the extracted object to classify the object, whereby a user can use the computer to classify an object in a scene; rotating and resealing the object to an invariant representation utilizing a filter; extracting simple shape features from the image utilizing a Log-Gabor filter; extracting high-level features from the simple shape features utilizing a spatial pyramid matching technique; performing a coarse classification utilizing a k-Nearest Neighbor technique; performing a fine classification to generate an object label utilizing a Support Vector Machine; and outputting the object label.
-
Specification