Rapid object detection by combining structural information from image segmentation with bio-inspired attentional mechanisms
First Claim
1. A system for rapid object detection, the system comprising:
- one or more processors and a non-transitory memory having instructions encoded thereon such that when the instructions are executed, the one or more processors perform operations of;
oversegmenting an input image into a set of superpixels, each superpixel comprising a plurality of pixels;
determining, for each superpixel, a bounding box defining a region of the input image representing a detection hypothesis;
calculating, for each superpixel, an average residual saliency (ARS) for the plurality of pixels belonging to the superpixel;
eliminating each detection hypothesis that is out of a range of a predetermined threshold value for object size, resulting in a first set of remaining detection hypotheses;
eliminating each remaining detection hypothesis in the first set of remaining detection hypotheses having an ARS below a predetermined threshold value, resulting in a second set of remaining detection hypotheses;
calculating, for each remaining detection hypothesis in the second set of remaining detection hypotheses, color contrast for the region defined by the bounding box, and eliminating each detection hypothesis in the second set of remaining detection hypotheses having a color contrast below a predetermined threshold value, resulting in a third set of remaining detection hypotheses; and
outputting the third set of remaining detection hypotheses to a classifier for object recognition.
2 Assignments
0 Petitions
Accused Products
Abstract
Described is a system for rapid object detection combining structural information with bio-inspired attentional mechanisms. The system oversegments an input image into a set of superpixels, where each superpixel comprises a plurality of pixels. For each superpixel, a bounding box defining a region of the input image representing a detection hypothesis is determined. An average residual saliency (ARS) is calculated for the plurality of pixels belonging to each superpixel. Each detection hypothesis that is out of a range of a predetermined threshold value for object size is eliminated. Next, each remaining detection hypothesis having an ARS below a predetermined threshold value is eliminated. Then, color contrast is calculated for the region defined by the bounding box for each remaining detection hypothesis. Each detection hypothesis having a color contrast below a predetermined threshold is eliminated. Finally, the remaining detection hypotheses are output to a classifier for object recognition.
-
Citations
12 Claims
-
1. A system for rapid object detection, the system comprising:
-
one or more processors and a non-transitory memory having instructions encoded thereon such that when the instructions are executed, the one or more processors perform operations of; oversegmenting an input image into a set of superpixels, each superpixel comprising a plurality of pixels; determining, for each superpixel, a bounding box defining a region of the input image representing a detection hypothesis; calculating, for each superpixel, an average residual saliency (ARS) for the plurality of pixels belonging to the superpixel; eliminating each detection hypothesis that is out of a range of a predetermined threshold value for object size, resulting in a first set of remaining detection hypotheses; eliminating each remaining detection hypothesis in the first set of remaining detection hypotheses having an ARS below a predetermined threshold value, resulting in a second set of remaining detection hypotheses; calculating, for each remaining detection hypothesis in the second set of remaining detection hypotheses, color contrast for the region defined by the bounding box, and eliminating each detection hypothesis in the second set of remaining detection hypotheses having a color contrast below a predetermined threshold value, resulting in a third set of remaining detection hypotheses; and outputting the third set of remaining detection hypotheses to a classifier for object recognition. - View Dependent Claims (2, 3, 4)
-
-
5. A computer-implemented method for rapid object detection, comprising:
-
an act of causing a data processor to execute instructions stored on a non-transitory memory such that upon execution, the data processor performs operations of; oversegmenting an input image into a set of superpixels, each superpixel comprising a plurality of pixels; determining, for each superpixel, a bounding box defining a region of the input image representing a detection hypothesis; calculating, for each superpixel, an average residual saliency (ARS) for the plurality of pixels belonging to the superpixel; eliminating each detection hypothesis that is out of a range of a predetermined threshold value for object size, resulting in a first set of remaining detection hypotheses; eliminating each remaining detection hypothesis in the first set of remaining detection hypotheses having an ARS below a predetermined threshold value, resulting in a second set of remaining detection hypotheses; calculating, for each remaining detection hypothesis in the second set of remaining detection hypotheses, color contrast for the region defined by the bounding box, and eliminating each detection hypothesis in the second set of remaining detection hypotheses having a color contrast below a predetermined threshold value, resulting in a third set of remaining detection hypotheses; and outputting the third set of remaining detection hypotheses to a classifier for object recognition. - View Dependent Claims (6, 7, 8)
-
-
9. A computer program product for rapid object detection, the computer program product comprising computer-readable instructions stored on a non-transitory computer-readable medium that are executable by a computer having a processor for causing the processor to perform operations of:
-
oversegmenting an input image into a set of superpixels, each superpixel comprising a plurality of pixels; determining, for each superpixel, a bounding box defining a region of the input image representing a detection hypothesis; calculating, for each superpixel, an average residual saliency (ARS) for the plurality of pixels belonging to the superpixel; eliminating each detection hypothesis that is out of a range of a predetermined threshold value for object size, resulting in a first set of remaining detection hypotheses; eliminating each remaining detection hypothesis in the first set of remaining detection hypotheses having an ARS below a predetermined threshold value, resulting in a second set of remaining detection hypotheses; calculating, for each remaining detection hypothesis in the second set of remaining detection hypotheses, color contrast for the region defined by the bounding box, and eliminating each detection hypothesis in the second set of remaining detection hypotheses having a color contrast below a predetermined threshold value, resulting in a third set of remaining detection hypotheses; and outputting the third set of remaining detection hypotheses to a classifier for object recognition. - View Dependent Claims (10, 11, 12)
-
Specification