System for detecting an object of interest in a scene
First Claim
Patent Images
1. An system for detecting an object of interest in a scene, comprising:
- one or more processors and a memory, the memory having executable instructions encoded thereon, such that upon execution of the instructions, the one or more processors perform operations of;
receiving an image frame of a scene;
extracting features from the image frame, the features being descriptors, wherein the features are extracted from the image frame by performing operations of;
creating three threads, one for each of three independent scales;
providing, for each thread, a link to the image frame and a set of running parameters;
running each thread, in parallel, to identify descriptors in each of the three threads;
upon completion of the three threads, compiling the results from each of the three threads into a full set of dense Scale Invariant Feature Transform (SIFT) descriptors (DSIFT);
quantizing the DSIFT descriptors to generate a pyramid histogram of visual word (PHOW) features;
implementing a sliding window protocol to slide a window over the image and analyze PHOW features that fall inside the window; and
determining if the PHOW features represent the object of interest and, if so, then designating the window as a location in the image with a detected object of interest.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention relates to a system for detecting an object of interest in a scene. The system operates by receiving an image frame of a scene and extracting features from the image frame, the features being descriptors. The descriptors are quantized to generate PHOW features. A sliding window protocol is implemented to slide a window over the image and analyze the PHOW features that fall inside the window. Finally, the system determines if the PHOW features represent the object of interest and, if so, then designates the window as a location in the image with a detected object of interest.
13 Citations
18 Claims
-
1. An system for detecting an object of interest in a scene, comprising:
one or more processors and a memory, the memory having executable instructions encoded thereon, such that upon execution of the instructions, the one or more processors perform operations of; receiving an image frame of a scene; extracting features from the image frame, the features being descriptors, wherein the features are extracted from the image frame by performing operations of; creating three threads, one for each of three independent scales; providing, for each thread, a link to the image frame and a set of running parameters; running each thread, in parallel, to identify descriptors in each of the three threads; upon completion of the three threads, compiling the results from each of the three threads into a full set of dense Scale Invariant Feature Transform (SIFT) descriptors (DSIFT); quantizing the DSIFT descriptors to generate a pyramid histogram of visual word (PHOW) features; implementing a sliding window protocol to slide a window over the image and analyze PHOW features that fall inside the window; and determining if the PHOW features represent the object of interest and, if so, then designating the window as a location in the image with a detected object of interest. - View Dependent Claims (2, 3, 4, 5, 6)
-
7. A computer program product for detecting an object of interest in a scene, the computer program product comprising:
a non-transitory computer-readable medium having executable instructions encoded thereon, such that upon execution of the instructions by one or more processors, the one or more processors perform operations of; receiving an image frame of a scene; extracting features from the image frame, the features being descriptors, wherein the features are extracted from the image frame by performing operations of; creating three threads, one for each of three independent scales; providing, for each thread, a link to the image frame and a set of running parameters; running each thread, in parallel, to identify descriptors in each of the three threads; upon completion of the three threads, compiling the results from each of the three threads into a full set of dense Scale Invariant Feature Transform (SIFT) descriptors (DSIFT); quantizing the DSIFT descriptors to generate a pyramid histogram of visual word (PHOW) features; implementing a sliding window protocol to slide a window over the image and analyze PHOW features that fall inside the window; and determining if the PHOW features represent the object of interest and, if so, then designating the window as a location in the image with a detected object of interest. - View Dependent Claims (8, 9, 10, 11, 12)
-
13. A computer implemented method for detecting an object of interest in a scene, the method comprising an act of causing one or more processors to execute instructions encoded on a non-transitory computer-readable medium, such that upon execution of the instructions, the one or more processors perform operations of:
-
receiving an image frame of a scene; extracting features from the image frame, the features being descriptors, wherein the features are extracted from the image frame by performing operations of; creating three threads, one for each of three independent scales; providing, for each thread, a link to the image frame and a set of running parameters; running each thread, in parallel, to identify descriptors in each of the three threads; upon completion of the three threads, compiling the results from each of the three threads into a full set of dense Scale Invariant Feature Transform (SIFT) descriptors (DSIFT); quantizing the DSIFT descriptors to generate a pyramid histogram of visual word (PHOW) features; implementing a sliding window protocol to slide a window over the image and analyze PHOW features that fall inside the window; and determining if the PHOW features represent the object of interest and, if so, then designating the window as a location in the image with a detected object of interest. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification