Method, apparatus, and computer program product for object tracking
First Claim
Patent Images
1. A method comprising:
- identifying a first interest point;
receiving a first video frame;
defining a neighborhood for the first interest point;
detecting, via a processor, a second interest point in the neighborhood of the first video frame using a scale space image pyramid, the neighborhood being a portion of the scale space image pyramid;
matching the second interest point with the first interest point;
determining a motion estimation based on the matched interest points;
generating a feature descriptor based at least on the second interest point;
identifying a type of an object associated with the feature descriptor;
augmenting the first video frame to include a label identifying the type of the object, the label being positioned within the first video frame based on the matched interest points;
tracking the object in a subsequent video frame and within the neighborhood without recalculating the feature descriptor for the subsequent video frame; and
augmenting the subsequent video frame such that the position of the label is updated with respect to the first video frame.
8 Assignments
0 Petitions
Accused Products
Abstract
A method for object tracking is provided. The method may include identifying a first interest point, receiving a video frame, and detecting, via a processor, a second interest point in the video frame using a scale space image pyramid. The method may further include matching the second interest point with the first interest point, and determining a motion estimation based on the matched interest points. Similar apparatuses and computer program products are also provided.
-
Citations
16 Claims
-
1. A method comprising:
-
identifying a first interest point; receiving a first video frame; defining a neighborhood for the first interest point; detecting, via a processor, a second interest point in the neighborhood of the first video frame using a scale space image pyramid, the neighborhood being a portion of the scale space image pyramid; matching the second interest point with the first interest point; determining a motion estimation based on the matched interest points; generating a feature descriptor based at least on the second interest point; identifying a type of an object associated with the feature descriptor; augmenting the first video frame to include a label identifying the type of the object, the label being positioned within the first video frame based on the matched interest points; tracking the object in a subsequent video frame and within the neighborhood without recalculating the feature descriptor for the subsequent video frame; and augmenting the subsequent video frame such that the position of the label is updated with respect to the first video frame. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An apparatus comprising a processor and a memory storing executable instructions that, in response to execution by the processor, cause the apparatus to at least:
-
identify a first interest point; receive a first video frame; define a neighborhood for the first interest point; detect a second interest point in the neighborhood of the first video frame using a scale space image pyramid, the neighborhood being a portion of the scale space image pyramid; match the second interest point with the first interest point; determine a motion estimation based on the matched interest points; generate a feature descriptor based at least on the second interest point; identify a type of an object associated with the feature descriptor; augment the first video frame to include a label identifying the type of the object, the label being positioned within the first video frame based on the matched interest points; track the object in a subsequent video frame and without recalculating the feature descriptor for the subsequent video frame; and augment the subsequent video frame such that the position of the label is updated with respect to the first video frame. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A computer program product comprising at least one non-transitory computer-readable storage medium having executable computer-readable program code instructions stored therein, the computer-readable program code instructions configured to:
-
identify a first interest point; receive a first video frame; define a neighborhood for the first interest point; detect a second interest point in the neighborhood of the first video frame using a scale space image pyramid, the neighborhood being a portion of the scale space image pyramid; match the second interest point with the first interest point; determine a motion estimation based on the matched interest points; generate a feature descriptor based at least on the second interest point; identify a type of an object associated with the feature descriptor; augment the first video frame to include a label identifying the type of the object, the label being positioned within the first video frame based on the matched interest points; track the object in a subsequent video frame and within the neighborhood without recalculating the feature descriptor for the subsequent video frame; and augment the subsequent video frame such that the position of the label is updated with respect to the first video frame.
-
Specification