Bi-Directional Tracking Using Trajectory Segment Analysis
First Claim
1. At least one computer-readable media storing computer-executable instructions for performing a method, the method comprising:
- determining two keyframes of a video sequence;
obtaining a first state for a target object within one of the two keyframes and a second state for the target object within the other keyframe; and
tracking the target object in the frames from the one keyframe to the other keyframe based on the first and second state.
2 Assignments
0 Petitions
Accused Products
Abstract
The present video tracking technique outputs a Maximum A Posterior (MAP) solution for a target object based on two object templates obtained from a start and an end keyframe of a whole state sequence. The technique first minimizes the whole state space of the sequence by generating a sparse set of local two-dimensional modes in each frame of the sequence. The two-dimensional modes are converted into three-dimensional points within a three-dimensional volume. The three-dimensional points are clustered using a spectral clustering technique where each cluster corresponds to a possible trajectory segment of the target object. If there is occlusion in the sequence, occlusion segments are generated so that an optimal trajectory of the target object can be obtained.
-
Citations
20 Claims
-
1. At least one computer-readable media storing computer-executable instructions for performing a method, the method comprising:
-
determining two keyframes of a video sequence;
obtaining a first state for a target object within one of the two keyframes and a second state for the target object within the other keyframe; and
tracking the target object in the frames from the one keyframe to the other keyframe based on the first and second state. - View Dependent Claims (2, 3, 4)
-
-
5. A computer-implemented method comprising:
-
designating a start and an end frame within a video sequence;
obtaining an initial state for a target object within the start frame and a final state for the target object within the end frame;
performing two-dimensional extraction on the frames starting with the start frame and ending with the end frame to obtain a sparse set of local two-dimensional modes for the target object in the frames based on the initial state and the final state; and
performing three-dimensional trajectory segment extraction based on the sparse set of local two-dimensional (2D) modes to obtain a plurality of three-dimensional trajectory segments for the target object. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A computing device, comprising:
-
a processor;
a memory into which a plurality of instructions are loaded, the plurality of instructions performing a method for tracking a target object within a video sequence when executed by the processor, the video sequence being decomposed into several short sequences, the shorter sequences having a start frame and an end frame, the method comprising;
a) generating a set of local two-dimensional modes for each frame of one short sequence, each local 2D mode identifying a location within the frame that has visual statistics similar to the target object identified in the start frame;
b) obtaining a plurality of three-dimensional trajectory segments for the target object based on the set of local two-dimensional modes;
c) obtaining at least one occlusion segment that connects two disjointed trajectory segments of the plurality of three-dimensional trajectory segments; and
d) determining an optimal trajectory based on the plurality of three-dimensional trajectory segments and the at least one occlusion segment. - View Dependent Claims (19, 20)
-
Specification