Tracking using multilevel representations
First Claim
1. A method of tracking a target object in frames of video data, comprising:
- receiving a first tracking position associated with the target object in a first frame of a video sequence;
identifying, for a second frame of the video sequence, a plurality of representation levels, wherein the plurality of representation levels includes a bounding box level;
determining, by a processor, at least one node for each representation level, wherein the at least one node for the bounding box level is included in a second tracking position associated with the target object in the second frame determined by estimating motion of the target object between the first frame and the second frame;
determining, at each of the plurality of representation levels by the processor, a value for each node based on a conditional property of the node; and
adjusting, by the processor, the second tracking position based on the values determined for each of the nodes and interactions between at least the at least one node for the bounding box level and the at least one node for a different representation level.
3 Assignments
0 Petitions
Accused Products
Abstract
A method of tracking a target object in frames of video data includes receiving a first tracking position associated with the target object in a first frame of a video sequence; identifying, for a second frame of the video sequence, a plurality of representation levels and at least one node for each representation level; determining, by a processor, a second tracking position in the second frame by estimating motion of the target object in the second frame between the first frame and the second frame; determining, at each representation level by the processor, a value for each node based on a conditional property of the node in the second frame; and adjusting, by the processor, the second tracking position based on the values determined for each of the nodes and interactions between at least some of the nodes at different representation levels.
-
Citations
20 Claims
-
1. A method of tracking a target object in frames of video data, comprising:
-
receiving a first tracking position associated with the target object in a first frame of a video sequence; identifying, for a second frame of the video sequence, a plurality of representation levels, wherein the plurality of representation levels includes a bounding box level; determining, by a processor, at least one node for each representation level, wherein the at least one node for the bounding box level is included in a second tracking position associated with the target object in the second frame determined by estimating motion of the target object between the first frame and the second frame; determining, at each of the plurality of representation levels by the processor, a value for each node based on a conditional property of the node; and adjusting, by the processor, the second tracking position based on the values determined for each of the nodes and interactions between at least the at least one node for the bounding box level and the at least one node for a different representation level. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. An apparatus for tracking a target object in frames of video data, comprising:
-
one or more processors; and a memory for storing data and program instructions executed by the one or more processors, wherein the one or more processors are configured to executed instructions stored in the memory to; receive a first tracking position associated with the target object in a first frame of a video sequence; identify, for a second frame of the video sequence, a plurality of representation levels, wherein the plurality of representation levels includes a bounding box level; determine at least one node for each representation level, wherein the at least one node for the bounding box level is included in a second tracking position associated with the target object in the second frame determined by estimating motion of the target object between the first frame and the second frame; determine, at each of the plurality of representation levels, a value for each node based on a conditional property of the node; and adjust the second tracking position based on the values determined for each of the nodes and interactions between at least the at least one node for the bounding box level and the at least one node for a different representation level. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification