Real time tracking/detection of multiple targets
Abstract
A mobile platform detects and tracks at least one target in real time by tracking at least one target and creating an occlusion mask that indicates an area in a current image in which to detect a new target. The mobile platform searches the area of the current image indicated by the occlusion mask to detect the new target. Using a mask to tell the detection system where to look for new targets increases the speed of the detection task. Additionally, to achieve real-time operation, detection and tracking are performed within the limited time budget of the (inter) frame duration. Tracking targets is given higher priority than detecting new targets. After tracking is completed, detection is performed in the time remaining in the budget for the frame duration. Detection for one frame may thus be performed over multiple frames.
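The scheduling idea in the abstract (tracking first, detection in the leftover time, one detection spread over several frames) can be illustrated with a small simulation. This is only a sketch under stated assumptions: `run_frames`, the fixed per-frame tracking cost, and the division of detection into equal-cost steps are all hypothetical and are not taken from the patent.

```python
from typing import List, Optional, Tuple


def run_frames(
    n_frames: int,
    frame_budget_ms: int,
    tracking_cost_ms: int,
    step_cost_ms: int,
    steps_per_detection: int,
) -> List[Tuple[int, int]]:
    """Simulate a frame loop in which tracking has priority over detection.

    Returns (source_frame, finished_in_frame) pairs, one per completed detection.
    All costs are assumed constants; a real system would measure them.
    """
    completed: List[Tuple[int, int]] = []
    pending: Optional[List[int]] = None  # [source_frame, steps_left]

    for frame in range(n_frames):
        # Tracking runs first; whatever is left belongs to detection.
        remaining = frame_budget_ms - tracking_cost_ms

        # Start a new detection on this frame only if none is in progress.
        if pending is None:
            pending = [frame, steps_per_detection]

        # Run as many detection steps as fit into the leftover time.
        while remaining >= step_cost_ms and pending[1] > 0:
            pending[1] -= 1
            remaining -= step_cost_ms

        # The detection started on an earlier frame may finish only now.
        if pending[1] == 0:
            completed.append((pending[0], frame))
            pending = None

    return completed
```

With a 33 ms budget, 25 ms of tracking, and a detection that needs five 4 ms steps, `run_frames(6, 33, 25, 4, 5)` reports that the detection started on frame 0 finishes during frame 2, and the one started on frame 3 finishes during frame 5.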
34 Citations
31 Claims
1. A method of real-time pose estimation and tracking, the method comprising:
- tracking at least one target;
- creating an occlusion mask based on the at least one tracked target; and
- searching the area of the current image indicated by the occlusion mask, which comprises pixels in the current image that are not covered by the occlusion mask, to detect a new target.
Dependent claims: 2, 3, 4, 5
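The mask-based search in claim 1 can be sketched as follows. This is an illustrative sketch, not the claimed implementation: it assumes each tracked target is approximated by an axis-aligned bounding box, and the names `create_occlusion_mask`, `search_area`, and `filter_keypoints` are hypothetical.

```python
from typing import List, Tuple

# Assumed box format: (x0, y0, x1, y1) with x1 and y1 exclusive.
Box = Tuple[int, int, int, int]
Mask = List[List[bool]]


def create_occlusion_mask(width: int, height: int, tracked: List[Box]) -> Mask:
    """Return a mask in which True marks pixels covered by a tracked target."""
    mask = [[False] * width for _ in range(height)]
    for x0, y0, x1, y1 in tracked:
        # Clip each box to the image so partially visible targets are handled.
        for y in range(max(0, y0), min(height, y1)):
            for x in range(max(0, x0), min(width, x1)):
                mask[y][x] = True
    return mask


def search_area(mask: Mask) -> List[Tuple[int, int]]:
    """Return the (x, y) pixels that are NOT covered by the mask."""
    return [
        (x, y)
        for y, row in enumerate(mask)
        for x, covered in enumerate(row)
        if not covered
    ]


def filter_keypoints(keypoints: List[Tuple[int, int]], mask: Mask) -> List[Tuple[int, int]]:
    """Keep only keypoints inside the image and outside the masked area."""
    return [
        (x, y)
        for x, y in keypoints
        if 0 <= y < len(mask) and 0 <= x < len(mask[0]) and not mask[y][x]
    ]
```

In this sketch a detector would only consider the keypoints returned by `filter_keypoints`, so regions already explained by a tracked target are skipped.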
6. An apparatus comprising:
- a camera;
- a processor connected to the camera configured to receive images from the camera in a live video stream;
- a display controlled by the processor; and
- memory in communication with the processor, wherein the memory is configured to store instructions configured to cause the processor to track at least one target, to create an occlusion mask based on the at least one tracked target, and search the area of the current image indicated by the occlusion mask, which comprises pixels in the current image that are not covered by the occlusion mask, to detect a new target.
Dependent claims: 7, 8, 9, 10
11. A system for real-time pose estimation and tracking, the system comprising:
- means for tracking at least one target;
- means for creating an occlusion mask based on the at least one tracked target; and
- means for searching the area of the current image indicated by the occlusion mask, which comprises pixels in the current image that are not covered by the occlusion mask, to detect a new target.
Dependent claims: 12, 13, 14
15. A non-transitory computer-readable medium including program code stored thereon, comprising:
- program code to track at least one target;
- program code to create an occlusion mask based on the at least one tracked target; and
- program code to search the area of the current image indicated by the occlusion mask, which comprises pixels in the current image that are not covered by the occlusion mask, to detect a new target.
Dependent claims: 16, 17, 18
19. A method of real-time pose estimation and tracking, the method comprising:
- determining a time budget based in part on a frame duration;
- tracking at least one target in each frame in a live video stream by matching a number of features in each frame to features from a plurality of reference images, wherein the tracking the target in each frame comprises varying the number of features to be matched based on the time budget;
- determining remaining time in the time budget after the tracking the at least one target; and
- detecting a new target in one frame from the live video stream during a plurality of frames in the live video stream, wherein the detecting the new target is at least partially performed during the remaining time.
Dependent claims: 20, 21
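One way to picture "varying the number of features to be matched based on the time budget" is a simple proportional rule. The sketch below is an assumption made for illustration only: the claim does not say how the number is chosen, and `features_to_match`, `remaining_time_us`, the per-feature cost, and the lower and upper bounds are all hypothetical. Times are integer microseconds to keep the arithmetic exact.

```python
def features_to_match(
    time_budget_us: int,
    n_targets: int,
    cost_per_feature_us: int,
    min_features: int = 20,
    max_features: int = 100,
) -> int:
    """Choose how many features to match per tracked target.

    The budget is shared equally among targets. The result is clamped to
    [min_features, max_features]; the lower bound is assumed to be the
    fewest matches a tracker needs, so it may overrun a very small budget.
    """
    if n_targets == 0:
        return 0
    affordable = time_budget_us // (n_targets * cost_per_feature_us)
    return max(min_features, min(max_features, affordable))


def remaining_time_us(time_budget_us: int, tracking_time_us: int) -> int:
    """Time left for detection after tracking; never negative."""
    return max(0, time_budget_us - tracking_time_us)
```

For example, with a 30,000 µs budget and an assumed 100 µs per feature, one target gets the full 100 features, while six targets get 50 each. Whatever `remaining_time_us` returns is what the detector may use in that frame.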
22. An apparatus comprising:
- a camera;
- a processor connected to the camera configured to receive frames from the camera in a live video stream;
- a display controlled by the processor; and
- memory in communication with the processor, wherein the memory is configured to store instructions configured to cause the processor to determine a time budget based in part on a frame duration, to track at least one target in each frame in the live video stream by matching a number of features in each frame to features from a plurality of reference images, wherein the processor varies the number of features to be matched based on the time budget, to determine a remaining time in the time budget after the tracking the at least one target; and to detect a new target in one frame from the live video stream during a plurality of frames in the live video stream, wherein the detecting the new target is at least partially performed during the remaining time.
Dependent claims: 23, 24
25. A system of real-time pose estimation and tracking, the system comprising:
- means for determining a time budget based in part on a frame duration;
- means for tracking at least one target in each frame in a video stream by matching a number of features in each frame to features from a plurality of reference images, wherein the number of features to be matched is varied based on the time budget;
- determining remaining time in the time budget after the tracking the at least one target; and
- means for detecting a new target in one frame from the live video stream during a plurality of frames in the live video stream, wherein the detecting the new target is at least partially performed during the remaining time.
Dependent claims: 26, 27
28. A non-transitory computer-readable medium including program code stored thereon, comprising:
- program code to determine a time budget based in part on a frame duration;
- program code to track at least one target in each frame in a live video stream by matching a number of features in each frame to features from a plurality of reference images, wherein the number of features to be matched is varied based on the time budget;
- program code to determine remaining time in the time budget after the tracking the at least one target; and
- program code to detect a new target in one frame from the live video stream during a plurality of frames in the live video stream, wherein the detecting the new target is at least partially performed during the remaining time.
Dependent claims: 29, 30
31. A method of real-time pose estimation and tracking, the method comprising:
- determining a time budget based in part on image acquisition time and frame duration;
- tracking at least one target in each frame in a video stream by matching a number of features in each frame to features from a plurality of reference images, wherein the number of features to be matched for the at least one target is varied based on the time budget;
- determining remaining time in the time budget after the tracking at least one target is completed; and
- detecting a new target in one frame from the video stream during a plurality of frames in the video stream, wherein the detecting the new target is partially performed during the remaining time if the remaining time is greater than zero.
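As a worked illustration of the time budget in claim 31, suppose the budget is simply the frame duration minus the image acquisition time. That subtraction is an assumption: the claim only says the budget is based "in part" on both quantities. The symbols and the example numbers below (a 30 fps stream, 5 ms acquisition, 20 ms tracking) are likewise hypothetical.

```latex
\begin{align*}
T_{\text{budget}} &= T_{\text{frame}} - T_{\text{acq}}
  \approx 33.3\,\text{ms} - 5\,\text{ms} = 28.3\,\text{ms} \\
T_{\text{rem}} &= T_{\text{budget}} - T_{\text{track}}
  \approx 28.3\,\text{ms} - 20\,\text{ms} = 8.3\,\text{ms}
\end{align*}
```

Because $T_{\text{rem}} > 0$ in this example, part of the detection work would run in the remaining 8.3 ms of the frame. If tracking had used the whole budget, $T_{\text{rem}} \le 0$ and no detection work would run in that frame.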
Specification