SYSTEMS AND METHODS FOR OBJECT TRACKING AND CLASSIFICATION
First Claim
1. A method for classifying at least one object of interest in a video, the method comprising:
accessing, using at least one processing device, a frame of the video, the frame including at least one object of interest to be classified;
performing, using the at least one processing device, object detection on the frame to detect the object of interest;
tracking, using the at least one processing device, the object of interest over a plurality of frames in the video over time using a persistent tracking capability;
isolating, using the at least one processing device, a segment of the frame that includes the object of interest;
classifying, using the at least one processing device, the object of interest by processing the segment using deep learning; and
generating an output that indicates the classification of the object of interest.
Abstract
A method for classifying at least one object of interest in a video is provided. The method includes accessing, using at least one processing device, a frame of the video, the frame including at least one object of interest to be classified, performing, using the at least one processing device, object detection on the frame to detect the object of interest, tracking, using the at least one processing device, the object of interest over a plurality of frames in the video over time using a persistent tracking capability, isolating, using the at least one processing device, a segment of the frame that includes the object of interest, classifying, using the at least one processing device, the object of interest by processing the segment using deep learning, and generating an output that indicates the classification of the object of interest.
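The steps recited above can be sketched as a small pipeline. The Python sketch below is illustrative only and is not taken from the specification. All names (`detect`, `Tracker`, `isolate`, `stub_classifier`, `classify_video`) are hypothetical. The detector is assumed to be a simple brightness threshold, persistence is approximated by keeping one track ID while consecutive bounding boxes overlap, and `stub_classifier` is a placeholder standing in for a trained deep learning model.

```python
from typing import Callable, Dict, List, Optional, Tuple

Frame = List[List[int]]          # grayscale frame as rows of pixel intensities
Box = Tuple[int, int, int, int]  # (top, left, bottom, right); bottom/right exclusive


def detect(frame: Frame, threshold: int = 128) -> Optional[Box]:
    """Toy detector (assumption): box around all pixels brighter than threshold."""
    hits = [(r, c) for r, row in enumerate(frame)
            for c, value in enumerate(row) if value > threshold]
    if not hits:
        return None
    rows = [r for r, _ in hits]
    cols = [c for _, c in hits]
    return (min(rows), min(cols), max(rows) + 1, max(cols) + 1)


def box_area(box: Box) -> int:
    return (box[2] - box[0]) * (box[3] - box[1])


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes."""
    height = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    width = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = height * width
    union = box_area(a) + box_area(b) - inter
    return inter / union if union else 0.0


class Tracker:
    """Keeps the same track ID while consecutive boxes overlap enough."""

    def __init__(self, min_iou: float = 0.1) -> None:
        self.min_iou = min_iou
        self.last_box: Optional[Box] = None
        self.track_id = 0

    def update(self, box: Box) -> int:
        # Start a new track when there is no history or the overlap is too small.
        if self.last_box is None or iou(self.last_box, box) < self.min_iou:
            self.track_id += 1
        self.last_box = box
        return self.track_id


def isolate(frame: Frame, box: Box) -> Frame:
    """Crop the segment of the frame that contains the object."""
    top, left, bottom, right = box
    return [row[left:right] for row in frame[top:bottom]]


def stub_classifier(segment: Frame) -> str:
    """Placeholder for a deep learning model: labels by aspect ratio only."""
    return "wide" if len(segment[0]) > len(segment) else "tall"


def classify_video(frames: List[Frame],
                   classifier: Callable[[Frame], str]) -> List[Dict]:
    """Detect, track, isolate, classify, and emit one output record per frame."""
    tracker = Tracker()
    results = []
    for index, frame in enumerate(frames):
        box = detect(frame)
        if box is None:
            continue  # nothing detected in this frame
        track_id = tracker.update(box)
        segment = isolate(frame, box)
        results.append({"frame": index, "track_id": track_id,
                        "box": box, "label": classifier(segment)})
    return results
```

In a real system the `classifier` argument would wrap a trained network. Passing it in as a callable keeps the detection and tracking stages independent of the model that is used.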
20 Claims
1. A method for classifying at least one object of interest in a video, the method comprising:
accessing, using at least one processing device, a frame of the video, the frame including at least one object of interest to be classified;
performing, using the at least one processing device, object detection on the frame to detect the object of interest;
tracking, using the at least one processing device, the object of interest over a plurality of frames in the video over time using a persistent tracking capability;
isolating, using the at least one processing device, a segment of the frame that includes the object of interest;
classifying, using the at least one processing device, the object of interest by processing the segment using deep learning; and
generating an output that indicates the classification of the object of interest.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A computer-implemented system for classifying at least one object of interest in a video, the system comprising:
a tracking component implemented using at least one processing device and configured to:
access a frame of the video, the frame including at least one object of interest to be classified;
perform object detection on the frame to detect the object of interest;
track the object of interest over a plurality of frames in the video over time using a persistent tracking capability; and
isolate a segment of the frame that includes the object of interest; and
a classification component communicatively coupled to said tracking component, said classification component implemented using the at least one processing device and configured to:
classify the object of interest by processing the segment using deep learning; and
generate an output that indicates the classification of the object of interest.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. An object classification computing device for classifying at least one object of interest in a video, the object classification computing device comprising:
a memory device; and
a processor communicatively coupled to said memory device, said processor configured to:
access a frame of the video, the frame including at least one object of interest to be classified;
perform object detection on the frame to detect the object of interest;
track the object of interest over a plurality of frames in the video over time using a persistent tracking capability;
isolate a segment of the frame that includes the object of interest;
classify the object of interest by processing the segment using deep learning; and
generate an output that indicates the classification of the object of interest.
Dependent claims: 18, 19, 20
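The system claim (claim 9) splits the work between a tracking component and a classification component that are communicatively coupled. One way such coupling might look is sketched below in Python. This is an assumption for illustration, not the arrangement described in the specification: the class names, the `Segment` record, and the use of an in-process `queue.Queue` as the link are all hypothetical, and the `model` callable stands in for a deep learning classifier. Detection and tracking are omitted, so the tracking component here only publishes segments that have already been isolated.

```python
import queue
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Segment:
    """Isolated image segment handed from tracking to classification."""
    track_id: int
    frame_index: int
    pixels: List[List[int]]


class TrackingComponent:
    """Publishes isolated segments onto the shared link (hypothetical API)."""

    def __init__(self, outbox: queue.Queue) -> None:
        self.outbox = outbox

    def publish(self, track_id: int, frame_index: int,
                pixels: List[List[int]]) -> None:
        self.outbox.put(Segment(track_id, frame_index, pixels))


class ClassificationComponent:
    """Consumes segments from the link and generates classification outputs."""

    def __init__(self, inbox: queue.Queue,
                 model: Callable[[List[List[int]]], str]) -> None:
        self.inbox = inbox
        self.model = model  # placeholder for a trained deep learning model

    def drain(self) -> List[Dict]:
        outputs = []
        while True:
            try:
                segment = self.inbox.get_nowait()
            except queue.Empty:
                break  # no more pending segments
            outputs.append({"track_id": segment.track_id,
                            "frame_index": segment.frame_index,
                            "label": self.model(segment.pixels)})
        return outputs
```

Because the two components share only the queue and the `Segment` record, either side could be replaced, for example by a network transport or a different model, without changing the other.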
Specification