Systems and methods for automated object recognition
First Claim
1. A system for recognizing an object in a video stream, comprising:
a memory storing instructions; and
a processor configured to execute the stored instructions to:
receive the video stream from a video source, the video stream comprising a first set of video frames;
select at least one video frame from the first set of the video frames according to a frame selection rate, wherein the frame selection rate determines a number of the selected video frames;
partition the one or more selected video frames into one or more sets of image blocks, each set of image blocks corresponding to a respective video frame;
identify, within one or more sets of image blocks, a region which comprises an image of an object, the identification being based on a machine-learning algorithm for determining regions characterized by an image recognition parameter;
calculate a likelihood metric that the region corresponds to the object;
adjust the frame selection rate when the likelihood metric is less than a predetermined threshold; and
display, on a display, information identifying the object.
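The frame selection and rate adjustment steps recited above might be sketched as follows. This is only an illustrative sketch, not the claimed implementation: the function names, the stride-based sampling, and the `factor` parameter are hypothetical, and the claim does not state in which direction the rate is adjusted, so raising the rate when the likelihood is low is an assumption.

```python
def select_frames(frames, selection_rate):
    """Keep roughly `selection_rate` of the frames by taking every k-th frame.

    Assumption: the rate is a fraction in (0, 1]; the claim only says the
    rate determines the number of selected frames.
    """
    if not 0.0 < selection_rate <= 1.0:
        raise ValueError("selection_rate must be in (0, 1]")
    step = max(1, round(1.0 / selection_rate))
    return frames[::step]


def adjust_selection_rate(selection_rate, likelihood, threshold, factor=2.0):
    """Return a new rate when the likelihood falls below the threshold.

    Assumption: a low likelihood means more frames should be examined,
    so the rate is multiplied by `factor` and capped at 1.0.
    """
    if likelihood < threshold:
        return min(1.0, selection_rate * factor)
    return selection_rate
```

With ten frames and a rate of 0.25, this sketch keeps every fourth frame; if the likelihood metric then falls below the threshold, the rate doubles for the next pass.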
Abstract
A method for recognizing an object in a video stream may include receiving a video stream from a video source, the video stream comprising a plurality of video frames. The method may also include selecting at least one video frame from the video frames according to a frame selection rate. The method may also include partitioning the selected video frame into a first plurality of image blocks. The method may also include recognizing, out of the first plurality of image blocks, a second plurality of image blocks which comprise an image of an object, the recognition being based on an image recognition parameter determined by a machine-learning algorithm. The method may also include determining that at least one of the second plurality of image blocks corresponds to the object based on a likelihood metric, the likelihood metric being determined by the processor based on at least the frame selection rate. The method may further include displaying, on a display, information identifying the object. A system and non-transitory computer-readable medium may also be provided.
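As a rough illustration of the partitioning step described in the abstract, a selected frame can be cut into a grid of fixed-size image blocks. The sketch below makes several assumptions: the frame is a plain list of pixel rows, blocks are rectangular and non-overlapping, and the name `partition_into_blocks` is hypothetical. The abstract does not specify block shape or size.

```python
def partition_into_blocks(frame, block_h, block_w):
    """Split a 2-D frame (list of pixel rows) into non-overlapping blocks.

    Returns a dict keyed by (block_row, block_col). Blocks on the right
    and bottom edges may be smaller when the frame size is not a multiple
    of the block size.
    """
    if block_h <= 0 or block_w <= 0:
        raise ValueError("block dimensions must be positive")
    blocks = {}
    for top in range(0, len(frame), block_h):
        for left in range(0, len(frame[0]), block_w):
            block = [row[left:left + block_w]
                     for row in frame[top:top + block_h]]
            blocks[(top // block_h, left // block_w)] = block
    return blocks
```

Keying each block by its grid position keeps the set of blocks tied to its source frame, which mirrors the claim language that each set of image blocks corresponds to a respective video frame.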
20 Claims
1. A system for recognizing an object in a video stream, comprising:
a memory storing instructions; and
a processor configured to execute the stored instructions to:
receive the video stream from a video source, the video stream comprising a first set of video frames;
select at least one video frame from the first set of the video frames according to a frame selection rate, wherein the frame selection rate determines a number of the selected video frames;
partition the one or more selected video frames into one or more sets of image blocks, each set of image blocks corresponding to a respective video frame;
identify, within one or more sets of image blocks, a region which comprises an image of an object, the identification being based on a machine-learning algorithm for determining regions characterized by an image recognition parameter;
calculate a likelihood metric that the region corresponds to the object;
adjust the frame selection rate when the likelihood metric is less than a predetermined threshold; and
display, on a display, information identifying the object.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A computer-implemented method for recognizing an object in a video stream, comprising:
receiving the video stream from a video source, the video stream comprising a first set of video frames;
selecting at least one video frame from the first set of the video frames according to a frame selection rate, wherein the frame selection rate determines a number of the selected video frames;
partitioning the one or more selected video frames into one or more sets of image blocks, each set of image blocks corresponding to a respective video frame;
identifying, within one or more sets of image blocks, a region which comprises an image of an object, the identification being based on a machine-learning algorithm for determining regions characterized by an image recognition parameter;
calculating a likelihood metric that the region corresponds to the object;
adjusting the frame selection rate when the likelihood metric is less than a predetermined threshold; and
displaying, on a display, information identifying the object.
Dependent claims: 12, 13, 14, 15
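The identifying and calculating steps of the method might look like the sketch below. It is a simplified stand-in and not the claimed algorithm: it assumes a machine-learning model has already assigned each image block a score in [0, 1], treats the image recognition parameter as a minimum score, takes the bounding box of the qualifying blocks as the region, and uses their mean score as the likelihood metric. The names `identify_region`, `block_scores`, and `recognition_param` are hypothetical.

```python
def identify_region(block_scores, recognition_param):
    """Find the region of blocks that likely contain the object.

    block_scores: dict mapping (block_row, block_col) -> score in [0, 1],
        assumed to come from a trained model (not shown here).
    recognition_param: minimum score for a block to count as a hit.

    Returns (region, likelihood), where region is the bounding box
    (min_row, min_col, max_row, max_col) of the hit blocks, or None
    when no block qualifies, and likelihood is the mean hit score.
    """
    hits = {pos: s for pos, s in block_scores.items()
            if s >= recognition_param}
    if not hits:
        return None, 0.0
    rows = [r for r, _ in hits]
    cols = [c for _, c in hits]
    region = (min(rows), min(cols), max(rows), max(cols))
    likelihood = sum(hits.values()) / len(hits)
    return region, likelihood
```

The returned likelihood can then be compared against the predetermined threshold to decide whether the frame selection rate should be adjusted.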
16. A non-transitory computer-readable medium storing instructions which, when executed, cause at least one processor to perform operations for recognizing an object in a video stream, the operations comprising:
receiving the video stream from a video source, the video stream comprising a first set of video frames;
selecting at least one video frame from the first set of the video frames according to a frame selection rate, wherein the frame selection rate determines a number of the selected video frames;
partitioning the one or more selected video frames into one or more sets of image blocks, each set of image blocks corresponding to a respective video frame;
identifying, within one or more sets of image blocks, a region which comprises an image of an object, the identification being based on a machine-learning algorithm for determining regions characterized by an image recognition parameter;
calculating a likelihood metric that the region corresponds to the object;
adjusting the frame selection rate when the likelihood metric is less than a predetermined threshold; and
displaying, on a display, information identifying the object.
Dependent claims: 17, 18, 19, 20
Specification