Behavioral recognition system
Abstract
Embodiments of the present invention provide a method and a system for analyzing and learning behavior based on an acquired stream of video frames. Objects depicted in the stream are determined based on an analysis of the video frames. Each object may have a corresponding search model used to track an object's motion frame-to-frame. Classes of the objects are determined and semantic representations of the objects are generated. The semantic representations are used to determine objects' behaviors and to learn about behaviors occurring in an environment depicted by the acquired video streams. This way, the system learns rapidly and in real-time normal and abnormal behaviors for any environment by analyzing movements or activities or absence of such in the environment and identifies and predicts abnormal and suspicious behavior based on what has been learned.
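The learning loop the abstract describes (classify objects, describe their behavior semantically, learn what is normal per class, flag what is not) can be illustrated with a minimal sketch. This is not the patented implementation: the `Observation` schema, the `BehaviorModel` class, the behavior labels, and both thresholds are hypothetical, and simple frequency counting stands in for whatever machine learning engine an embodiment actually uses. Object detection, tracking, and classification are assumed to have happened upstream.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Observation:
    """One semantic record for a tracked object (hypothetical schema)."""
    object_id: int
    object_class: str  # e.g. "person"; assumed to be assigned upstream
    behavior: str      # coarse semantic label, e.g. "walks" or "stops"


class BehaviorModel:
    """Counts how often each behavior occurs for each object class."""

    def __init__(self, min_samples: int = 20, rare_below: float = 0.05):
        self.counts = defaultdict(Counter)
        self.min_samples = min_samples  # assumed warm-up size per class
        self.rare_below = rare_below    # assumed rarity threshold

    def observe(self, obs: Observation) -> bool:
        """Score the observation against history, then learn from it.

        Returns True when the behavior is rare for the object's class.
        """
        seen = self.counts[obs.object_class]
        total = sum(seen.values())
        # Stay silent until enough history exists for this class.
        unusual = (
            total >= self.min_samples
            and seen[obs.behavior] / total < self.rare_below
        )
        seen[obs.behavior] += 1  # learning continues as instances occur
        return unusual


model = BehaviorModel()
for i in range(30):
    model.observe(Observation(i, "person", "walks"))

# A behavior never seen before for this class is flagged.
flagged = model.observe(Observation(99, "person", "climbs_fence"))
```

Scoring before updating means each observation is judged only against earlier ones, which mirrors the abstract's point that behaviors are identified as they occur while the model keeps learning.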
24 Claims
1. A method for processing video image data, the method comprising:
  identifying, by operation of one or more processors, one or more objects depicted in a sequence of video frames capturing a scene;
  generating, from the sequence of video frames, a plurality of information streams characterizing one or more of the identified objects; and
  generating, from the plurality of information streams, by a machine learning engine, one or more object classifications to assign to objects depicted in the sequence of video frames, wherein the machine learning engine is configured to derive patterns of behavior engaged in by objects having a common object classification and to identify instances of the patterns of behavior engaged in by objects depicted in the sequence of video frames as said instances occur.
Dependent claims: 2, 3, 4, 5, 6, 7, 8

9. A non-transitory computer-readable storage medium containing a program, which, when executed on a processor is configured to perform an operation, comprising:
  identifying, by operation of the processor, one or more objects depicted in a sequence of video frames capturing a scene;
  generating, from the sequence of video frames, a plurality of information streams characterizing one or more of the identified objects;
  generating, from the plurality of information streams, by a machine learning engine, one or more object classifications to assign to objects depicted in the sequence of video frames, wherein the machine learning engine is configured to derive patterns of behavior engaged in by objects having a common object classification and to identify instances of the patterns of behavior engaged in by objects depicted in the sequence of video frames as said instances occur.
Dependent claims: 10, 11, 12, 13, 14, 15, 16

17. A system, comprising:
  a video input source;
  a processor; and
  a memory storing:
    a computer vision engine, wherein the computer vision engine is configured to:
      identify, by operation of one or more processors, one or more objects depicted in a sequence of video frames capturing a scene, and
      generate, from the sequence of video frames, a plurality of information streams characterizing one or more of the identified objects; and
    a machine learning engine, wherein the machine learning engine is configured to:
      generate, from the plurality of information streams, one or more object classifications to assign to objects depicted in the sequence of video frames, wherein the machine learning engine is configured to derive patterns of behavior engaged in by objects having a common object classification and to identify instances of the patterns of behavior engaged in by objects depicted in the sequence of video frames as said instances occur.
Dependent claims: 18, 19, 20, 21, 22, 23, 24

Specification