Background model for complex and dynamic scenes
First Claim
Patent Images
1. A computer-implemented method, comprising:
- receiving a sequence of video frames from a video camera;
receiving a request to view a scene depicted in the sequence of video frames;
identifying and tracking at least one object between separate frames of the sequence of video frames;
classifying each tracked object based on a known category of objects;
generating a stream of context events associated with each tracked object;
generating a sequence of primitive events based on the stream of context events;
storing the stream of context events and the sequence of primitive events in one or more adaptive resonance theory (ART) networks;
storing detailed data in the one or more ART networks related to an event based on the stream of context events and the sequence of primitive events;
storing generalized data in the one or more ART networks related to an event based on the stream of context events and the sequence of primitive events; and
evaluating the stream of context events, the sequence of primitive events, the detailed data, and the generalized data with the one or more ART networks to learn patterns of behavior that occur within the scene.
4 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for viewing a scene depicted in a sequence of video frames and identifying and tracking objects between separate frames of the sequence. Each tracked object is classified based on known categories and a stream of context events associated with the object is generated. A sequence of primitive events based on the stream of context events is generated and stored together, along with detailed data and generalized data related to an event. All of the data is then evaluated to learn patterns of behavior that occur within the scene.
57 Citations
20 Claims
-
1. A computer-implemented method, comprising:
-
receiving a sequence of video frames from a video camera; receiving a request to view a scene depicted in the sequence of video frames; identifying and tracking at least one object between separate frames of the sequence of video frames; classifying each tracked object based on a known category of objects; generating a stream of context events associated with each tracked object; generating a sequence of primitive events based on the stream of context events; storing the stream of context events and the sequence of primitive events in one or more adaptive resonance theory (ART) networks; storing detailed data in the one or more ART networks related to an event based on the stream of context events and the sequence of primitive events; storing generalized data in the one or more ART networks related to an event based on the stream of context events and the sequence of primitive events; and evaluating the stream of context events, the sequence of primitive events, the detailed data, and the generalized data with the one or more ART networks to learn patterns of behavior that occur within the scene. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system, comprising:
-
a processor; and one or more adaptive resonance theory (ART) networks in communication with the processor when the system is in operation, the one or more ART networks having stored thereon instructions that upon execution by the processor at least cause the system to; receive a sequence of video frames from a video camera; receive a request to view a scene depicted in the sequence of video frames; identify and track at least one object between separate frames of the sequence of video frames; classify each tracked object based on a known category of objects; generate a stream of context events associated with each tracked object; generate a sequence of primitive events based on the stream of context events; store the stream of context events and the sequence of primitive events in the one or more ART networks; store detailed data in the one or more ART networks related to an event based on the stream of context events and the sequence of primitive events; store generalized data in the one or more ART networks related to an event based on the stream of context events and the sequence of primitive events; and evaluate the stream of context events, the sequence of primitive events, the detailed data, and the generalized data with the one or more ART networks to learn patterns of behavior that occur within the scene. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer-implemented method, comprising:
-
receiving a sequence of video frames from a video camera; receiving a request to view a scene depicted in the sequence of video frames; retrieving a background image and one or more foreground images associated with the scene; identifying and tracking at least some of the one or more foreground images between separate frames of the sequence of video frames; classifying each tracked foreground image based on a known category of objects; generating a stream of context events associated with each tracked foreground image; generating a sequence of primitive events based on the stream of context events; storing the stream of context events and the sequence of primitive events in an adaptive resonance theory (ART) network; storing detailed data in the adaptive resonance theory (ART) network related to an event based on the stream of context events and the sequence of primitive events; storing generalized data in the adaptive resonance theory (ART) network related to an event based on the stream of context events and the sequence of primitive events; and evaluating the stream of context events, the sequence of primitive events, the detailed data, and the generalized data with the one or more ART networks to learn patterns of behavior that occur within the scene. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A system, comprising:
-
a processor; and one or more adaptive resonance theory (ART) networks in communication with the processor when the system is in operation, the one or more ART networks having stored thereon instructions that upon execution by the processor at least cause the system to; receive a sequence of video frames from a video camera; receive a request to view a scene depicted in the sequence of video frames; retrieve background image and one or more foreground images associated with the scene; identify and tracking at least some of the one or more foreground images between separate frames of the sequence of video frames; classify each tracked foreground image based on a known category of objects; generate a stream of context events associated with each tracked foreground image; generate a sequence of primitive events based on the stream of context events; store the stream of context events and the sequence of primitive events in the one or more ART networks; store detailed data in the one or more ART networks related to an event based on the stream of context events and the sequence of primitive events; store generalized data in the one or more ART networks related to an event based on the stream of context events and the sequence of primitive events; and evaluate the stream of context events, the sequence of primitive events, the detailed data, and the generalized data with the one or more ART networks to learn patterns of behavior that occur within the scene. - View Dependent Claims (20)
-
Specification