Visualizing and updating classifications in a video surveillance system
First Claim
1. A computer-implemented method for a video surveillance system to process a sequence of video frames depicting a scene captured by a video camera, comprising:
- receiving a request to view an object classification type assigned to a foreground object depicted in the sequence of video frames, wherein the object classification type classifies the foreground object as being an instance of one of a plurality of foreground object classification types, wherein the object classification type is assigned to the foreground object based on a plurality of micro-features derived from analyzing pixels depicting the foreground object in the sequence of video frames, wherein the object classification type is generated by mapping micro-features derived from a plurality of foreground objects to nodes of a self-organizing map (SOM) and wherein an Adaptive Resonance Theory (ART) network clusters resulting nodes in the SOM, and wherein each ART network cluster corresponds to one of the plurality of foreground classification types;
generating a visual representation of the requested object classification type;
outputting the generated visual representation for display;
receiving user input requesting to modify a metadata attribute of the object classification type assigned to the foreground object; and
modifying the metadata attribute of the object classification, based on the received user input.
6 Assignments
0 Petitions
Accused Products
Abstract
Techniques are disclosed for visually conveying classifications derived from pixel-level micro-features extracted from image data. The image data may include an input stream of video frames depicting one or more foreground objects. The classifications represent information learned by a video surveillance system. A request may be received to view a classification. A visual representation of the classification may be generated. A user interface may be configured to display the visual representation of the classification and to allow a user to view and/or modify properties associated with the classification.
-
Citations
15 Claims
-
1. A computer-implemented method for a video surveillance system to process a sequence of video frames depicting a scene captured by a video camera, comprising:
-
receiving a request to view an object classification type assigned to a foreground object depicted in the sequence of video frames, wherein the object classification type classifies the foreground object as being an instance of one of a plurality of foreground object classification types, wherein the object classification type is assigned to the foreground object based on a plurality of micro-features derived from analyzing pixels depicting the foreground object in the sequence of video frames, wherein the object classification type is generated by mapping micro-features derived from a plurality of foreground objects to nodes of a self-organizing map (SOM) and wherein an Adaptive Resonance Theory (ART) network clusters resulting nodes in the SOM, and wherein each ART network cluster corresponds to one of the plurality of foreground classification types; generating a visual representation of the requested object classification type; outputting the generated visual representation for display; receiving user input requesting to modify a metadata attribute of the object classification type assigned to the foreground object; and modifying the metadata attribute of the object classification, based on the received user input. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A non-transitory computer-readable storage medium containing a program which, when executed by a video surveillance system, performs an operation to process a sequence of video frames depicting a scene captured by a video camera, the operation comprising:
-
receiving a request to view an object classification type assigned to a foreground object depicted in the sequence of video frames, wherein the object classification type classifies the foreground object as being an instance of one of a plurality of foreground object classification types, wherein the object classification type is assigned to the foreground object based on a plurality of micro-features derived from analyzing pixels depicting the foreground object in the sequence of video frames, and wherein the object classification type is generated by mapping micro-features derived from a plurality of foreground objects to nodes of a self-organizing map (SOM), wherein an Adaptive Resonance Theory (ART) network clusters resulting nodes in the SOM, and wherein each ART network cluster corresponds to one of the plurality of foreground classification types; generating a visual representation of the requested object classification type; and outputting the generated visual representation for display; receiving user input requesting to modify a metadata attribute of the object classification type assigned to the foreground object; and modifying the metadata attribute of the object classification, based on the received user input. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A video surveillance system, comprising:
-
a video input source configured to provide a sequence of video frames, each depicting a scene; a processor; and a memory containing a program, which when executed by the processor is configured to perform an operation to process the scene depicted in the sequence of video frames, the operation comprising; receiving a request to view an object classification type assigned to a foreground object depicted in the sequence of video frames, wherein the object classification type classifies the foreground object as being an instance of one of a plurality of foreground object classification types, wherein the object classification type is assigned to the foreground object based on a plurality of micro-features derived from analyzing pixels depicting the foreground object in the sequence of video frames, wherein the object classification type is generated by mapping micro-features derived from a plurality of foreground objects to nodes of a self-organizing map (SOM) and wherein an Adaptive Resonance Theory (ART) network clusters resulting nodes in the SOM and wherein each ART network cluster corresponds to one of the plurality of foreground classification types, generating a visual representation of the requested object classification type, outputting the generated visual representation for display, receiving user input requesting to modify a metadata attribute of the object classification type assigned to the foreground object, and modifying the metadata attribute of the object classification, based on the received user input. - View Dependent Claims (13, 14, 15)
-
Specification