BACKGROUND MODEL FOR COMPLEX AND DYNAMIC SCENES
Abstract
Techniques are disclosed for learning and modeling a background for a complex and/or dynamic scene over a period of observations without supervision. A background/foreground component of a computer vision engine may be configured to model a scene using an array of ART networks. The ART networks learn the regularity and periodicity of the scene by observing the scene over a period of time. Thus, the ART networks allow the computer vision engine to model complex and dynamic scene backgrounds in video.
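The per-pixel clustering described in the abstract can be sketched as follows. This is a hypothetical, simplified illustration, not the patented implementation: it replaces a full ART network with a prototype list plus a vigilance (distance) test, and the class name `SimpleARTPixel`, its parameters, and its thresholds are all assumptions made for the example.

```python
import numpy as np

class SimpleARTPixel:
    """Simplified ART-style clusterer for one pixel's appearance values.

    Each cluster keeps a prototype (mean appearance) and an observation
    count. A sample is mapped to the closest prototype if it passes a
    vigilance (similarity) test; otherwise a new cluster is created.
    Samples mapped to frequently reinforced clusters are classified as
    background; others as foreground.
    """

    def __init__(self, vigilance=0.1, learning_rate=0.1, bg_threshold=30):
        self.vigilance = vigilance          # max distance for a cluster match
        self.learning_rate = learning_rate  # prototype update step
        self.bg_threshold = bg_threshold    # observations before a cluster models background
        self.prototypes = []                # cluster prototype vectors
        self.counts = []                    # per-cluster observation counts

    def observe(self, appearance):
        """Map appearance values (e.g. normalized RGB) to a cluster and
        classify the pixel. Returns 'background' or 'foreground'."""
        x = np.asarray(appearance, dtype=float)
        # Find the best-matching existing cluster.
        best, best_dist = None, float("inf")
        for i, proto in enumerate(self.prototypes):
            dist = np.linalg.norm(x - proto)
            if dist < best_dist:
                best, best_dist = i, dist
        if best is not None and best_dist <= self.vigilance:
            # Resonance: nudge the matched prototype toward the sample.
            self.prototypes[best] += self.learning_rate * (x - self.prototypes[best])
            self.counts[best] += 1
            matched = best
        else:
            # No cluster passed the vigilance test: create a new one.
            self.prototypes.append(x.copy())
            self.counts.append(1)
            matched = len(self.prototypes) - 1
        return "background" if self.counts[matched] >= self.bg_threshold else "foreground"
```

With this sketch, a pixel showing a stable appearance for many frames settles into a mature cluster and is reported as background, while a sudden appearance change fails the vigilance test, spawns a fresh cluster, and is reported as foreground.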
Claims (25)
1. A computer-implemented method for generating a background model of a scene depicted in a sequence of video frames captured by a video camera, the method comprising:

    receiving a video frame, wherein the video frame includes one or more appearance values for each of a plurality of pixels;

    for one or more of the pixels:

        passing the appearance values for the pixel to an input layer of an adaptive resonance theory (ART) network corresponding to the pixel;

        mapping, by the ART network, the appearance values to one of one or more clusters of the ART network;

        classifying the pixel as depicting one of scene background and scene foreground, based on the mapping of the appearance values to the cluster of the ART network.

    (Dependent claims: 2-9)
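As a rough, hypothetical sketch of claim 1's steps applied across a whole frame (one simplified ART-style model per pixel, echoing the abstract's "array of ART networks"), assuming made-up names (`process_frame`, `networks`) and a distance-based vigilance test in place of a full ART implementation:

```python
import numpy as np

def process_frame(frame, networks, vigilance=0.15, bg_count=25):
    """Apply the claimed per-pixel steps to one video frame.

    frame    : H x W x C array of appearance values in [0, 1].
    networks : dict mapping (row, col) -> list of [prototype, count] pairs,
               one simplified ART-style model per pixel (starts empty).
    Returns an H x W boolean mask, True where the pixel is foreground.
    """
    h, w, _ = frame.shape
    fg = np.zeros((h, w), dtype=bool)
    for r in range(h):
        for c in range(w):
            x = frame[r, c].astype(float)
            clusters = networks.setdefault((r, c), [])
            # Map the appearance values to the first cluster within
            # the vigilance radius, if any.
            best = None
            for cl in clusters:
                if np.linalg.norm(x - cl[0]) <= vigilance:
                    best = cl
                    break
            if best is None:
                # No match: create a new cluster for this appearance.
                best = [x.copy(), 0]
                clusters.append(best)
            best[0] += 0.1 * (x - best[0])   # reinforce the prototype
            best[1] += 1
            # Classify: mature (often-seen) clusters model background.
            fg[r, c] = best[1] < bg_count
    return fg
```

Feeding a stable scene through this sketch for enough frames drives every pixel's mask value to background; a novel appearance at any pixel then creates a fresh, immature cluster and is flagged as foreground.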
10. A computer-readable storage medium containing a program, which, when executed on a processor, performs an operation for generating a background model of a scene depicted in a sequence of video frames captured by a video camera, the operation comprising:

    receiving a video frame, wherein the video frame includes one or more appearance values for each of a plurality of pixels;

    for one or more of the pixels:

        passing the appearance values for the pixel to an input layer of an adaptive resonance theory (ART) network corresponding to the pixel;

        mapping, by the ART network, the appearance values to one of one or more clusters of the ART network;

        classifying the pixel as depicting one of scene background and scene foreground, based on the mapping of the appearance values to the cluster of the ART network.

    (Dependent claims: 11-17)
18. A system, comprising:

    a video input source configured to provide a sequence of video frames, each depicting a scene;

    a processor; and

    a memory containing a program, which, when executed on the processor, is configured to perform an operation for generating a background model of a scene depicted in a sequence of video frames captured by a video camera, the operation comprising:

        receiving a video frame, wherein the video frame includes one or more appearance values for each of a plurality of pixels;

        for one or more of the pixels:

            passing the appearance values for the pixel to an input layer of an adaptive resonance theory (ART) network corresponding to the pixel;

            mapping, by the ART network, the appearance values to one of one or more clusters of the ART network;

            classifying the pixel as depicting one of scene background and scene foreground, based on the mapping of the appearance values to the cluster of the ART network.

    (Dependent claims: 19-25)
Specification