System and method for video content analysis using depth sensing

US 9,805,266 B2
Filed: 01/26/2016
Issued: 10/31/2017
Est. Priority Date: 01/17/2012
Status: Active Grant

First Claim

Patent Images

1. A video content analysis method comprising:

capturing a video sequence that includes a plurality of frames, each frame including a video image;

for each frame, receiving two-dimensional (2D) image data of the video image and also receiving depth data associated with the image data;

analyzing the 2D image data, and based on an analysis of the 2D image data without the depth data detecting one or more objects depicted in the video sequence as potential human beings;

using the depth data along with the one or more detected objects to classify at least a first object of the one or more detected objects as a person to be tracked, wherein a volume of the one or more detected objects is used to classify at least the first object as a person to be tracked;

performing tracking on at least the first classified object; and

performing event detection analysis on the first classified object,wherein the volume is determined by using the depth data along with the 2D image data to determine a plurality of convex hull slices on different Z-planes, and by summing areas of the plurality of convex hull slices.

View all claims

3 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method and system for performing video content analysis based on two-dimensional image data and depth data are disclosed. Video content analysis may be performed on the two-dimensional image data, and then the depth data may be used along with the results of the video content analysis of the two-dimensional data for tracking and event detection.

64 Citations

View as Search Results

13 Claims

1. A video content analysis method comprising:
- capturing a video sequence that includes a plurality of frames, each frame including a video image;
  
  for each frame, receiving two-dimensional (2D) image data of the video image and also receiving depth data associated with the image data;
  
  analyzing the 2D image data, and based on an analysis of the 2D image data without the depth data detecting one or more objects depicted in the video sequence as potential human beings;
  
  using the depth data along with the one or more detected objects to classify at least a first object of the one or more detected objects as a person to be tracked, wherein a volume of the one or more detected objects is used to classify at least the first object as a person to be tracked;
  
  performing tracking on at least the first classified object; and
  
  performing event detection analysis on the first classified object,wherein the volume is determined by using the depth data along with the 2D image data to determine a plurality of convex hull slices on different Z-planes, and by summing areas of the plurality of convex hull slices.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
- - 2. The video content analysis method of claim 1, further comprising:
    - using the depth data along with the one or more detected objects to additionally classify at least a second object of the one or more detected objects as an object not to be tracked.
  - 3. The video content analysis method of claim 1, wherein:
    - classifying the first object as an object to be tracked includes classifying the first object as an object above a predetermined height or volume threshold.
  - 4. The video content analysis method of claim 1, wherein analyzing the image data to detect one or more objects depicted in the video sequence includes detecting at least one blob that corresponds to the one or more objects.
  - 5. The video content analysis method of claim 1, further comprising:
    - classifying at least the first object of the one or more detected objects as a person to be tracked by using the depth data associated with the one or more detected objects and without analyzing depth data associated with a portion of the video image that is not part of the one or more detected objects.
  - 6. The video content analysis method of claim 1, wherein the depth data is determined by a single depth sensor.
  - 7. The video content analysis method of claim 1, wherein analyzing the 2D image data to detect one or more objects depicted in the video sequence as potential human beings includes performing two-dimensional (2D) analysis on the image data to perform motion and change detection.
  - 8. The video content analysis method of claim 7, wherein:
    - analyzing the 2D image data to detect one or more objects depicted in the video sequence as potential human beings further includes, based on the motion and change detection, detecting at least one blob that corresponds to the one or more objects; and
      
      using the depth data along with the one or more detected objects to classify at least the first object of the one or more detected objects as a person to be tracked includes classifying only part of the blob as a target to be tracked.
  - 9. The video content analysis method of claim 8, further comprising:
    - using the depth data with the detected blob to determine that part of the blob does not correspond to the target to be tracked.
  - 10. The video content analysis method of claim 8, wherein using the depth data along with the one or more detected objects to classify at least the first object of the one or more detected objects as a person to be tracked includes using the depth data to determine that the one or more detected objects include two people.

11. A video surveillance system comprising:
- one or more sensors that capture two-dimensional (2D) image data and depth data; and
  
  a video content analysis system configured to;
  
  receive a video sequence that includes a plurality of frames, each frame including the 2D image data;
  
  for each frame, receive the 2D image data for that frame, and also receive depth data associated with the video image;
  
  analyze the 2D image data to detect at least a first image blob in the video sequence;
  
  use the depth data to project the first image blob onto a plurality of Z-planes, thereby creating a plurality of blob slices;
  
  based on a height threshold, separate the blob slices into a ground plane blob slice, and non-ground plane blob slices;
  
  create a refined blob that includes non-ground plane blob slices, and only a portion of the ground plane blob slice;
  
  perform object detection on the refined blob, to determine that the blob corresponds to a human object, thereby detecting a person in the video;
  
  perform tracking on the detected person; and
  
  perform event detection analysis on the detected person.
- View Dependent Claims (12, 13)
- - 12. The video content analysis method of claim 11, wherein performing tracking includes performing motion and change detection.
  - 13. The video content analysis method of claim 11, wherein performing object detection on the refined blob includes determining that the refined blob includes two people.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Motorola Solutions, Inc.
Original Assignee
Avigilon Fortress Corporation
Inventors
Zhang, Zhong, Myers, Gary W., Venetianer, Peter L.
Primary Examiner(s)
CESE, KENNY A

Application Number

US15/006,117
Publication Number

US 20160140397A1
Time in Patent Office

644 Days
Field of Search
US Class Current
CPC Class Codes

A61B 2505/07   Home care

A61B 5/0013   Medical image data A61B1/00...

A61B 5/0046   Arrangements of imaging app...

A61B 5/0077   Devices for viewing the sur...

A61B 5/1072   measuring distances on the ...

A61B 5/1073   Measuring volume, e.g. of l...

A61B 5/1079   using optical or photograph...

A61B 5/1113   Local tracking of patients,...

A61B 5/1116   Determining posture transit...

A61B 5/1117   Fall detection

A61B 5/1128   using image analysis A61B5/...

A61B 5/1176   Recognition of faces

A61B 5/7282   Event detection, e.g. detec...

A61B 5/746   Alarms related to a physiol...

G06T 2207/10016   Video; Image sequence

G06T 2207/30196   Human being; Person

G06T 2207/30232   Surveillance

G06T 7/0016   involving temporal comparison

G06T 7/246   using feature-based methods...

G06T 7/50   Depth or shape recovery

G06T 7/55 : from multiple images

G06T 7/579 : from motion

G06T 7/62 : of area, perimeter, diamete...

G06T 7/73 : using feature-based methods

G06V 20/41 : Higher-level, semantic clus...

G06V 20/44 : Event detection

G06V 20/52 : Surveillance or monitoring ...

G06V 40/103 : Static body considered as a...

G06V 40/1365 : Matching; Classification

G08B 13/19615 : wherein said pattern is def...

G08B 21/043 : detecting an emergency even...

G08B 21/0476 : Cameras to detect unsafe co...

H04N 7/18 : Closed-circuit television [...

H04N 7/181 : for receiving images from a...

View All

System and method for video content analysis using depth sensing

First Claim

3 Assignments

0 Petitions

Accused Products

Abstract

64 Citations

13 Claims

Specification

Solutions

Use Cases

Quick Links

System and method for video content analysis using depth sensing

First Claim

3 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

64 Citations

13 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links