System and method for video content analysis using depth sensing
First Claim
Patent Images
1. A video content analysis method comprising:
- capturing a video sequence that includes a plurality of frames, each frame including a video image;
for each frame, receiving two-dimensional (2D) image data of the video image and also receiving depth data associated with the image data;
analyzing the 2D image data, and based on an analysis of the 2D image data without the depth data detecting one or more objects depicted in the video sequence as potential human beings;
using the depth data along with the one or more detected objects to classify at least a first object of the one or more detected objects as a person to be tracked, wherein a volume of the one or more detected objects is used to classify at least the first object as a person to be tracked;
performing tracking on at least the first classified object; and
performing event detection analysis on the first classified object,wherein the volume is determined by using the depth data along with the 2D image data to determine a plurality of convex hull slices on different Z-planes, and by summing areas of the plurality of convex hull slices.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system for performing video content analysis based on two-dimensional image data and depth data are disclosed. Video content analysis may be performed on the two-dimensional image data, and then the depth data may be used along with the results of the video content analysis of the two-dimensional data for tracking and event detection.
64 Citations
13 Claims
-
1. A video content analysis method comprising:
-
capturing a video sequence that includes a plurality of frames, each frame including a video image; for each frame, receiving two-dimensional (2D) image data of the video image and also receiving depth data associated with the image data; analyzing the 2D image data, and based on an analysis of the 2D image data without the depth data detecting one or more objects depicted in the video sequence as potential human beings; using the depth data along with the one or more detected objects to classify at least a first object of the one or more detected objects as a person to be tracked, wherein a volume of the one or more detected objects is used to classify at least the first object as a person to be tracked; performing tracking on at least the first classified object; and performing event detection analysis on the first classified object, wherein the volume is determined by using the depth data along with the 2D image data to determine a plurality of convex hull slices on different Z-planes, and by summing areas of the plurality of convex hull slices. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A video surveillance system comprising:
-
one or more sensors that capture two-dimensional (2D) image data and depth data; and a video content analysis system configured to; receive a video sequence that includes a plurality of frames, each frame including the 2D image data; for each frame, receive the 2D image data for that frame, and also receive depth data associated with the video image; analyze the 2D image data to detect at least a first image blob in the video sequence; use the depth data to project the first image blob onto a plurality of Z-planes, thereby creating a plurality of blob slices; based on a height threshold, separate the blob slices into a ground plane blob slice, and non-ground plane blob slices; create a refined blob that includes non-ground plane blob slices, and only a portion of the ground plane blob slice; perform object detection on the refined blob, to determine that the blob corresponds to a human object, thereby detecting a person in the video; perform tracking on the detected person; and perform event detection analysis on the detected person. - View Dependent Claims (12, 13)
-
Specification