System and method for monitoring a retail environment using video content analysis with depth sensing
First Claim
Patent Images
1. A method of monitoring a retail environment comprising:
- taking a video at a store with a video sensor, the video comprising a plurality of frames, each frame including two-dimensional (2D) image data;
for each frame, receiving depth data associated with the 2D image data, the depth data corresponding to one or more distances from the video sensor to features represented by the 2D image data;
analyzing the 2D image data without analyzing the depth data to detect one or more objects in the video;
using the depth data to classify the one or more detected objects in the store depicted in the video, classification of the one or more detected objects comprising at least one of person classification and inventory classification and being based on a volume of the one or more detected objects; and
detecting an event of the classified one or more objects,wherein the volume is determined by using the depth data along with the 2D image data to determine a plurality of convex hull slices on different Z-planes, and by summing areas of the plurality of convex hull slices.
5 Assignments
0 Petitions
Accused Products
Abstract
A method and system for monitoring a retail environment by performing video content analysis based on two-dimensional image data and depth data are disclosed. Accuracy in customer actions to provide assistance, change marketing behavior, safety and theft, for example, is increase by analyzing video containing two-dimensional image data and associated depth data. Height data may be obtained from depth data to assist in object detection, object classification (e.g., detection a customer or inventory) and/or event detection.
74 Citations
32 Claims
-
1. A method of monitoring a retail environment comprising:
-
taking a video at a store with a video sensor, the video comprising a plurality of frames, each frame including two-dimensional (2D) image data; for each frame, receiving depth data associated with the 2D image data, the depth data corresponding to one or more distances from the video sensor to features represented by the 2D image data; analyzing the 2D image data without analyzing the depth data to detect one or more objects in the video; using the depth data to classify the one or more detected objects in the store depicted in the video, classification of the one or more detected objects comprising at least one of person classification and inventory classification and being based on a volume of the one or more detected objects; and detecting an event of the classified one or more objects, wherein the volume is determined by using the depth data along with the 2D image data to determine a plurality of convex hull slices on different Z-planes, and by summing areas of the plurality of convex hull slices. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32)
-
Specification