Plan-view projections of depth image data for object tracking
First Claim
1. A method of tracking an object observable in a time series of video frames representative of a visual scene, comprising:
- generating a three-dimensional point cloud having members with one or more associated attributes obtained from the time series of video frames and representing selected image pixels in a three-dimensional coordinate system spanned by a ground plane and a vertical axis orthogonal to the ground plane;
partitioning the three-dimensional point cloud into a set of vertically-oriented bins;
mapping the partitioned three-dimensional point cloud into one or more plan-view images each containing for each vertically-oriented bin a corresponding pixel having one or more values computed based upon one or more attributes of the three-dimensional point cloud members occupying the corresponding vertically-oriented bin; and
tracking the object based at least in part upon one or more of the plan-view images.
4 Assignments
0 Petitions
Accused Products
Abstract
Object tracking systems and methods are described. In one aspect, a three-dimensional point cloud is generated from a time series of video frames and partitioned into a set of vertically-oriented bins. The point cloud is mapped into one or more plan-view images each containing for each vertically-oriented bin a corresponding pixel having one or more values computed based upon one or more attributes of the point cloud members occupying the corresponding vertically-oriented bin. The object is tracked based at least in part upon one or more of the plan-view images. In another aspect, one or more original object templates are extracted from at least one of the one or more plan-view images, and the object is tracked based at least in part upon a comparison of at least one of the object templates with regions of the corresponding plan-view images. In another aspect, the point cloud may be discretized along the vertical axis into two or more horizontal partitions.
314 Citations
68 Claims
-
1. A method of tracking an object observable in a time series of video frames representative of a visual scene, comprising:
-
generating a three-dimensional point cloud having members with one or more associated attributes obtained from the time series of video frames and representing selected image pixels in a three-dimensional coordinate system spanned by a ground plane and a vertical axis orthogonal to the ground plane; partitioning the three-dimensional point cloud into a set of vertically-oriented bins; mapping the partitioned three-dimensional point cloud into one or more plan-view images each containing for each vertically-oriented bin a corresponding pixel having one or more values computed based upon one or more attributes of the three-dimensional point cloud members occupying the corresponding vertically-oriented bin; and tracking the object based at least in part upon one or more of the plan-view images. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
-
41. A system for tracking an object observable in a time series of video frames representative of a visual scene, comprising one or more processing modules operable to:
-
generate a three-dimensional point cloud having members with one or more associated attributes obtained from the time series of video frames and representing selected image pixels in a three-dimensional coordinate system spanned by a ground plane and a vertical axis orthogonal to the ground plane; partition the three-dimensional point cloud into a set of vertically-oriented bins; map the partitioned three-dimensional point cloud into one or more plan-view images each containing for each vertically-oriented bin a corresponding pixel having one or more values computed based upon one or more attributes of the three-dimensional point cloud members occupying the corresponding vertically-oriented bin; and track the object based at least in part upon one or more of the plan-view images.
-
-
42. A method of tracking an object observable in a time series of video frames representative of a visual scene, comprising:
-
generating a three-dimensional point cloud having members with one or more associated attributes obtained from the time series of video frames and representing selected pixels in a three-dimensional coordinate system spanned by a ground plane and a vertical axis orthogonal to the ground plane; partitioning the three-dimensional point cloud into a set of vertically-oriented bins; mapping the partitioned three-dimensional point cloud into one or more plan-view images each containing for each vertically-oriented bin a corresponding pixel having one or more values computed based upon the three-dimensional point cloud members occupying the corresponding vertically-oriented bin; extracting one or more object templates from at least one of the one or more plan-view images; and tracking the object based at least in part upon a comparison of at least one of the object templates with regions of the corresponding plan-view images. - View Dependent Claims (43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60)
-
-
61. A system for tracking an object observable in a time series of video frames representative of a visual scene, comprising one or more processing modules operable to:
-
generate a three-dimensional point cloud having members with one or more associated attributes obtained from the time series of video frames and representing selected pixels in a three-dimensional coordinate system spanned by a ground plane and a vertical axis orthogonal to the ground plane; partition the three-dimensional point cloud into a set of vertically-oriented bins; map the partitioned three-dimensional point cloud into one or more plan-view images each containing for each vertically-oriented bin a corresponding pixel having one or more values computed based upon the three-dimensional point cloud members occupying the corresponding vertically-oriented bin; extract one or more original object templates from at least one of the one or more plan-view images; track the object based at least in part upon a comparison of at least one of object templates with regions of the corresponding plan-view images.
-
-
62. A method of tracking an object observable in a time series of video frames representative of a visual scene, comprising:
-
generating a three-dimensional point cloud having members with one or more associated attributes obtained from the time series of video frames and representing selected image pixels in a three-dimensional coordinate system spanned by a ground plane and a vertical axis orthogonal to the ground plane; partitioning the three-dimensional point cloud into a set of vertically-oriented bins; discretizing the three-dimensional point cloud along the vertical axis into two or more horizontal partitions; mapping the partitioned three-dimensional point cloud into two or more plan-view images each containing for each vertically-oriented bin a corresponding pixel having one or more values computed based upon the three-dimensional point cloud members occupying a respective horizontal partition of the corresponding vertically-oriented bin; and tracking the object based at least in part upon one or more of the plan-view images. - View Dependent Claims (63, 64, 65, 66, 67, 68)
-
Specification