Transform coding of dense motion vector fields for frame and object based video coding applications
First Claim
1. A compression method of representing changes between corresponding pixels in first and second video image frames, comprising:
- determining motion vectors for all pixel locations in each region in a set of regions of the first video image frame by estimating the motion of all of the pixels in each region relative to pixels in the second video image frame;
wherein the regions correspond to regions of a first image feature having a non-rectangular boundary in the first image frame and the motion vectors for at least one of the regions represent a motion vector field having a non-rectangular shape corresponding to the boundary of the first image feature;
transforming the motion vector field having a non-rectangular shape to a motion vector field of rectangular configuration for encoding in the first compressed format, where the rectangular configuration comprises a two dimensional array of motion vectors, and transforming the motion vector field includes assigning motion vectors to pixel locations within the rectangular configuration but not overlapping the non-rectangular shape of the motion vector field; and
encoding the motion vectors for each of the regions in a first compressed format to produce compressed motion vectors for each region, including transforming the motion vectors for each region into spatial frequency components and transforming the motion vectors for each pixel location in the rectangular configuration into spatial frequency components.
2 Assignments
0 Petitions
Accused Products
Abstract
A video compression encoding process compresses or encodes dense motion vector fields to provide and improved compression ratio. In accordance with this method, a dense motion vector field is obtained for an object or a portion of an object between a pair of video image frames. The dense motion vector field may correspond, for example, to an object or other image portion of arbitrary configuration or size. The configuration of the dense motion vector field is extrapolated to a regular, preferably rectangular, configuration to facilitate encoding or compression. The dense motion vector field with its extrapolated regular configuration is encoded or compressed according to conventional encoding transformations such as, for example, discrete cosine transformation (DCT) or lattice wavelet compression.
-
Citations
11 Claims
-
1. A compression method of representing changes between corresponding pixels in first and second video image frames, comprising:
-
determining motion vectors for all pixel locations in each region in a set of regions of the first video image frame by estimating the motion of all of the pixels in each region relative to pixels in the second video image frame;
wherein the regions correspond to regions of a first image feature having a non-rectangular boundary in the first image frame and the motion vectors for at least one of the regions represent a motion vector field having a non-rectangular shape corresponding to the boundary of the first image feature;transforming the motion vector field having a non-rectangular shape to a motion vector field of rectangular configuration for encoding in the first compressed format, where the rectangular configuration comprises a two dimensional array of motion vectors, and transforming the motion vector field includes assigning motion vectors to pixel locations within the rectangular configuration but not overlapping the non-rectangular shape of the motion vector field; and encoding the motion vectors for each of the regions in a first compressed format to produce compressed motion vectors for each region, including transforming the motion vectors for each region into spatial frequency components and transforming the motion vectors for each pixel location in the rectangular configuration into spatial frequency components. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-readable medium storing computer-executable programming for compressing changes between corresponding pixels in first and second video image frames, the medium comprising:
-
programming for determining motion vectors for all pixel locations in each region in a set of regions of the first video image frame by estimating the motion of all of the pixels in each region relative to pixels in the second video image frame; and programming for encoding all of the motion vectors for each of the regions in a first compressed format to produce compressed motion vectors for each region, including transforming the motion vectors for each region into spatial frequency components and quantizing the spatial frequency components; wherein the regions correspond to regions of a first image feature having a non-rectangular boundary in the first image frame and the motion vectors for at least one of the regions represent a motion vector field having a non-rectangular shape corresponding to the boundary of the first image feature, the medium further comprising programming for transforming the motion vector field having a non-rectangular shape to a motion vector field of rectangular configuration for encoding in the first compressed format, where the rectangular configuration comprises a two dimensional array of motion vectors, and the programming for transforming the motion vector field includes programming for assigning motion vectors to pixel locations within the rectangular configuration but not overlapping the non-rectangular shape of the motion vector field; and programming for transforming the motion vectors for each pixel location in the rectangular configuration into spatial frequency components. - View Dependent Claims (9, 10, 11)
-
Specification