Method for segmenting 3D objects from compressed videos
First Claim
1. A method for segmenting a three dimensional object from a compressed video, the compressed video including a plurality of frames separated in time, and each frame including a plurality of macro-blocks separated in space, comprising:
parsing transformed coefficients for each macro-block;
determining a spatial/temporal gradient for each macro-block based on the transformed coefficients;
selecting a particular macro-block with a minimum spatial/temporal gradient magnitude as a seed macro-block;
measuring distances between the seed macro-block and spatially and temporally adjacent macro-blocks based on the transformed coefficients; and
growing a volume around the seed macro-block using the adjacent macro-blocks having distances less than a predetermined threshold.
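The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. It assumes the parsed coefficients have already been arranged into a hypothetical array `features` of shape `(T, H, W, C)`, one feature vector per macro-block (for example, DC coefficients), with at least two entries along each of the first three axes. The gradient estimate (finite differences), the distance measure (Euclidean distance to the seed), and the 6-connected neighborhood are illustrative choices, as are the names `gradient_magnitude` and `grow_volume`.

```python
from collections import deque

import numpy as np


def gradient_magnitude(features):
    """Spatial/temporal gradient magnitude per macro-block.

    features: array of shape (T, H, W, C). Returns an array of shape (T, H, W).
    Finite differences are an assumption; the claim does not fix the estimator.
    """
    gt, gy, gx = np.gradient(features.astype(float), axis=(0, 1, 2))
    return np.sqrt((gt ** 2 + gy ** 2 + gx ** 2).sum(axis=-1))


def grow_volume(features, threshold):
    """Select the minimum-gradient macro-block as seed and grow a volume around it."""
    features = features.astype(float)
    grad = gradient_magnitude(features)

    # Seed: the macro-block with the smallest gradient magnitude.
    seed = tuple(int(i) for i in np.unravel_index(np.argmin(grad), grad.shape))
    seed_vec = features[seed]

    volume = np.zeros(grad.shape, dtype=bool)
    volume[seed] = True
    queue = deque([seed])

    # Spatially and temporally adjacent macro-blocks (6-connected, assumed).
    offsets = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]

    while queue:
        t, y, x = queue.popleft()
        for dt, dy, dx in offsets:
            n = (t + dt, y + dy, x + dx)
            if not all(0 <= i < s for i, s in zip(n, grad.shape)):
                continue  # outside the video
            if volume[n]:
                continue  # already part of the volume
            # Distance between the seed and the candidate macro-block.
            if np.linalg.norm(features[n] - seed_vec) < threshold:
                volume[n] = True
                queue.append(n)
    return seed, volume
```

Calling `grow_volume(features, threshold)` returns the seed index and a boolean mask marking the macro-blocks in the grown volume. Comparing every candidate against the seed, rather than against its immediate neighbor, keeps the volume from drifting across a slow gradient; the claim language supports either reading, so this is a design choice of the sketch.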
Abstract
A method segments a video into objects, without user assistance. An MPEG compressed video is converted to a structure called pseudo spatial/temporal data using DCT coefficients and motion vectors. The compressed video is first parsed and the pseudo spatial/temporal data are formed. Seed macro-blocks are identified using, e.g., the DCT coefficients and changes in the motion vectors of macro-blocks.
A video volume is “grown” around each seed macro-block using the DCT coefficients and motion distance criteria. Self-descriptors are assigned to the volume, and mutual descriptors are assigned to pairs of similar volumes. These descriptors capture motion and spatial information of the volumes. Similarity scores are determined for each possible pair-wise combination of volumes. The pair of volumes that gives the largest score is combined iteratively. In the combining stage, volumes are classified and represented in a multi-resolution coarse-to-fine hierarchy of video objects.
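The iterative combining stage described above can be sketched as a simple agglomerative merge. This is only an illustration under stated assumptions: each volume is summarized by a hypothetical descriptor (a mean feature vector `mean` and a macro-block count `size`), and the similarity score is taken to be the negative Euclidean distance between means. The abstract does not specify the actual self-descriptors, mutual descriptors, or scoring function, so `similarity` and `merge_volumes` are stand-ins.

```python
from itertools import combinations

import numpy as np


def similarity(a, b):
    """Hypothetical similarity score: higher means more similar."""
    return -float(np.linalg.norm(a["mean"] - b["mean"]))


def merge_volumes(volumes):
    """Repeatedly combine the pair of volumes with the largest similarity score.

    volumes: dict mapping integer ids to {"mean": array-like, "size": int}.
    Returns a list of (id_a, id_b, merged_id) tuples in merge order.
    """
    current = {
        k: {"mean": np.asarray(v["mean"], dtype=float), "size": int(v["size"])}
        for k, v in volumes.items()
    }
    next_id = max(current) + 1
    history = []

    while len(current) > 1:
        # Score every pair-wise combination and pick the best one.
        i, j = max(
            combinations(sorted(current), 2),
            key=lambda p: similarity(current[p[0]], current[p[1]]),
        )
        a, b = current.pop(i), current.pop(j)

        # Merged descriptor: size-weighted mean of the two volumes.
        size = a["size"] + b["size"]
        mean = (a["mean"] * a["size"] + b["mean"] * b["size"]) / size
        current[next_id] = {"mean": mean, "size": size}

        history.append((i, j, next_id))
        next_id += 1
    return history
```

Reading the returned merge history from last to first walks the hierarchy from its coarsest level (one object covering everything) down to the finest level (the original volumes), which is one way to obtain a coarse-to-fine representation.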
37 Citations
23 Claims
Claim 1 is the independent claim, recited in full under First Claim above. Dependent claims: 2 through 23.
Specification