Method for encoding and transcoding multiple video objects with variable temporal resolution
Abstract
A video is first partitioned into video objects. If the video is uncompressed, the partitioning is done with segmentation planes; if the video is compressed, a demultiplexer is used instead. Shape features are extracted over time from each partitioned object. The extracted shape features are combined to determine a temporal resolution for each object over time. The temporal resolutions are then used to encode or transcode the video objects as an output compressed video.
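The combining step described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the feature names `hamming` and `hausdorff`, the normalization of feature values to [0, 1], the weighted average, the threshold, the two candidate frame rates, and the rule that a larger shape change calls for a higher temporal resolution.

```python
from typing import Dict, List


def combine_shape_features(features: Dict[str, List[float]],
                           weights: Dict[str, float]) -> float:
    """Weighted mean of each feature's average value over time.

    Feature values are assumed (hypothetically) to be normalized to [0, 1].
    """
    total_weight = sum(weights[name] for name in features)
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    score = 0.0
    for name, values in features.items():
        mean = sum(values) / len(values) if values else 0.0
        score += weights[name] * mean
    return score / total_weight


def choose_frame_rate(score: float, full_rate: float = 30.0,
                      low_rate: float = 10.0, threshold: float = 0.2) -> float:
    """Hypothetical rule: objects whose shape changes a lot keep the full rate."""
    return full_rate if score >= threshold else low_rate


# Made-up shape features for two objects, measured at three time instants.
objects = {
    "background": {"hamming": [0.01, 0.02, 0.01], "hausdorff": [0.05, 0.05, 0.02]},
    "speaker": {"hamming": [0.30, 0.25, 0.35], "hausdorff": [0.40, 0.50, 0.45]},
}
weights = {"hamming": 0.5, "hausdorff": 0.5}

rates = {name: choose_frame_rate(combine_shape_features(feats, weights))
         for name, feats in objects.items()}
print(rates)  # {'background': 10.0, 'speaker': 30.0}
```

In this sketch each object ends up with its own frame rate, which an encoder or transcoder could then use when coding that object.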
19 Claims
1. A method for coding a video, comprising:
partitioning the video into a plurality of objects;
measuring, over time, differences in shape of each object to determine a plurality of shape features of each object;
combining the plurality of shape features to determine a temporal resolution for each object; and
coding each object according to the corresponding temporal resolution of the object.
where α₁(m,n) and α₂(m,n) are corresponding segmentation planes at different time instants.
8. The method of claim 3 wherein the shape difference is a Hausdorff distance, which is defined as a maxmin function between sets of pixels associated with the objects.
9. The method of claim 8 wherein the maxmin function is
10. The method of claim 1 wherein the video includes a plurality of frames, and each frame includes a plurality of macroblocks, and the macroblocks are coded as opaque blocks, transparent blocks, and boundary blocks.
11. The method of claim 1 further comprising:
coding the shape features of the objects as meta-data.
12. The method of claim 1 further comprising:
extracting, over time, a motion feature from each object; and
combining, over time, the motion features with the shape features to determine the temporal resolution for each object over time.
13. The method of claim 1 further comprising:
extracting, over time, a coding complexity from each object; and
combining, over time, the coding complexity with the shape features to determine the temporal resolution for each object over time.
14. The method of claim 1 wherein the shape features of the objects are extracted from a plurality of cue levels of the video.
15. An apparatus for coding a video, comprising:
means for partitioning a video into a plurality of objects;
means for measuring, over time, differences in shape of each object to determine a plurality of shape features of each object;
means for combining the plurality of shape features to determine a temporal resolution for each object; and
means for coding each object according to the corresponding temporal resolution of the object.
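The shape differences named in the claims can be sketched on binary segmentation planes as follows. This is only an illustration under stated assumptions: the formulas themselves are not reproduced in the claims as listed here, so the pixel-wise disagreement count between two planes and the use of Euclidean distance inside the maxmin (directed Hausdorff) function are standard textbook choices, not the patent's wording. The function names and the sample planes are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

# Binary segmentation plane: 1 inside the object, 0 outside (assumed encoding).
Plane = Sequence[Sequence[int]]
Pixel = Tuple[int, int]


def plane_difference(plane1: Plane, plane2: Plane) -> int:
    """Count the positions (m, n) at which two segmentation planes disagree."""
    return sum(abs(a - b)
               for row1, row2 in zip(plane1, plane2)
               for a, b in zip(row1, row2))


def object_pixels(plane: Plane) -> List[Pixel]:
    """Coordinates (m, n) of all pixels that belong to the object."""
    return [(m, n) for m, row in enumerate(plane)
            for n, value in enumerate(row) if value]


def maxmin_distance(set_a: Sequence[Pixel], set_b: Sequence[Pixel]) -> float:
    """Maxmin function: max over a in A of min over b in B of d(a, b).

    d is taken to be the Euclidean distance, which is an assumption.
    """
    if not set_a or not set_b:
        raise ValueError("both pixel sets must be non-empty")
    return max(min(math.dist(a, b) for b in set_b) for a in set_a)


# The same object at two time instants; it has shifted one pixel to the right.
plane_t1 = [[0, 1, 1, 0],
            [0, 1, 1, 0],
            [0, 0, 0, 0]]
plane_t2 = [[0, 0, 1, 1],
            [0, 0, 1, 1],
            [0, 0, 0, 0]]

print(plane_difference(plane_t1, plane_t2))  # 4
print(maxmin_distance(object_pixels(plane_t1), object_pixels(plane_t2)))  # 1.0
```

The maxmin function as written is directed: swapping the two sets can give a different value, so a symmetric measure would take the larger of the two directions.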
Specification