Pre-processing method and system for data reduction of video sequences and bit rate reduction of compressed video sequences using spatial filtering
First Claim
1. A method for pre filtering processing an original video sequence, the original video sequence comprising a plurality of video frames, the method comprising:
- for each video frame of the plurality of video frames of the original video sequence;
identifying a bounding geometric shape that encloses at least a portion of an important region-of-interest in the video frame, the bounding geometric shape serving as a foreground region;
identifying a portion of the video frame outside the bounding geometric shape as an unimportant background region;
applying a first filter operation in the foreground region and not in the background region, the first filter operation providing data reduction in the foreground region; and
applying a second filter operation in the background region and not in the foreground region, the second filter operation providing greater data reduction in the unimportant background region than the first filter operation would provide if applied to a same region; and
encoding the plurality of video frames after the first and second filter operations have been applied to each of the plurality of video frames.
0 Assignments
0 Petitions
Accused Products
Abstract
Methods for pre-processing video sequences prior to compression to provide data reduction of the video sequence. Also, after compression of the pre-processed video sequence, the bit rate of the pre-processed and compressed video sequence will be lower than the bit rate of the video sequence after compression but without pre-processing. Pre-processing may include spatial anisotropic diffusion filtering such as Perona-Malik filtering, Fallah-Ford filtering, or omni-directional filtering that extends Perona-Malik filtering to perform filtering in at least one diagonal direction. Pre-processing may also include performing filtering differently on a foreground region than on a background region of a video frame. This method includes identifying pixel locations having pixel values matching characteristics of human skin and determining a bounding shape for each contiguous grouping of matching pixel locations. The foreground region is comprised of pixel locations contained in a bounding shape and the background region is comprised of all other pixel locations.
52 Citations
32 Claims
-
1. A method for pre filtering processing an original video sequence, the original video sequence comprising a plurality of video frames, the method comprising:
-
for each video frame of the plurality of video frames of the original video sequence; identifying a bounding geometric shape that encloses at least a portion of an important region-of-interest in the video frame, the bounding geometric shape serving as a foreground region; identifying a portion of the video frame outside the bounding geometric shape as an unimportant background region; applying a first filter operation in the foreground region and not in the background region, the first filter operation providing data reduction in the foreground region; and applying a second filter operation in the background region and not in the foreground region, the second filter operation providing greater data reduction in the unimportant background region than the first filter operation would provide if applied to a same region; and encoding the plurality of video frames after the first and second filter operations have been applied to each of the plurality of video frames. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for processing an original video sequence, the original video sequence comprising a plurality of video frames, each video frame comprising a plurality of pixel locations, the method comprising:
-
for each video frame; specifying a bounding shape that encloses at least a portion of a region-of-interest in the video frame from the plurality of video frames of the original video sequence; filtering pixel locations in the bounding shape differently than other pixel locations in the video frame; outputting a pre-filtered video sequence comprising a plurality of pre-filtered video frames; and compressing the pre-filtered video sequence using a compression method to produce a pre-filtered and compressed video sequence, wherein a bit rate associated with the pre-filtered and compressed video sequence is lower than a bit rate that would result from compressing the original video sequence using the compression method without performing the specifying of the bounding shape and the filtering of pixel locations. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A non-transitory computer readable storage medium storing a computer program for pre-filtering an original video sequence, the original video sequence comprising a plurality of video frames, the computer program executable by at least one processor, the computer program comprising sets of instructions for:
-
specifying a bounding geometric shape that encloses at least a portion of a region-of-interest of a video frame from the plurality of video frames of the original video sequence; and applying a pre-filter operation to a region outside the bounding geometric shape and not to a region bounded by the bounding geometric shape, the pre-filter operation for reducing data content of unimportant regions in the video frame while preserving sharp edges inside the bounding geometric shape. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A method for processing a plurality of video frames, each video frame comprising a plurality of pixel locations, each pixel location comprising a pixel value, the method comprising:
-
automatically identifying a foreground region for a video frame by identifying a bounding geometric shape that encloses at least a portion of a region-of-interest in the video frame and associating a region enclosed by the bounding geometric shape with the foreground region; identifying a region outside the geometric bounding shape as a background region of the video frame; filtering the background region and not the foreground region with a pre-filter operation, the pre-filter operation for reducing data content of unimportant regions in the video frame while preserving sharp edges inside the bounding geometric shape; and after filtering the background region in the video frame, encoding the video frame. - View Dependent Claims (20, 21)
-
-
22. A method for pre-filtering a plurality of video frames to reduce data content for encoding, each video frame comprising a plurality of pixel locations, each pixel location comprising first and second pixel values, the method comprising:
-
automatically identifying a foreground region for a video frame by identifying a region-of-interest in the video frame and associating the region-of-interest with the foreground region, wherein automatically identifying the foreground region comprises identifying, from the plurality of pixel locations, a set of pixel locations that each comprises (i) a first pixel value between a first minimum chrominance value and a first maximum chrominance value, and (ii) a second pixel value between a second minimum chrominance value and a second maximum chrominance value; identifying a background region of the video frame as pixel locations not in the foreground region; and filtering the video frame by filtering the foreground region and the background region differently. - View Dependent Claims (23, 24, 25, 26)
-
-
27. A method for pre-filtering a plurality of video frames for a video conference, wherein each video frame comprises a plurality of pixel locations, the method comprising:
-
identifying a region-of-interest of a video frame of the plurality of video frames by identifying a plurality of pixel locations having attributes similar to a human skin tone; bounding, with a bounding geometric shape, an approximation of the region-of-interest to identify a foreground region as a region within the bounding geometric shape; defining a first binary mask for a pixel locations inside the bounding geometric shape; defining a second binary mask for a background region for the video frame, the background region covering pixel locations outside the bounding geometric shape; filtering the video frame by using the first binary mask to filter the foreground region using a first pre-filter operation and using the second binary mask to filter the background region using a second pre-filter operation; and combining the filtered foreground region and the filtered background region to form a single filtered video frame. - View Dependent Claims (28, 29, 30, 31, 32)
-
Specification