Feature-based hybrid video codec comparing compression efficiency of encodings
First Claim
1. A computer implemented method of processing a series of video frames of video data, comprising the computer implemented steps of:
- encoding one or more portions of the video data using a first encoding process and one or more other portions of the video data using a feature-based encoding process by;
processing a plurality of decoded video frames in the series to detect one or more instances of a candidate feature, said candidate feature being a region of pels exhibiting encoding complexity relative to neighboring pels;
said detection including determining positional information for the instances of the candidate feature in the one or more decoded video frames, the positional information for a respective instance of the candidate feature including one or more of;
a frame identifier for the respective instance of the candidate feature, a position within that respective frame, or a spatial perimeter of the respective instance of the candidate feature;
determining, using a motion compensated prediction process, an instance of the candidate feature in a subject video frame in the series using the one or more decoded video frames, where said motion compensated prediction is initialized with the positional information from instances of the candidate feature in the decoded video frame;
aggregating one or more of the candidate feature instances;
transforming one or more of the candidate feature instances;
forming a first feature-based model based on the aggregated, transformed candidate feature instances, the first feature-based model enabling prediction in the subject video frame of an appearance and a source position of a substantially matching feature instance, where the substantially matching feature instance is a key feature instance, the first feature-based model resulting in a feature-based encoding;
comparing compression efficiency of the feature-based encoding to an encoding from the first video encoding process; and
using results of the comparing step, applying feature-based encoding to portions of one or more of the video frames, and applying the first encoding process to other portions of the one or more video frames.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods of processing video data are provided. Video data having a series of video frames is received and processed. One or more instances of a candidate feature are detected in the video frames. The previously decoded video frames are processed to identify potential matches of the candidate feature. When a substantial amount of portions of previously decoded video frames include instances of the candidate feature, the instances of the candidate feature are aggregated into a set. The candidate feature set is used to create a feature-based model. The feature-based model includes a model of deformation variation and a model of appearance variation of instances of the candidate feature. The feature-based model compression efficiency is compared with the conventional video compression efficiency.
185 Citations
58 Claims
-
1. A computer implemented method of processing a series of video frames of video data, comprising the computer implemented steps of:
-
encoding one or more portions of the video data using a first encoding process and one or more other portions of the video data using a feature-based encoding process by; processing a plurality of decoded video frames in the series to detect one or more instances of a candidate feature, said candidate feature being a region of pels exhibiting encoding complexity relative to neighboring pels; said detection including determining positional information for the instances of the candidate feature in the one or more decoded video frames, the positional information for a respective instance of the candidate feature including one or more of;
a frame identifier for the respective instance of the candidate feature, a position within that respective frame, or a spatial perimeter of the respective instance of the candidate feature;determining, using a motion compensated prediction process, an instance of the candidate feature in a subject video frame in the series using the one or more decoded video frames, where said motion compensated prediction is initialized with the positional information from instances of the candidate feature in the decoded video frame; aggregating one or more of the candidate feature instances; transforming one or more of the candidate feature instances; forming a first feature-based model based on the aggregated, transformed candidate feature instances, the first feature-based model enabling prediction in the subject video frame of an appearance and a source position of a substantially matching feature instance, where the substantially matching feature instance is a key feature instance, the first feature-based model resulting in a feature-based encoding; comparing compression efficiency of the feature-based encoding to an encoding from the first video encoding process; and using results of the comparing step, applying feature-based encoding to portions of one or more of the video frames, and applying the first encoding process to other portions of the one or more video frames. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A digital processing system for processing video data having one or more video frames comprising:
-
one or more computer processors executing an encoder; the encoder configured to use feature-based encoding to encode portions of the video frames and a first encoding process to encode other portions of the video frames by; detecting one or more instances of a candidate feature in one or more of the video frames, said candidate feature being a region of pels exhibiting encoding complexity relative to neighboring pels; using a motion compensated prediction process, segmenting the one or more instances of the candidate feature from non-features in the one or more video frames, the motion compensated prediction process selecting decoded video frames having features corresponding to the one or more instances of the candidate feature; determining positional information from the one or more decoded video frames; forming a feature-based model based on aggregating the candidate feature instances, the feature-based model including the positional information from the decoded video frames; normalizing the one or more candidate feature instances using the feature-based model, said normalizing using the positional information from the one or more decoded video frames as a positional prediction, resulting normalization being prediction of the one or more candidate feature instances in a subject video frame in the series; comparing the feature-based model to a video encoding resulting from the first video encoding process for the one or more instances of the candidate feature, and determining from the comparison which enables greater encoding compression; and using results of the comparing, applying feature-based encoding to portions of one or more of the video frames, and applying the first video encoding process to other portions of the one or more video frames. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A method of processing video data comprising:
-
receiving video data having a series of video frames; detecting a candidate feature in one or more of the video frames, said candidate feature being a region of pels exhibiting encoding complexity relative to neighboring pels; segmenting the candidate feature from non-features in the video frame by employing reference frame processing used in a motion compensated prediction process; processing at least one or more portions of decoded video frames to identify potential matches in the one or more previously decoded video frames; determining that one or more of the decoded video frames include instances of the candidate feature; forming a representative feature model based on aggregating one or more of the candidate features by; aggregating the instances of the candidate feature into a set of instances of the candidate feature; and processing the candidate feature set to create a feature-based model, where the feature-based model includes one or more of a model of deformation variation or a model of appearance variation of the instances of the candidate feature, the appearance variation models being created by modeling pel variation of the instances of the candidate feature, the deformation variation models being created by modeling pel correspondence variation of the instances of the candidate feature; determining compression efficiency associated with using the feature-based model to model the candidate feature set; determining compression efficiency associated with using a first video encoding process to model the candidate feature set; comparing the feature-based model compression efficiency with the first video modeling compression efficiency, and determining which one is of greater compression value; encoding the video data using the feature-based models and the first video encoding process based on the compression efficiency. - View Dependent Claims (12, 13, 14)
-
-
15. A digital processing system for processing video data having one or more video frames comprising:
-
one or more computer processors executing an encoder; the encoder configured to use feature-based encoding to encode portions of the video frames by; detecting a candidate feature in one or more of the video frames, said candidate feature being a region of pels exhibiting encoding complexity relative to neighboring pels; segmenting the candidate feature from non-features in the video frame by employing reference frame processing used in a motion compensated prediction process; processing at least the one or more portions of decoded video frames to identify potential matches of the candidate feature; determining that an amount of the portions of the decoded video frames include instances of the candidate feature; forming a representative feature model based on aggregating one or more of the candidate features by; aggregating the instances of the candidate feature into a set of instances of the candidate feature; and processing the candidate feature set to create a feature-based model, where the feature-based model includes a model of deformation variation and a model of appearance variation of the instances of the candidate feature, the appearance variation models being created by modeling pel variation of the instances of the candidate feature, the structural variation models being created by modeling pel correspondence variation of the instances of the candidate feature; determining compression efficiency associated with using the feature-based model to model the candidate feature set; determining compression efficiency associated with using the first video encoding process to model the candidate feature set; comparing the feature-based model compression efficiency with the video modeling compression efficiency of the first video encoding process; and encoding the video data using the feature-based models and the first video encoding process based on the compression efficiency. - View Dependent Claims (16, 17, 18)
-
-
19. A data processing system including:
-
an encoder configured to process a series of video frames of video data, such that portions of the video data are encoded using a first encoding process and portions are encoded using a feature-based compression process by; processing a plurality of decoded video frames in the series to detect one or more instances of a candidate feature, said candidate feature being a region of pels exhibiting encoding complexity relative to neighboring pels; determining positional information for the detected instances of the candidate feature in the one or more decoded video frames; using a motion compensated prediction process to facilitate determining an instance of the candidate feature in a subject video frame in the series by initializing the motion compensated prediction process with the positional information for at least one of the detected instances of the candidate feature in the one or more decoded video frames; transforming one or more of the determined or the detected candidate feature instances; aggregating the transformed one or more of the determined or detected feature instances to create a feature-based encoding; and comparing compression efficiency of the feature-based encoding with an encoding from the first encoding process to determine which encoding process to apply. - View Dependent Claims (20, 21)
-
-
22. A computer program product stored on a non-transitory computer useable medium configured to be executed by one or more processors, the computer program product being configured to cause the one or more processors to process video data by:
-
encoding portions of the video data using a first encoding process and portions using a feature-based compression process by; processing a plurality of decoded video frames in the series to detect one or more instances of a candidate feature, said candidate feature being a region of pels exhibiting encoding complexity relative to neighboring pels; determining positional information for at least one of the detected instances of the candidate feature in the one or more decoded video frames; using a motion compensated prediction process to facilitate determining an instance of the candidate feature in a subject video frame in the series by initializing the motion compensated prediction process with the positional information for at least one of the detected instances of the candidate feature in the one or more decoded video frames; transforming one or more of the determined or the detected candidate feature instances; aggregating the transformed one or more of the determined or detected feature instances to create a feature-based encoding; and comparing compression efficiency of the feature-based encoding with an encoding from the first encoding process to determine which encoding process to apply.
-
-
23. A computer implemented method of processing a series of video frames of video data, comprising the computer implemented steps of:
-
encoding portions of the video data using a first encoding process and portions using a feature-based compression process by; processing a plurality of decoded video frames in the series to detect one or more instances of a candidate feature, said candidate feature being a region of pels exhibiting encoding complexity relative to neighboring pels; determining positional information for at least one of the detected instances of the candidate feature in the one or more decoded video frames; using a motion compensated prediction process to facilitate determining an instance of the candidate feature in a subject video frame in the series by initializing the motion compensated prediction process with the positional information for at least one of the detected instances of the candidate feature in the one or more decoded video frames; transforming one or more of the determined or the detected candidate feature instances; aggregating one or more of the transformed candidate feature instances; using the aggregated and transformed feature instances to create a feature based model configured as a feature-based encoding; and comparing compression efficiency of the feature-based encoding with an encoding from the first encoding process to determine which encoding process to apply. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58)
-
Specification