Methods and apparatus for error-resilient video coding
First Claim
1. A method of coding a input frame of a video sequence for transmission over a channel, comprising:
- decomposing said input frame into a plurality of subbands;
dividing said plurality of subbands into a plurality of blocks corresponding to a region of said input frame;
selecting said plurality of blocks in a plurality of highest frequency subbands of said plurality of subbands based upon a luminance component of said input frame;
classifying said plurality of blocks in said plurality of highest frequency subbands into a plurality of classes to provide a plurality of class labels for said plurality of blocks in said plurality of highest frequency subbands;
collecting said plurality of class labels to form a subband classification map for each of said plurality of blocks in said plurality of highest frequency subbands;
constructing a global classification map from a majority evaluation of said subband classification map for each of said plurality of blocks in said plurality of highest frequency subbands; and
grouping said plurality of blocks within said plurality of subbands having one of said plurality of class labels to form a plurality of subband class sequences.
7 Assignments
0 Petitions
Accused Products
Abstract
Apparatus and methods are provided for encoding an input frame of a video sequence for transmission over a channel. The method and apparatus decompose the input frame into multiple subbands and divide the multiple subbands into multiple blocks corresponding to a region of the input frame. The blocks in the highest frequency subbands of the multiple blocks are selected based upon a luminance component of the input frame and the multiple blocks in the highest frequency subbands are classified into a multiple classes to provide a multiple class labels. The multiple class labels are collected to form a subband class map for each of the multiple blocks in the highest frequency subbands and a global class map is constructed from a majority evaluation of the subband map for each of the multiple blocks. The multiple blocks within the multiple subbands are grouped which have one of the class labels to form multiple subband class sequences.
81 Citations
20 Claims
-
1. A method of coding a input frame of a video sequence for transmission over a channel, comprising:
-
decomposing said input frame into a plurality of subbands;
dividing said plurality of subbands into a plurality of blocks corresponding to a region of said input frame;
selecting said plurality of blocks in a plurality of highest frequency subbands of said plurality of subbands based upon a luminance component of said input frame;
classifying said plurality of blocks in said plurality of highest frequency subbands into a plurality of classes to provide a plurality of class labels for said plurality of blocks in said plurality of highest frequency subbands;
collecting said plurality of class labels to form a subband classification map for each of said plurality of blocks in said plurality of highest frequency subbands;
constructing a global classification map from a majority evaluation of said subband classification map for each of said plurality of blocks in said plurality of highest frequency subbands; and
grouping said plurality of blocks within said plurality of subbands having one of said plurality of class labels to form a plurality of subband class sequences. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A apparatus for coding a input frame of a video sequence for transmission over a channel, comprising:
-
a decomposer configured to receive said input frame and decompose said input frame into a plurality of subbands that are divided into a plurality of blocks corresponding to a region of said input frame; and
a classifier configured to receive said plurality of subbands that are divided into said plurality of blocks corresponding to said region of said input frame, said classifier also configured to;
select said plurality of blocks in a plurality of highest frequency subbands of said plurality of subbands based upon a luminance component of said input frame;
classify said plurality of blocks in said plurality of highest frequency subbands into a plurality of classes to provide a plurality of class labels for said plurality of blocks in said plurality of highest frequency subbands;
collect said plurality of class labels to form a subband classification map for each of said plurality of blocks in said plurality of highest frequency subbands;
construct a global classification map from a majority evaluation of said subband classification map for each of said plurality of blocks in said plurality of highest frequency subbands; and
grouping said plurality of blocks within said plurality of subbands having one of said plurality of class labels to form a plurality of subband class sequences. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A method for coding a video sequence for transmission over a channel, comprising:
-
receiving the video sequence, the video sequence having a plurality of frames;
grouping said plurality of frames into a group of pictures;
coding an initial frame of said group of pictures to produce a coded initial frame;
transmitting said coded initial frame over the channel;
decoding said coded initial frame to produce a decoded initial frame;
constructing a first error frame estimate with said decoded initial frame;
obtaining a first error frame of a first frame from said first error frame estimate and said first frame;
coding said first error frame to produce a first coded error frame, said coding said first error frame comprising;
decomposing said first error frame into a plurality of subbands;
dividing said plurality of subbands into a plurality of blocks corresponding to a region of said first error frame;
selecting said plurality of blocks in a plurality of highest frequency subbands of said plurality of subbands based upon a luminance component of said first error frame;
classifying said plurality of blocks in said plurality of highest frequency subbands into a plurality of classes to provide a plurality of class labels for said plurality of blocks in said plurality of highest frequency subbands;
collecting said plurality of class labels to form a subband classification map for each of said plurality of blocks in said plurality of highest frequency subbands;
constructing a global classification map from a majority evaluation of said subband classification map for each of said plurality of blocks in said plurality of highest frequency subbands;
grouping said plurality of blocks within said plurality of subbands having one of said plurality of class labels to form a plurality of subband class sequences; and
encoding said plurality of subband class sequences to form said first coded error frame; and
transmitting said first coded error frame over the channel. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
decoding said first coded error frame to produce a decoded first error frame;
constructing a second error frame estimate with said decoded first error frame;
obtaining a second error frame of a second frame from said second error frame estimate and said second frame;
coding said second error frame to produce a second coded error frame; and
transmitting said second coded error frame over the channel.
-
-
15. The method of claim 14, wherein coding said second error frame to produce a second coded error frame comprises:
-
decomposing said second error frame into a second plurality of subbands;
dividing said second plurality of subbands into a second plurality of blocks corresponding to a second region of said second error frame;
selecting said second plurality of blocks in a second plurality of highest frequency subbands of said second plurality of subbands based upon a second luminance component of said second error frame;
classifying said second plurality of blocks in said second plurality of highest frequency subbands into a second plurality of classes to provide a second plurality of class labels for said second plurality of blocks in said second plurality of highest frequency subbands;
collecting said second plurality of class labels to form a second subband classification map for each of said second plurality of blocks in said second plurality of highest frequency subbands;
constructing a second global class map from a second majority evaluation of said second subband classification map for each of said second plurality of blocks in said second plurality of highest frequency subbands;
grouping said second plurality of blocks within said second plurality of subbands having one of said second plurality of class labels to form a second plurality of subband class sequences; and
encoding said second plurality of subband class sequences to form said second coded error frame.
-
-
16. The method of claim 13, wherein said decomposing said first error frame into a plurality of subbands is a two-dimensional discrete wavelet transform (DWT).
-
17. The method of claim 13, further comprising encoding said plurality of subband class sequences.
-
18. The method of claim 17, wherein said encoding is channel-optimized trellis coded quantization (COTCQ) encoding.
-
19. The method of claim 13, further comprising phase scrambling said plurality of subband class sequences.
-
20. The method of claim 13, further comprising normalizing said plurality of subband class sequences.
Specification