Spatially scalable video coding facilitating the derivation of variable-resolution images
Abstract
An adaptive technique for encoding and decoding which facilitates the transmission, reception, storage, or retrieval of a scalable video signal. The invention allows this scaling to be performed entirely in the spatial domain. In a specific embodiment of the invention this scaling is realized by adaptively encoding a video signal based upon a selection taken from among a multiplicity of predictions from previously decoded images, and a selection of compatible predictions obtained from up-sampling lower resolution decoded images of the current temporal reference. A technical advantage of the invention is that both the syntax and signal multiplexing structure of at least one encoded lower-resolution scale of video is compatible with the MPEG-1 standards.
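The relationship between a lower-resolution scale and a prediction obtained by up-sampling it can be illustrated with a minimal sketch. The 2:1 ratio, the 2x2 averaging filter, the sample-repetition up-sampler, and the function names below are assumptions made for illustration only; the abstract does not specify them.

```python
def downsample_2x(image):
    """Average each 2x2 block to halve width and height (assumed filter)."""
    h, w = len(image), len(image[0])
    return [
        [
            (image[y][x] + image[y][x + 1] + image[y + 1][x] + image[y + 1][x + 1]) // 4
            for x in range(0, w - 1, 2)
        ]
        for y in range(0, h - 1, 2)
    ]


def upsample_2x(image):
    """Repeat each sample 2x2 to return to the full-resolution grid (assumed filter)."""
    out = []
    for row in image:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


full = [
    [10, 12, 40, 44],
    [14, 16, 48, 52],
    [90, 90, 20, 20],
    [90, 90, 20, 20],
]
low = downsample_2x(full)        # lower-resolution scale
spatial_pred = upsample_2x(low)  # candidate prediction for the full-resolution scale
# Only the difference from the prediction remains to be coded at full resolution.
residual = [[f - p for f, p in zip(fr, pr)] for fr, pr in zip(full, spatial_pred)]
```

In this toy image the flat lower half is predicted exactly, so its residual rows are all zero, while the textured upper half leaves small differences.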
72 Claims
1. Apparatus for encoding digital video signals, comprising:
means for receiving a digital video input signal including a succession of digital representations related to picture elements of a first video image;
means for coding a reduced resolution digital signal related to the picture elements of said first video image, employing in said coding, if said first video image is not the initial image for which an input signal was received, a prediction of said first video image based upon a previously coded video image from a previously received input signal;
means for producing a temporal prediction of said first video image from said reduced resolution digital signal;
means for producing a spatial prediction of said first video image based upon said temporal prediction produced from said reduced resolution digital signal; and
means for coding a second digital signal related to the picture elements of said first video image, adapted to determine if an estimate based upon said temporal prediction of said first video image, or an estimate based upon said spatial prediction of said first video image will be employed in the encoding of said second digital signal.
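The final element of claim 1 calls for determining whether a temporal or a spatial estimate is employed, but the claim does not state the decision criterion. The sketch below assumes a per-block choice by lowest sum of absolute differences (SAD); the criterion, the block shape, and all names are hypothetical.

```python
def sad(block, prediction):
    """Sum of absolute differences between two equally sized blocks."""
    return sum(abs(a - b) for ra, rb in zip(block, prediction) for a, b in zip(ra, rb))


def choose_prediction(block, temporal_est, spatial_est):
    """Return (mode, residual) for whichever estimate has the lower SAD.

    SAD is an assumed criterion; a tie goes to the temporal estimate.
    """
    candidates = {"temporal": temporal_est, "spatial": spatial_est}
    mode = min(candidates, key=lambda name: sad(block, candidates[name]))
    chosen = candidates[mode]
    residual = [[a - b for a, b in zip(ra, rb)] for ra, rb in zip(block, chosen)]
    return mode, residual


block = [[100, 102], [104, 106]]
temporal_est = [[90, 90], [90, 90]]      # e.g. from a previously coded image
spatial_est = [[101, 101], [105, 105]]   # e.g. from the up-sampled lower scale
mode, residual = choose_prediction(block, temporal_est, spatial_est)
```

Here the spatial estimate wins (SAD 4 against 52), so the encoder would code the mode flag together with the small residual.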

4. Method for encoding digital video signals, comprising the steps of:
receiving a digital video input signal including a succession of digital representations related to picture elements of a first video image;
coding a reduced resolution digital signal related to the picture elements of said first video image, employing in said coding, if said first video image is not the initial image for which an input signal was received, a prediction of said first video image based upon a previously coded video image from a previously received input signal;
producing a temporal prediction of said first video image from said reduced resolution digital signal;
producing a spatial prediction of said first video image based upon said temporal prediction produced from said reduced resolution digital signal; and
coding a second digital signal related to the picture elements of said first video image, adapted to determine if an estimate based upon said temporal prediction of said first video image, or an estimate based upon said spatial prediction of said first video image will be employed in the encoding of said second digital signal.

7. Apparatus for encoding digital video signals, comprising:
means for receiving a digital video input signal including a succession of digital representations related to picture elements of at least two frames of a video image, each of said frames comprising a plurality of interlaced fields;
means for coding a reduced resolution digital signal related to the picture elements of a first field of a received frame, employing in said coding, if said received frame is not the initial frame received, a prediction of said first field based upon a previously coded field from a previously received frame;
means for producing a temporal prediction of said first field from said reduced resolution digital signal;
means for producing a spatial prediction of said first video field based upon said temporal prediction produced from said reduced resolution digital signal; and
means for coding a second digital signal related to the picture elements of said first video field, adapted to determine if an estimate based upon said temporal prediction of said first video field, or an estimate based upon said spatial prediction of said first video field will be employed in the encoding of said second digital signal.

10. Method for encoding digital video signals, comprising the steps of:
receiving a digital video input signal including a succession of digital representations related to picture elements of at least two frames of a video image, each of said frames comprising a plurality of interlaced fields;
coding a reduced resolution digital signal related to the picture elements of a first field of a received frame, employing in said coding, if said received frame is not the initial frame received, a prediction of said first field based upon a previously coded field from a previously received frame;
producing a temporal prediction of said first field from said reduced resolution digital signal;
producing a spatial prediction of said first video field based upon said temporal prediction produced from said reduced resolution digital signal; and
coding a second digital signal related to the picture elements of said first video field, adapted to determine if an estimate based upon said temporal prediction of said first video field, or an estimate based upon said spatial prediction of said first video field will be employed in the encoding of said second digital signal.

13. Apparatus for encoding digital video signals, comprising:
means for receiving a digital video input signal including a succession of digital representations related to picture elements of at least two frames of a video image, each of said frames comprising a plurality of interlaced odd and even fields;
means for coding a reduced resolution digital signal related to the picture elements of an odd field of a received frame, employing, if said received frame is not the initial frame received, a prediction of said odd field of said received frame based upon a previously coded odd field from a previously received frame;
means for producing a temporal prediction of said odd field of said received frame from said reduced resolution digital signal;
means for producing a spatial prediction of said odd field of said received frame based upon said temporal prediction produced from said reduced resolution digital signal; and
means for coding a second digital signal related to the picture elements of said odd field of said received frame, adapted to determine if an estimate based upon said temporal prediction of said odd field of said received frame, or an estimate based upon said spatial prediction of said odd field of said received frame will be employed in the encoding of said second digital signal.
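The field-based claims operate on the odd and even fields of an interlaced frame separately. As a minimal sketch, a frame stored as a list of scan lines can be separated as follows; the convention that the first stored line belongs to the odd field is an assumption, and the function name is hypothetical.

```python
def split_fields(frame):
    """Split an interlaced frame (a list of scan lines) into its two fields."""
    odd_field = frame[0::2]   # lines 1, 3, 5, ... counting from 1 (assumed convention)
    even_field = frame[1::2]  # lines 2, 4, 6, ...
    return odd_field, even_field


frame = [
    [11, 12],
    [21, 22],
    [31, 32],
    [41, 42],
]
odd_field, even_field = split_fields(frame)
```

Each field can then be treated as an image in its own right, with a field predicted from the same-parity field of a previously received frame.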

16. Method for encoding digital video signals, comprising the steps of:
receiving a digital video input signal including a succession of digital representations related to picture elements of at least two frames of a video image, each of said frames comprising a plurality of interlaced odd and even fields;
coding a reduced resolution digital signal related to the picture elements of an odd field of a received frame, employing, if said received frame is not the initial frame received, a prediction of said odd field of said received frame based upon a previously coded odd field from a previously received frame;
producing a temporal prediction of said odd field of said received frame from said reduced resolution digital signal;
producing a spatial prediction of said odd field of said received frame based upon said temporal prediction produced from said reduced resolution digital signal; and
coding a second digital signal related to the picture elements of said odd field of said received frame, adapted to determine if an estimate based upon said temporal prediction of said odd field of said received frame, or an estimate based upon said spatial prediction of said odd field of said received frame will be employed in the encoding of said second digital signal.

19. Apparatus for encoding digital video signals, comprising:
means for receiving a digital video input signal including a succession of digital representations related to picture elements of at least two frames of a video image, each of said frames comprising a plurality of interlaced odd and even fields;
means for coding a reduced resolution digital signal related to the picture elements of an even field of a received frame, employing, if said received frame is not the initial frame received, a prediction of said even field of said received frame based upon a previously coded even field from a previously received frame;
means for producing a temporal prediction of said even field of said received frame from said reduced resolution digital signal;
means for producing a spatial prediction of said even field of said received frame based upon said temporal prediction produced from said reduced resolution digital signal; and
means for coding a second digital signal related to the picture elements of said even field of said received frame, adapted to determine if an estimate based upon said temporal prediction of said even field of said received frame, or an estimate based upon said spatial prediction of said even field of said received frame will be employed in the encoding of said second digital signal.

22. Method for encoding digital video signals, comprising the steps of:
receiving a digital video input signal including a succession of digital representations related to picture elements of at least two frames of a video image, each of said frames comprising a plurality of interlaced odd and even fields;
coding a reduced resolution digital signal related to the picture elements of an even field of a received frame, employing, if said received frame is not the initial frame received, a prediction of said even field of said received frame based upon a previously coded even field from a previously received frame;
producing a temporal prediction of said even field of said received frame from said reduced resolution digital signal;
producing a spatial prediction of said even field of said received frame based upon said temporal prediction produced from said reduced resolution digital signal; and
coding a second digital signal related to the picture elements of said even field of said received frame, adapted to determine if an estimate based upon said temporal prediction of said even field of said received frame, or an estimate based upon said spatial prediction of said even field of said received frame will be employed in the encoding of said second digital signal.

25. Apparatus for encoding digital video signals, comprising:
means for receiving a digital video input signal including a succession of digital representations related to picture elements of at least two frames of a video image;
means for processing said received digital video input signal so as to obtain a signal representing a reduced resolution image of the received frame;
means for coding a reduced resolution digital signal related to the picture elements of said received frame, employing in said coding, if said received frame is not the initial frame received, a prediction of said received frame based upon a previously coded reduced resolution image of a previously received frame;
means for producing a temporal prediction of said received frame from said reduced resolution digital signal;
means for producing a spatial prediction of said received frame based upon said temporal prediction produced from said reduced resolution digital signal; and
means for coding a second digital signal related to the picture elements of said received frame, adaptively employing an estimate based upon said temporal and said spatial predictions of said received frame.

28. Method for encoding digital video signals, comprising the steps of:
receiving a digital video input signal including a succession of digital representations related to picture elements of at least two frames of a video image;
processing said received digital video input signal so as to obtain a signal representing a reduced resolution image of the received frame;
coding a reduced resolution digital signal related to the picture elements of said received frame, employing in said coding, if said received frame is not the initial frame received, a prediction of said received frame based upon a previously coded reduced resolution image of a previously received frame;
producing a temporal prediction of said received frame from said reduced resolution digital signal;
producing a spatial prediction of said received frame based upon said temporal prediction produced from said reduced resolution digital signal; and
coding a second digital signal related to the picture elements of said received frame, adaptively employing an estimate based upon said temporal and said spatial predictions of said received frame.

31. Apparatus for encoding digital video signals, comprising:
means for receiving a digital video input signal including a succession of digital representations related to picture elements of at least two frames of a video image, each of said frames comprising a plurality of interlaced odd and even fields;
means for coding a first reduced resolution digital signal related to the picture elements of an odd field of a received frame, employing, if said received frame is not the initial frame received, a prediction of said odd field of said received frame based upon a previously coded odd field from a previously received frame;
means for producing a temporal prediction of said odd field of said received frame from said first reduced resolution digital signal;
means for coding a second reduced resolution digital signal related to the picture elements of an even field of said received frame, employing, if said received frame is not the initial frame received, a prediction of said even field of said received frame based upon a previously coded even field from a previously received frame;
means for producing a temporal prediction of said even field of said received frame from said second reduced resolution digital signal;
means for producing a spatial prediction of said received frame based upon said temporal predictions produced from said first and second reduced resolution digital signals;
means for producing a temporal prediction of said received frame; and
means for coding a third digital signal related to the picture elements of said received frame, adaptively employing an estimate based upon said temporal and said spatial predictions of said received frame.
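Claim 31 derives one spatial prediction of the whole frame from the predictions of its two reduced-resolution fields. One way this could be pictured, purely as an assumption, is to up-sample each field prediction and interleave the results line by line. The horizontal 2:1 ratio, the sample-repetition filter, and the function names are hypothetical and are not taken from the claim.

```python
def upsample_rows_2x(field):
    """Repeat each sample horizontally (assumed 2:1 horizontal reduction)."""
    return [[v for v in row for _ in range(2)] for row in field]


def weave(odd_field, even_field):
    """Interleave two fields line by line into one frame, odd line first."""
    frame = []
    for odd_row, even_row in zip(odd_field, even_field):
        frame.append(odd_row)
        frame.append(even_row)
    return frame


odd_pred = [[1, 2], [5, 6]]    # prediction of the reduced-resolution odd field
even_pred = [[3, 4], [7, 8]]   # prediction of the reduced-resolution even field
frame_spatial_pred = weave(upsample_rows_2x(odd_pred), upsample_rows_2x(even_pred))
```

The resulting frame-sized prediction is then one of the candidates available when coding the third, full-resolution signal.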

34. Method for encoding digital video signals, comprising:
receiving a digital video input signal including a succession of digital representations related to picture elements of at least two frames of a video image, each of said frames comprising a plurality of interlaced odd and even fields;
coding a first reduced resolution digital signal related to the picture elements of an odd field of a received frame, employing, if said received frame is not the initial frame received, a prediction of said odd field of said received frame based upon a previously coded odd field from a previously received frame;
producing a temporal prediction of said odd field of said received frame from said first reduced resolution digital signal;
coding a second reduced resolution digital signal related to the picture elements of an even field of said received frame, employing, if said received frame is not the initial frame received, a prediction of said even field of said received frame based upon a previously coded even field from a previously received frame;
producing a temporal prediction of said even field of said received frame from said second reduced resolution digital signal;
producing a spatial prediction of said received frame based upon said temporal predictions produced from said first and second reduced resolution digital signals;
producing a temporal prediction of said received frame; and
coding a third digital signal related to the picture elements of said received frame, adaptively employing an estimate based upon said temporal and said spatial predictions of said received frame.

37. Apparatus for decoding digital video signals, comprising:
means for receiving a first and second digital signals, said first digital signal representing a reduced resolution depiction of a first video image, and said second digital signal representing a full resolution depiction of said first video image;
means for decoding from said first digital signal said reduced resolution depiction of said first video image, employing in said decoding, if said first video image is not the initial image for which an input signal was received, a prediction of said first video image based upon a previously decoded video image from a previously received input signal;
means for producing a temporal prediction of said first video image from said decoded reduced resolution signal;
means for producing a spatial prediction of said received first video image based upon said reduced resolution video field; and
means for decoding from said second digital signal a full resolution depiction of said first video image, adapted to determine if an estimate based upon said temporal prediction of said first video image, or an estimate based upon said spatial prediction of said first video image will be employed in the decoding of said second digital signal.
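On the decoding side, the last element of claim 37 determines which estimate is employed and then recovers the full-resolution image. The sketch below assumes that the second digital signal carries a per-block mode flag and a residual, and that samples are 8-bit; both are assumptions for illustration, and the names are hypothetical.

```python
def reconstruct_block(mode, residual, temporal_est, spatial_est):
    """Add the decoded residual to the estimate named by the mode flag."""
    if mode == "temporal":
        prediction = temporal_est
    elif mode == "spatial":
        prediction = spatial_est
    else:
        raise ValueError(f"unknown prediction mode: {mode!r}")
    # Clamp to the assumed 8-bit sample range.
    return [
        [max(0, min(255, p + r)) for p, r in zip(pred_row, res_row)]
        for pred_row, res_row in zip(prediction, residual)
    ]


temporal_est = [[90, 90], [90, 90]]
spatial_est = [[101, 101], [105, 105]]
decoded = reconstruct_block("spatial", [[-1, 1], [-1, 1]], temporal_est, spatial_est)
```

Because the decoder forms the same two estimates as the encoder, only the flag and the residual need to be carried in the second signal under these assumptions.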

40. Method for decoding digital video signals, comprising the steps of:
receiving a first and second digital signals, said first digital signal representing a reduced resolution depiction of a first video image, and said second digital signal representing a full resolution depiction of said first video image;
decoding from said first digital signal said reduced resolution depiction of said first video image, employing in said decoding, if said first video image is not the initial image for which an input signal was received, a prediction of said first video image based upon a previously decoded video image from a previously received input signal;
producing a temporal prediction of said first video image from said decoded reduced resolution signal;
producing a spatial prediction of said received first video image based upon said reduced resolution video field; and
decoding from said second digital signal a full resolution depiction of said first video image, adapted to determine if an estimate based upon said temporal prediction of said first video image, or an estimate based upon said spatial prediction of said first video image will be employed in the decoding of said second digital signal.

43. Apparatus for decoding digital video signals, comprising:
means for receiving a first and second digital signals, said first digital signal representing a reduced resolution field of a first frame of video, and said second digital signal representing a full resolution field of said first frame of video;
means for decoding from said first digital signal said reduced resolution field, employing in said decoding, if said decoded reduced resolution field is not the initial field to be decoded by said apparatus, a prediction of said reduced resolution field based upon a previously decoded reduced resolution field from a previous frame;
means for producing a temporal prediction of said field of said first frame of video from said decoded reduced resolution field;
means for producing a spatial prediction of said video field of said first frame of video based upon said reduced resolution video field; and
means for decoding from said second digital signal said full resolution video field, adapted to determine if an estimate based upon said temporal prediction of said field of said first frame of video, or an estimate based upon said spatial prediction of said field of said first frame of video will be employed in the decoding of said second digital signal.

46. Method for decoding digital video signals, comprising the steps of:
receiving a first and second digital signals, said first digital signal representing a reduced resolution field of a first frame of video, and said second digital signal representing a full resolution field of said first frame of video;
decoding from said first digital signal said reduced resolution field, employing in said decoding, if said decoded reduced resolution field is not the initial field to be decoded by said apparatus, a prediction of said reduced resolution field based upon a previously decoded reduced resolution field from a previous frame;
producing a temporal prediction of said field of said first frame of video from said decoded reduced resolution field;
producing a spatial prediction of said received first field of said first frame of video based upon said reduced resolution video field; and
decoding from said second digital signal said full resolution video field, adapted to determine if an estimate based upon said temporal prediction of said field of said first frame of video, or an estimate based upon said spatial prediction of said field of said first frame of video will be employed in the decoding of said second digital signal.

49. Apparatus for decoding digital video signals, comprising:
means for receiving a first and second digital signals, said first digital signal representing a reduced resolution odd field of a first frame of video, and said second digital signal representing a full resolution odd field of said first frame of video;
means for decoding from said first digital signal said reduced resolution odd field, employing in said decoding, if said decoded reduced resolution odd field is not the initial odd field to be decoded by said apparatus, a prediction of said reduced resolution odd field based upon a previously decoded reduced resolution odd field from a previous frame;
means for producing a temporal prediction of said odd field of said first frame from said decoded reduced resolution field;
means for producing a spatial prediction of said odd field of said first frame based upon said reduced resolution odd field; and
means for decoding from said second digital signal said full resolution odd field, adapted to determine if an estimate based upon said temporal prediction of said odd field of said first frame, or an estimate based upon said spatial prediction of said odd field of said first frame will be employed in the decoding of said second digital signal.

52. Method for decoding digital video signals, comprising the steps of:
receiving a first and second digital signals, said first digital signal representing a reduced resolution odd field of a first frame of video, and said second digital signal representing a full resolution odd field of said first frame of video;
decoding from said first digital signal said reduced resolution odd field, employing in said decoding, if said decoded reduced resolution odd field is not the initial odd field to be decoded by said apparatus, a prediction of said reduced resolution odd field based upon a previously decoded reduced resolution odd field from a previous frame;
producing a temporal prediction of said odd field of said first frame from said decoded reduced resolution field;
producing a spatial prediction of said odd field of said first frame based upon said reduced resolution odd field; and
decoding from said second digital signal said full resolution odd field, adapted to determine if an estimate based upon said temporal prediction of said odd field of said first frame, or an estimate based upon said spatial prediction of said odd field of said first frame will be employed in the decoding of said second digital signal.

55. Apparatus for decoding digital video signals, comprising:
means for receiving a first and second digital signals, said first digital signal representing a reduced resolution even field of a first frame of video, and said second digital signal representing a full resolution even field of said first frame of video;
means for decoding from said first digital signal said reduced resolution even field, employing in said decoding, if said decoded reduced resolution even field is not the initial even field to be decoded by said apparatus, a prediction of said reduced resolution even field based upon a previously decoded reduced resolution even field from a previous frame;
means for producing a temporal prediction of said even field of said first frame from said decoded reduced resolution field;
means for producing a spatial prediction of said even field of said first frame based upon said reduced resolution even field; and
means for decoding from said second digital signal said full resolution even field, adapted to determine if an estimate based upon said temporal prediction of said even field of said first frame, or an estimate based upon said spatial prediction of said even field of said first frame will be employed in the decoding of said second digital signal.

58. Method for decoding digital video signals, comprising the steps of:
receiving a first and second digital signals, said first digital signal representing a reduced resolution even field of a first frame of video, and said second digital signal representing a full resolution even field of said first frame of video;
decoding from said first digital signal said reduced resolution even field, employing in said decoding, if said decoded reduced resolution even field is not the initial even field to be decoded by said apparatus, a prediction of said reduced resolution even field based upon a previously decoded reduced resolution even field from a previous frame;
producing a temporal prediction of said even field of said first frame from said decoded reduced resolution field;
producing a spatial prediction of said even field of said first frame based upon said reduced resolution even field; and
decoding from said second digital signal said full resolution even field, adapted to determine if an estimate based upon said temporal prediction of said even field of said first frame, or an estimate based upon said spatial prediction of said even field of said first frame will be employed in the decoding of said second digital signal.

61. Apparatus for decoding digital video signals, comprising:
means for receiving a first and second digital signals, said first digital signal representing a reduced resolution frame of a first frame of video, and said second digital signal representing a full resolution frame of said first frame of video;
means for decoding from said first digital signal said reduced resolution frame, employing in said decoding, if said decoded reduced resolution frame is not the initial frame to be decoded by said apparatus, a prediction of said reduced resolution frame based upon a previously decoded reduced resolution even frame from a previous frame;
means for producing a temporal prediction of said first frame from said decoded reduced resolution field;
means for producing a spatial prediction of said first frame based upon said decoded reduced resolution frame; and
means for decoding from said second digital signal said full resolution frame of said first frame, adapted to determine if an estimate based upon said temporal prediction of said first frame, or an estimate based upon said spatial prediction of said first frame will be employed in the decoding of said second digital signal.

64. Method for decoding digital video signals, comprising the steps of:
receiving a first and second digital signals, said first digital signal representing a reduced resolution frame of a first frame of video, and said second digital signal representing a full resolution frame of said first frame of video;
decoding from said first digital signal said reduced resolution frame, employing in said decoding, if said decoded reduced resolution frame is not the initial frame to be decoded by said apparatus, a prediction of said reduced resolution frame based upon a previously decoded reduced resolution even frame from a previous frame;
producing a temporal prediction of said first frame from said decoded reduced resolution field;
producing a spatial prediction of said first frame based upon said decoded reduced resolution frame; and
decoding from said second digital signal said full resolution frame of said first frame, adapted to determine if an estimate based upon said temporal prediction of said first frame, or an estimate based upon said spatial prediction of said first frame will be employed in the decoding of said second digital signal.

67. Apparatus for decoding digital video signals, comprising:
means for receiving a first, second, and third digital signals, said first digital signal representing a reduced resolution frame of an odd field of a first frame of video, said second digital signal representing a reduced resolution frame of an even field of said first frame of video, and said third digital signal representing a full resolution frame of said first frame of video;
means for decoding from said first digital signal said reduced resolution odd field, employing in said decoding, if said first frame is not the initial frame being decoded by said apparatus, a prediction of said reduced resolution odd field based upon a previously decoded reduced resolution odd field from a previous frame;
means for producing a temporal prediction of said odd field of said first frame from said decoded reduced resolution odd field;
means for decoding from said second digital signal said reduced resolution even field, employing in said decoding, if said first frame is not the initial frame being decoded by said apparatus, a prediction of said reduced resolution even field based upon a previously decoded reduced resolution even field from a previous frame;
means for producing a temporal prediction of said even field of said first frame from said decoded reduced resolution even field;
means for producing a spatial prediction of said first frame based upon said temporal predictions produced from said first and second reduced resolution digital signals; and
means for decoding from said third digital signal said full resolution frame of said first frame, adapted to determine if an estimate based upon said temporal prediction of said first frame, or an estimate based upon said spatial prediction of said first frame will be employed in the decoding of said second digital signal.

70. Method for decoding digital video signals, comprising:
receiving a first, second, and third digital signals, said first digital signal representing a reduced resolution frame of an odd field of a first frame of video, said second digital signal representing a reduced resolution frame of an even field of said first frame of video, and said third digital signal representing a full resolution frame of said first frame of video;
decoding from said first digital signal said reduced resolution odd field, employing in said decoding, if said first frame is not the initial frame being decoded by said apparatus, a prediction of said reduced resolution odd field based upon a previously decoded reduced resolution odd field from a previous frame;
producing a temporal prediction of said odd field of said first frame from said decoded reduced resolution odd field;
decoding from said second digital signal said reduced resolution even field, employing in said decoding, if said first frame is not the initial frame being decoded by said apparatus, a prediction of said reduced resolution even field based upon a previously decoded reduced resolution even field from a previous frame;
producing a temporal prediction of said even field of said first frame from said decoded reduced resolution even field;
producing a spatial prediction of said first frame based upon said temporal predictions produced from said first and second reduced resolution digital signals; and
decoding from said third digital signal said full resolution frame of said first frame, adapted to determine if an estimate based upon said temporal prediction of said first frame, or an estimate based upon said spatial prediction of said first frame will be employed in the decoding of said second digital signal.
Specification