Layered DCT video coder for packet switched ATM networks
Abstract
A digital video encoder encodes a video frame into a differential video frame for transmission over a packet switched network. The video encoder includes an inter-frame encoder, an intra-frame encoder, and an encoding selector for selecting between the inter-frame and intra-frame encoders depending on the relative motion between the video frame being encoded and the previous video frame.
A composite frame combiner provides a composite intra/inter-frame encoded difference frame having one set of pixels encoded by the inter-frame encoder, and another set of pixels encoded by the intra-frame encoder. The set of intra-frame encoded pixels includes at least one square or rectangular pixel block, a vertical strip of pixel blocks, and a horizontal strip of pixel blocks.
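A composite difference frame of this kind might be sketched as follows. This is only an illustration: the function name `composite_difference_frame`, the `strip_cols` parameter, and the left-neighbour intra predictor are all assumptions, since the abstract states only that intra-frame pixels depend on other pixels of the same frame.

```python
def composite_difference_frame(curr, prev, strip_cols):
    """Build a difference frame with a vertical intra-coded strip.

    curr, prev: frames as lists of rows of pixel values.
    strip_cols: set of column indices to intra-code (hypothetical parameter).
    All other pixels are inter-coded against the previous frame.
    """
    out = []
    for n, row in enumerate(curr):
        out_row = []
        for m, x in enumerate(row):
            if m in strip_cols:
                # Intra: predict from the left neighbour in the same frame
                # (assumed predictor; the first column is predicted from 0).
                pred = row[m - 1] if m > 0 else 0
            else:
                # Inter: predict from the co-located pixel of the previous frame.
                pred = prev[n][m]
            out_row.append(x - pred)
        out.append(out_row)
    return out


curr = [[1, 2, 3], [4, 5, 6]]
prev = [[1, 1, 1], [1, 1, 1]]
diff = composite_difference_frame(curr, prev, strip_cols={1})
```

Here column 1 forms the intra-coded vertical strip and the remaining columns are inter-coded.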
Difference frames are encoded into separable data sets representing video information within a particular range of image resolution. A discrete cosine transform (DCT) is used to transform the difference pixels into corresponding DCT coefficients which are separable, by resolution, into the data sets providing coefficient layers.
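As a rough sketch of the layering idea, the following transforms one block of difference pixels with an orthonormal 2-D DCT and groups the coefficients into layers by frequency band. The grouping rule (the index sum u+v) and the band limits are assumptions chosen for illustration; the abstract does not specify how coefficients are assigned to layers.

```python
import math


def dct_2d(block):
    """Orthonormal 2-D DCT-II of a square block given as a list of rows."""
    n = len(block)

    def scale(k):
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    out = [[0.0] * n for _ in range(n)]
    for u in range(n):
        for v in range(n):
            total = 0.0
            for r in range(n):
                for c in range(n):
                    total += (block[r][c]
                              * math.cos((2 * r + 1) * u * math.pi / (2 * n))
                              * math.cos((2 * c + 1) * v * math.pi / (2 * n)))
            out[u][v] = scale(u) * scale(v) * total
    return out


def split_into_layers(coeffs, bounds):
    """Assign each coefficient to the first layer whose limit exceeds u+v.

    bounds: increasing exclusive upper limits on u+v (hypothetical rule).
    Coefficients with u+v beyond the last limit are dropped.
    """
    layers = [[] for _ in bounds]
    n = len(coeffs)
    for u in range(n):
        for v in range(n):
            for idx, limit in enumerate(bounds):
                if u + v < limit:
                    layers[idx].append((u, v, coeffs[u][v]))
                    break
    return layers


block = [[10.0] * 8 for _ in range(8)]      # a flat 8x8 block
coeffs = dct_2d(block)
layers = split_into_layers(coeffs, bounds=(2, 6, 15))
```

The first layer then carries the lowest-frequency coefficients (coarsest resolution), and each further layer adds finer detail.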
A packetizer formats the data sets into asynchronous transfer mode (ATM) packets for transmission over the network.
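A minimal packetizing sketch, assuming the standard ATM cell size of 53 bytes (5-byte header, 48-byte information field). The one-byte sequence counter at the start of the information field, the zero padding, and the opaque `header` argument are assumptions made for illustration, not the patent's field layout.

```python
ATM_HEADER_BYTES = 5    # standard ATM cell header size
ATM_INFO_BYTES = 48     # standard ATM information field size
OVERHEAD_BYTES = 1      # hypothetical: one byte holding a cell sequence number


def packetize(layer_bytes, header):
    """Split one data set into fixed-size cells.

    header: 5 opaque bytes identifying the virtual channel (not parsed here).
    Returns a list of 53-byte cells; the last payload is zero-padded.
    """
    if len(header) != ATM_HEADER_BYTES:
        raise ValueError("header must be exactly 5 bytes")
    chunk = ATM_INFO_BYTES - OVERHEAD_BYTES
    cells = []
    for seq, start in enumerate(range(0, len(layer_bytes), chunk)):
        payload = layer_bytes[start:start + chunk].ljust(chunk, b"\x00")
        cells.append(header + bytes([seq % 256]) + payload)
    return cells


cells = packetize(bytes(100), header=b"\x00" * 5)
```

With 47 payload bytes per cell, a 100-byte data set occupies three cells.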
33 Claims
-
1. An apparatus for encoding a digital video frame f_i, having a matrix with N rows and M columns of pixels x_i(n,m), for transmission over a digital communications channel, comprising
an inter-frame encoder for encoding a said pixel x_i(n,m) of said video frame f_i into a corresponding differentially encoded pixel Δx_i(n,m) = x_i(n,m) - x_{i-1}(n,m) of a difference frame Δf_i dependent on the corresponding pixel x_{i-1}(n,m) of a previous video frame f_{i-1},
an intra-frame encoder for encoding a said pixel x_i(n,m) of frame f_i into a corresponding differentially encoded pixel Δx_i(n,m) of said difference frame Δf_i(n,m) dependent on other said pixels of the same said video frame f_i, and
an encoding selector for selecting between said inter-frame encoder and said intra-frame encoder for differentially encoding pixels x_i(n,m) of frame f_i into corresponding differentially encoded pixels Δx_i(n,m) of said difference frame Δf_i, said encoding selector being responsive to the relative motion between said video frame f_i and said previous video frame f_{i-1},
said coding selector further including a motion detector for detecting the relative motion between said video frame f_i and said previous video frame f_{i-1}, and for providing a decision parameter K representing the level of detected relative motion, K being computed by
K = k_1 Δx_i + k_2 σ
where Δx_i and σ^2 are an estimate of the mean and variance, respectively, of difference pixel value Δx_i(n,m), k_1 and k_2 are weighting coefficients each having a value between 0 and 1 such that k_1 + k_2 = 1,
a layered resolution encoder for encoding said differential pixels Δx_i(n,m) of frame Δf_i into a plurality of separable data sets, each said data set representing video information, within a particular range of video image resolution, about said differential pixels Δx_i(n,m),
a packetizer for formatting said plurality of data sets into at least one asynchronous transfer mode (ATM) packet for transmission over the digital communications channel, said ATM packet comprising a header field portion having data for establishing a virtual communications channel between selected devices on the digital communications channel, and an information field portion for transferring said plurality of data sets between said selected devices,
wherein said encoding selector responds to decision parameter K by selecting said inter-frame encoder when K < T and by selecting said intra-frame encoder when K ≥ T, where T represents a threshold parameter having a value between 0 and 1.
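The selection rule of claim 1 can be sketched in a few lines. This is an illustration under stated assumptions: pixel values are scaled to [0, 1] so that K is comparable with T, the mean is taken over absolute differences, and the second term uses the variance estimate; none of these details is fixed by the claim text, and the function names are hypothetical.

```python
def decision_parameter(curr, prev, k1=0.5, k2=0.5):
    """Compute K from the mean and variance of the difference pixels.

    curr, prev: frames as lists of rows with values in [0, 1] (assumed scaling).
    k1, k2: weighting coefficients in [0, 1] with k1 + k2 == 1.
    """
    if not (0.0 <= k1 <= 1.0 and 0.0 <= k2 <= 1.0) or abs(k1 + k2 - 1.0) > 1e-9:
        raise ValueError("k1 and k2 must lie in [0, 1] and sum to 1")
    # Absolute frame difference per pixel (assumption: magnitude of motion).
    diffs = [abs(c - p)
             for row_c, row_p in zip(curr, prev)
             for c, p in zip(row_c, row_p)]
    mean = sum(diffs) / len(diffs)
    variance = sum((d - mean) ** 2 for d in diffs) / len(diffs)
    return k1 * mean + k2 * variance


def select_mode(k, threshold):
    """Inter-frame coding when K < T, intra-frame coding when K >= T."""
    return "inter" if k < threshold else "intra"


still = [[0.2, 0.2], [0.2, 0.2]]
black = [[0.0, 0.0], [0.0, 0.0]]
white = [[1.0, 1.0], [1.0, 1.0]]

k_static = decision_parameter(still, still)   # no motion
k_cut = decision_parameter(white, black)      # every pixel changes fully
```

A static scene yields K = 0 and selects inter-frame coding, while a full scene change pushes K above a small threshold and selects intra-frame coding.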
-
11. The apparatus of claim 11 wherein said DCT coefficients within each said coefficient block are separated into said coefficient layers.
-
17. A method for encoding a digital video frame f_i, having N rows and M columns of pixels x_i(n,m), for transmission over a digital communications channel, comprising the steps of
selecting between an inter-frame encoding mode and an intra-frame encoding mode for differentially encoding pixels x_i(n,m) of a video frame f_i into corresponding differentially encoded pixels Δx_i(n,m) of a difference frame Δf_i, said selection being determined by the relative motion between said video frame f_i and a previous video frame f_{i-1},
said step of selecting between an inter-frame encoding mode and an intra-frame encoding mode further including detecting the relative motion between said video frame f_i and said previous video frame f_{i-1}, and computing a decision parameter K representing the level of detected relative motion, K being computed by
K = k_1 Δx_i + k_2 σ
where Δx_i and σ^2 are an estimate of the mean and variance, respectively, of difference pixel values Δx_i(n,m), k_1 and k_2 are weighting coefficients each having a value between 0 and 1 such that k_1 + k_2 = 1, selecting said inter-frame encoder when K < T, selecting said intra-frame encoder when K ≥ T, where T represents a threshold parameter having a value between 0 and 1,
if said inter-frame encoding mode is selected, then encoding a said pixel x_i(n,m) of frame f_i into a corresponding differentially encoded pixel Δx_i(n,m) of said difference frame Δf_i, dependent on the corresponding pixel x_{i-1}(n,m) of said previous video frame f_{i-1}, and
if said intra-frame encoding mode is selected, then encoding a said pixel x_i(n,m) of said frame f_i into a corresponding differentially encoded pixel Δx_i(n,m) computed by Δx_i(n,m) = x_i(n,m) - x_{i-1}(n,m) dependent on other said pixels of the same said video frame f_i,
layer encoding said differential pixels Δx_i(n,m) of frame Δf_i into a plurality of separable data sets, each said data set representing video information, within a particular range of video image resolution, about said differential pixels Δx_i(n,m),
formatting said plurality of data sets into at least one asynchronous transfer mode (ATM) packet for transmission over the digital communications channel, said ATM packet comprising a header field portion having data for establishing a virtual communications channel between selected devices on the digital communications channel, and an information field portion for transferring said plurality of data sets between said selected devices.
-
33. An apparatus for transferring a digital video frame f_i, having N rows and M columns of pixels x_i(n,m), for transmission over a digital communications channel, comprising
an inter-frame encoder for encoding a said pixel x_i(n,m) of a video frame f_i into a corresponding differentially encoded pixel Δx_i(n,m) of a difference frame Δf_i dependent on the corresponding pixel x_{i-1}(n,m) of a previous video frame f_{i-1},
an intra-frame encoder for encoding a said pixel x_i(n,m) of a video frame f_i into a corresponding differentially encoded pixel Δx_i(n,m) dependent on other pixels of the same said frame f_i,
an encoding selector for selecting between said inter-frame encoder and said intra-frame encoder for differentially encoding pixels Δx_i(n,m) of said frame f_i into corresponding differentially encoded pixels Δx_i(n,m) of said frame Δf_i, said encoding selector being responsive to the relative motion between said video frame f_i and said previous video frame f_{i-1}, and
a layered resolution encoder for encoding said differential pixels Δx_i(n,m) of frames Δf_i into a plurality of separable data sets, each said data set representing video information, within a particular range of video image resolution, about said differential pixels Δx_i(n,m),
a packetizer for formatting said plurality of data sets into at least one asynchronous transfer mode (ATM) packet for transmission over the digital communications channel, said ATM packet comprising a header field portion having data for establishing a virtual communications channel between selected devices on the digital communications channel, and an information field portion for transferring said plurality of data sets between said selected devices,
said information field portion having an adaptation overhead field portion including a cell sequence number for indicating the temporal relationship of said ATM packet relative to other said ATM packets, and a sync flag taking one of a plurality of states for indicating the composition of the remainder of said information field portion, said sync flag taking a first state when said information field portion includes a first type of information field, and a second state when said information field portion includes a second type of information field,
said encoding selector comprises a composite frame combiner for providing a differentially encoded difference frame Δf_i having a first set of pixels Δx_i(n,m) encoded by said inter-frame encoder, and a second set of pixels Δx_i(n,m) encoded by said intra-frame encoder, and
said first type of information field comprises an adaptation overhead field portion further comprising a strip location field for indicating the location of said second set of pixels relative to said first set of pixels within said encoded difference frame Δf_i.
Specification