System and method for providing error resilience, random access and rate control in scalable video communications
First Claim
1. A digital video decoding system, the system comprising:
- a decoder logic configured to decode a received digital video signal, which is coded in a scalable video coding format supporting temporal scalability and at least one of spatial and quality scalability,wherein the scalable video coding format for spatial scalability includes a base spatial and at least one spatial enhancement layer, for quality scalability includes a base quality layer and at least one quality enhancement layer, and for temporal scalability includes a base temporal layer and at least one temporal enhancement layer, wherein the base temporal layers and enhancement temporal layers are interlinked by a threaded picture prediction structure for at least one of the spatial or quality scalability layers,wherein, for decoding a picture at a target spatial or quality layer higher than the corresponding base layer, the decoder logic is configured to use coded information from a spatial or quality layer of said picture lower than the target layer in the threaded prediction structure when a portion of the target layer'"'"'s coded information is lost or not available,wherein the digital video decoding system is disposed in a receiving endpoint, the system further comprising;
a linking communication network;
a conferencing server computer linked to the receiving endpoint and at least one transmitting endpoint by at least one communication channel each over the linking communication network, andat least one endpoint that transmits the coded digital video that is coded in the scalable video coding format,wherein the conferencing server computer is configured to selectively eliminate portions of input video signals received from transmitting endpoints that correspond to layers higher than the base spatial or quality layer, prior to creating an output video signal that is forwarded to the receiving endpoint.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods for error resilient transmission, rate control, and random access in video communication systems that use scalable video coding are provided. Error resilience is obtained by using information from low resolution layers to conceal or compensate loss of high resolution layer information. The same mechanism is used for rate control by selectively eliminating high resolution layer information from transmitted signals, which elimination can be compensated at the receiver using information from low resolution layers. Further, random access or switching between low and high resolutions is also achieved by using information from low resolution layers to compensate for high resolution spatial layer packets that may have not been received prior to the switching time.
-
Citations
30 Claims
-
1. A digital video decoding system, the system comprising:
-
a decoder logic configured to decode a received digital video signal, which is coded in a scalable video coding format supporting temporal scalability and at least one of spatial and quality scalability, wherein the scalable video coding format for spatial scalability includes a base spatial and at least one spatial enhancement layer, for quality scalability includes a base quality layer and at least one quality enhancement layer, and for temporal scalability includes a base temporal layer and at least one temporal enhancement layer, wherein the base temporal layers and enhancement temporal layers are interlinked by a threaded picture prediction structure for at least one of the spatial or quality scalability layers, wherein, for decoding a picture at a target spatial or quality layer higher than the corresponding base layer, the decoder logic is configured to use coded information from a spatial or quality layer of said picture lower than the target layer in the threaded prediction structure when a portion of the target layer'"'"'s coded information is lost or not available, wherein the digital video decoding system is disposed in a receiving endpoint, the system further comprising; a linking communication network; a conferencing server computer linked to the receiving endpoint and at least one transmitting endpoint by at least one communication channel each over the linking communication network, and at least one endpoint that transmits the coded digital video that is coded in the scalable video coding format, wherein the conferencing server computer is configured to selectively eliminate portions of input video signals received from transmitting endpoints that correspond to layers higher than the base spatial or quality layer, prior to creating an output video signal that is forwarded to the receiving endpoint. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for decoding a digital video signal, comprising:
-
receiving the digital video signal at a decoder logic, the digital video signal being coded in a scalable video coding format supporting temporal scalability and at least one of spatial and quality scalability, wherein the scalable video coding format for spatial scalability includes a base spatial and at least one spatial enhancement layer, for quality scalability includes a base quality layer and at least one quality enhancement layer, and for temporal scalability includes a base temporal layer and at least one temporal enhancement layer, wherein the base temporal layers and enhancement temporal layers are interlinked by a threaded picture prediction structure for at least one of the spatial or quality scalability layers; and decoding a picture at a target spatial or quality layer higher than the corresponding base layer using coded information from a spatial or quality layer of said picture lower than the target layer in the threaded prediction structure when a portion of the target layer'"'"'s coded information is lost or not available; wherein the decoder is disposed in a receiving endpoint in a linking communication network, wherein a conferencing server computer is linked to the receiving endpoint and at least one transmitting endpoint by at least one communication channel each over the linking communication network, and wherein the at least one transmitting endpoint transmits the coded digital video that is coded in the scalable video coding format; the method further comprising; at the conferencing server computer, selectively eliminating portions of input video signals received from transmitting endpoints that correspond to layers higher than the base spatial or quality layer, prior to creating an output video signal that is forwarded to the receiving endpoint. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A non-transitory computer readable medium comprising a set of executable instructions to direct a processor to decode a digital video signal, by:
-
receiving the digital video signal at a decoder logic, the digital video signal being coded in a scalable video coding format supporting temporal scalability and at least one of spatial and quality scalability, wherein the scalable video coding format for spatial scalability includes a base spatial and at least one spatial enhancement layer, for quality scalability includes a base quality layer and at least one quality enhancement layer, and for temporal scalability includes a base temporal layer and at least one temporal enhancement layer, wherein the base temporal layers and enhancement temporal layers are interlinked by a threaded picture prediction structure for at least one of the spatial or quality scalability layers; and decoding a picture at a target spatial or quality layer higher than the corresponding base layer using coded information from a spatial or quality layer of said picture lower than the target layer in the threaded prediction structure when a portion of the target layer'"'"'s coded information is lost or not available; wherein the decoder logic is disposed in a receiving endpoint in a linking communication network, wherein a conferencing server computer is linked to the receiving endpoint and at least one transmitting endpoint by at least one communication channel each over the linking communication network, and wherein the at least one transmitting endpoint transmits the coded digital video that is coded in the scalable video coding format; at the conferencing server computer, selectively eliminating portions of input video signals received from transmitting endpoints that correspond to layers higher than the base spatial or quality layer, prior to creating an output video signal that is forwarded to the receiving endpoint. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification