System and method for providing error resilience, random access and rate control in scalable video communications
First Claim
1. A video communication system comprising:
- a communication network,
- a conferencing server (including a combination of hardware and software) disposed in the network and linked to at least one receiving and at least one transmitting endpoint by at least one communication channel each over the communication network,
- at least one endpoint that transmits coded digital video using a scalable video coding format, and
- at least one receiving endpoint for decoding a digital video signal coded in a scalable video coding format supporting temporal scalability and at least one of spatial and quality scalability,
- wherein the scalable video coding format for spatial scalability includes a base spatial and at least one spatial enhancement layer, for quality scalability includes a base quality layer and at least one quality enhancement layer, and for temporal scalability includes a base temporal layer and at least one temporal enhancement layer,
- wherein the base temporal layers and enhancement temporal layers are interlinked by a threaded picture prediction structure for at least one of the spatial or quality scalability layers,
- wherein the conferencing server is configured to selectively eliminate or modify portions of the input video signals received from transmitting endpoints that correspond to layers higher than the base spatial or quality layer, prior to creating the output video signal that is forwarded to the at least one receiving endpoint, so that use of lower spatial or quality layer data is signaled or explicitly coded in the output video signal for use in decoding pictures at resolutions higher than the base spatial or quality layer, and
- wherein the conferencing server is further configured to control the transmission rate of the output video signal that is forwarded to the at least one receiving endpoint so that the retained portions of the input video signals received from transmitting endpoints that correspond to layers higher than the base spatial or quality layer do not adversely affect the smoothness of the output bit rate.
Abstract
Systems and methods for error resilient transmission, rate control, and random access in video communication systems that use scalable video coding are provided. Error resilience is obtained by using information from low resolution layers to conceal or compensate for the loss of high resolution layer information. The same mechanism is used for rate control by selectively eliminating high resolution layer information from transmitted signals; this elimination can be compensated at the receiver using information from low resolution layers. Further, random access or switching between low and high resolutions is also achieved by using information from low resolution layers to compensate for high resolution spatial layer packets that may not have been received prior to the switching time.
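The rate-control mechanism summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Packet` type, the `thin_picture` function, the per-picture byte budget, and the greedy lowest-layer-first policy are all assumptions introduced here for illustration. The sketch always forwards base layer packets, forwards enhancement layer packets only while they fit the budget, and reports whether enhancement data was eliminated so that a receiver would know to fall back on lower layer data.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Packet:
    """Hypothetical packet model; field names are illustrative assumptions."""
    picture: int  # index of the picture this packet belongs to
    layer: int    # 0 = base spatial/quality layer, >0 = enhancement layer
    size: int     # payload size in bytes


def thin_picture(packets: List[Packet], budget: int) -> Tuple[List[Packet], bool]:
    """Select the packets of one picture to forward under a byte budget.

    Base layer packets are always kept. Enhancement packets are added in
    order of increasing layer and dropped from the first one that does not
    fit, so a higher layer is never kept without the layers below it.
    Returns the kept packets and a flag telling whether any enhancement
    data was eliminated (the receiver would then rely on lower layers).
    """
    base = [p for p in packets if p.layer == 0]
    enhancement = sorted((p for p in packets if p.layer > 0), key=lambda p: p.layer)

    kept = list(base)
    used = sum(p.size for p in base)
    eliminated = False
    for p in enhancement:
        if used + p.size > budget:
            eliminated = True
            break
        kept.append(p)
        used += p.size
    return kept, eliminated


# One picture with a 400-byte base layer packet and a 900-byte enhancement packet.
picture = [Packet(picture=0, layer=0, size=400), Packet(picture=0, layer=1, size=900)]
tight_kept, tight_flag = thin_picture(picture, budget=1000)   # enhancement dropped
loose_kept, loose_flag = thin_picture(picture, budget=1500)   # everything forwarded
```

Applying the same budget to every picture caps the size of each forwarded picture, which is one simple way to keep large enhancement layer pictures from producing bursts in the output bit rate. A real system would also have to signal the use of lower layer data in the bitstream itself, which this sketch only represents with a boolean flag.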
36 Claims
-
1. A video communication system comprising:
- a communication network,
- a conferencing server (including a combination of hardware and software) disposed in the network and linked to at least one receiving and at least one transmitting endpoint by at least one communication channel each over the communication network,
- at least one endpoint that transmits coded digital video using a scalable video coding format, and
- at least one receiving endpoint for decoding a digital video signal coded in a scalable video coding format supporting temporal scalability and at least one of spatial and quality scalability,
- wherein the scalable video coding format for spatial scalability includes a base spatial and at least one spatial enhancement layer, for quality scalability includes a base quality layer and at least one quality enhancement layer, and for temporal scalability includes a base temporal layer and at least one temporal enhancement layer,
- wherein the base temporal layers and enhancement temporal layers are interlinked by a threaded picture prediction structure for at least one of the spatial or quality scalability layers,
- wherein the conferencing server is configured to selectively eliminate or modify portions of the input video signals received from transmitting endpoints that correspond to layers higher than the base spatial or quality layer, prior to creating the output video signal that is forwarded to the at least one receiving endpoint, so that use of lower spatial or quality layer data is signaled or explicitly coded in the output video signal for use in decoding pictures at resolutions higher than the base spatial or quality layer, and
- wherein the conferencing server is further configured to control the transmission rate of the output video signal that is forwarded to the at least one receiving endpoint so that the retained portions of the input video signals received from transmitting endpoints that correspond to layers higher than the base spatial or quality layer do not adversely affect the smoothness of the output bit rate.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
-
-
10. A video communication system comprising:
- a communication network,
- one endpoint (including a combination of hardware and software) that transmits coded digital video using a scalable video coding format, and
- at least one receiving endpoint for decoding a digital video signal coded in a scalable video coding format supporting temporal scalability and at least one of spatial and quality scalability,
- wherein the scalable video coding format for spatial scalability includes a base spatial and at least one spatial enhancement layer, for quality scalability includes a base quality layer and at least one quality enhancement layer, and for temporal scalability includes a base temporal layer and at least one temporal enhancement layer,
- wherein the base temporal layers and enhancement temporal layers are interlinked by a threaded picture prediction structure for at least one of the spatial or quality scalability layers,
- wherein the transmitting endpoint is configured to selectively eliminate or modify portions of its coded video signal that correspond to layers higher than the base spatial or quality layer, prior to creating the output video signal that is forwarded to the at least one receiving endpoint, so that use of lower spatial or quality layer data is signaled or explicitly coded in the output video signal for use in decoding pictures at resolutions higher than the base spatial or quality layer, and
- wherein the transmitting endpoint is further configured to control the transmission rate of the output video signal that is transmitted to the at least one receiving endpoint so that the retained portions of its input video signal that correspond to layers higher than the base spatial or quality layer do not adversely affect the smoothness of the output bit rate.
Dependent claims: 11, 12, 13, 14, 15, 16, 17
-
-
18. A method for video communication over a communication network, having a conferencing server disposed therein and linked to at least one receiving and at least one transmitting endpoint by at least one communication channel each over the communication network, the at least one endpoint transmitting coded digital video using a scalable video coding format, and the at least one receiving endpoint for decoding a digital video signal coded in a scalable video coding format supporting temporal scalability and at least one of spatial and quality scalability, wherein the scalable video coding format for spatial scalability includes a base spatial and at least one spatial enhancement layer, for quality scalability includes a base quality layer and at least one quality enhancement layer, and for temporal scalability includes a base temporal layer and at least one temporal enhancement layer, wherein the base temporal layers and enhancement temporal layers are interlinked by a threaded picture prediction structure for at least one of the spatial or quality scalability layers,
the method comprising:
- at the conferencing server (including a combination of hardware and software), selectively eliminating or modifying portions of the input video signals received from transmitting endpoints that correspond to layers higher than the base spatial or quality layer prior to creating the output video signal that is forwarded to the at least one receiving endpoint, so that use of lower spatial or quality layer data is signaled or explicitly coded in the output video signal for use in decoding pictures at resolutions higher than the base spatial or quality layer, and
- at the conferencing server, controlling the transmission rate of the output video signal that is forwarded to the at least one receiving endpoint so that the retained portions of the input video signals received from transmitting endpoints that correspond to layers higher than the base spatial or quality layer do not adversely affect the smoothness of the output bit rate.
Dependent claims: 19, 20, 21, 22, 23, 24, 25, 26
-
-
27. A video communication method comprising:
- a communication network,
- one endpoint (including a combination of hardware and software) that transmits coded digital video using a scalable video coding format, and
- at least one receiving endpoint for decoding a digital video signal coded in a scalable video coding format supporting temporal scalability and at least one of spatial and quality scalability,
- wherein the scalable video coding format for spatial scalability includes a base spatial and at least one spatial enhancement layer, for quality scalability includes a base quality layer and at least one quality enhancement layer, and for temporal scalability includes a base temporal layer and at least one temporal enhancement layer,
- wherein the base temporal layers and enhancement temporal layers are interlinked by a threaded picture prediction structure for at least one of the spatial or quality scalability layers,
- at the transmitting endpoint, selectively eliminating or modifying portions of its coded video signal that correspond to layers higher than the base spatial or quality layer, prior to creating the output video signal that is forwarded to the at least one receiving endpoint, so that use of lower spatial or quality layer data is signaled or explicitly coded in the output video signal for use in decoding pictures at resolutions higher than the base spatial or quality layer, and
- at the transmitting endpoint, controlling the transmission rate of the output video signal that is transmitted to the at least one receiving endpoint so that the retained portions of its input video signal that correspond to layers higher than the base spatial or quality layer do not adversely affect the smoothness of the output bit rate.
Dependent claims: 28, 29, 30, 31, 32, 33, 34
-
-
35. A non-transitory computer readable medium comprising a set of executable instructions to direct a processor to:
- communicate over a communication network, having a conferencing server disposed therein and linked to at least one receiving and at least one transmitting endpoint by at least one communication channel each over the communication network, the at least one endpoint transmitting coded digital video using a scalable video coding format, and the at least one receiving endpoint for decoding a digital video signal coded in a scalable video coding format supporting temporal scalability and at least one of spatial and quality scalability,
- wherein the scalable video coding format for spatial scalability includes a base spatial and at least one spatial enhancement layer, for quality scalability includes a base quality layer and at least one quality enhancement layer, and for temporal scalability includes a base temporal layer and at least one temporal enhancement layer,
- wherein the base temporal layers and enhancement temporal layers are interlinked by a threaded picture prediction structure for at least one of the spatial or quality scalability layers,
- at the conferencing server (including a combination of hardware and software), selectively eliminate or modify portions of the input video signals received from transmitting endpoints that correspond to layers higher than the base spatial or quality layer prior to creating the output video signal that is forwarded to the at least one receiving endpoint, so that use of lower spatial or quality layer data is signaled or explicitly coded in the output video signal for use in decoding pictures at resolutions higher than the base spatial or quality layer
- at the conferencing server, control the transmission rate of the output video signal that is forwarded to the at least one receiving endpoint so that the retained portions of the input video signals received from transmitting endpoints that correspond to layers higher than the base spatial or quality layer do not adversely affect the smoothness of the output bit rate.
-
-
36. A non-transitory computer readable medium comprising a set of executable instructions to direct a processor to:
- communicate over a communication network coupled to one endpoint (including a combination of hardware and software) that transmits coded digital video using a scalable video coding format, and further coupled to at least one receiving endpoint for decoding a digital video signal coded in a scalable video coding format supporting temporal scalability and at least one of spatial and quality scalability,
- wherein the scalable video coding format for spatial scalability includes a base spatial and at least one spatial enhancement layer, for quality scalability includes a base quality layer and at least one quality enhancement layer, and for temporal scalability includes a base temporal layer and at least one temporal enhancement layer,
- wherein the base temporal layers and enhancement temporal layers are interlinked by a threaded picture prediction structure for at least one of the spatial or quality scalability layers,
- at the transmitting endpoint, selectively eliminate or modify portions of its coded video signal that correspond to layers higher than the base spatial or quality layer, prior to creating the output video signal that is forwarded to the at least one receiving endpoint, so that use of lower spatial or quality layer data is signaled or explicitly coded in the output video signal for use in decoding pictures at resolutions higher than the base spatial or quality layer, and
- at the transmitting endpoint, control the transmission rate of the output video signal that is transmitted to the at least one receiving endpoint so that the retained portions of its input video signal that correspond to layers higher than the base spatial or quality layer do not adversely affect the smoothness of the output bit rate.
-
Specification