SYSTEMS AND METHODS FOR ERROR RESILIENCE AND RANDOM ACCESS IN VIDEO COMMUNICATION SYSTEMS
Abstract
Systems and methods for error-resilient transmission and for random access in video communication systems are provided. The video communication systems are based on single-layer, scalable, or simulcast video coding with temporal scalability. A set of video frames or pictures in a video signal transmission is designated for reliable or guaranteed delivery to receivers using secure or high-reliability links, or by retransmission techniques. The reliably delivered video frames are used as reference pictures for resynchronization of receivers with the transmitted video signal after error incidence and for random access.
91 Claims
1. A system for media communications between a transmitting endpoint and one or more receiving endpoint(s) over a communications network, the network having one or more layered media transport link(s) including a more reliable media transport link to or from each endpoint, the system comprising:
an encoder which encodes transmitted media as frames in a threaded coding structure having a number of different frame types, the frames including a sequence of reference frames;
a decoder which decodes received media based on reference frames therein; and
a particular select subset of the frames in the threaded coding structure that is designated as a type (“R”) for reliable transport to the decoder over the more reliable transport link, wherein the subset of the frames of the type R is particularly selected to include at least the frames of the lowest temporal layer in the threaded coding structure and such that the decoder can decode at least a portion of the received media based on a reliably received frame of the type R after packet loss or error and thereafter is synchronized with the encoder.
Dependent claims: 2-8, 17-39
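As an illustration only (not part of the claims), the designation of type R frames in claim 1 can be sketched as follows. The sketch assumes a dyadic threaded structure with three temporal layers; the names `Frame`, `temporal_layer`, `designate_frames`, and `transport_for` are hypothetical, and the claim itself only requires that the R subset include at least the lowest temporal layer.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Frame:
    index: int
    temporal_layer: int
    reliable: bool  # True marks the frame as type "R"


def temporal_layer(index: int, num_layers: int) -> int:
    """Layer of a frame in an assumed dyadic threaded structure (0 = lowest)."""
    period = 1 << (num_layers - 1)
    pos = index % period
    if pos == 0:
        return 0
    trailing_zeros = (pos & -pos).bit_length() - 1
    return (num_layers - 1) - trailing_zeros


def designate_frames(count: int, num_layers: int = 3) -> list[Frame]:
    """Mark every lowest-temporal-layer frame as type R (assumption: only those)."""
    frames = []
    for i in range(count):
        layer = temporal_layer(i, num_layers)
        frames.append(Frame(index=i, temporal_layer=layer, reliable=(layer == 0)))
    return frames


def transport_for(frame: Frame) -> str:
    """Pick the hypothetical transport link for a frame."""
    return "reliable" if frame.reliable else "best_effort"


frames = designate_frames(8)
layers = [f.temporal_layer for f in frames]
r_indices = [f.index for f in frames if f.reliable]
```

With eight frames and three layers, only frames 0 and 4 fall in the lowest layer, so only they would be sent over the more reliable link; a decoder that receives them can resume decoding from them after a loss in the higher layers.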
9. A system for media communications between a transmitting endpoint and one or more receiving endpoint(s) or server(s) over a communication network, the system comprising:
an encoder which encodes transmitted media as frames in a threaded coding structure having a number of different layers including a lowest temporal layer, wherein transmitted pictures comprise data elements that indicate:
for the lowest temporal level pictures, a sequence number identifying said pictures, and for other temporal level pictures, a reference to the sequence number of the most recent, in decoding order, lowest temporal level picture, such that a receiving endpoint or server can detect if a lowest temporal level picture has been lost by examining if the picture corresponding to the referenced picture frame index has been received at the receiving endpoint or server.
Dependent claims: 10-16
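The loss check described in claim 9 can be illustrated with a minimal receiver-side sketch. It is an assumption-laden example, not the claimed implementation: `PictureHeader` and `L0LossDetector` are hypothetical names, and the data elements are modeled as a single integer field whose meaning depends on the temporal level.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PictureHeader:
    temporal_level: int
    # For level 0: the picture's own sequence number.
    # For other levels: the sequence number of the most recent
    # (in decoding order) level-0 picture.
    l0_seq: int


class L0LossDetector:
    """Tracks which lowest-temporal-level pictures have arrived."""

    def __init__(self) -> None:
        self.received_l0: set[int] = set()

    def on_picture(self, hdr: PictureHeader) -> bool:
        """Return True if this picture reveals a missing level-0 picture."""
        if hdr.temporal_level == 0:
            self.received_l0.add(hdr.l0_seq)
            return False
        # A higher-level picture references a level-0 picture; if that
        # picture never arrived, it must have been lost.
        return hdr.l0_seq not in self.received_l0


detector = L0LossDetector()
first = detector.on_picture(PictureHeader(temporal_level=0, l0_seq=0))
second = detector.on_picture(PictureHeader(temporal_level=2, l0_seq=0))
# The level-0 picture with sequence number 1 is dropped by the network here.
third = detector.on_picture(PictureHeader(temporal_level=2, l0_seq=1))
```

In this run the third picture references level-0 sequence number 1, which was never received, so the loss is detected as soon as any dependent picture arrives rather than only when the next level-0 picture shows up.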
40. A system for decoding compressed digital video that is coded using a technique that provides two or more temporal layers, wherein compressed video pictures are structured into one or more packets, the system comprising:
a packet header containing data elements that indicate:
for the lowest temporal level pictures, a sequence number identifying the pictures, for other temporal level pictures, a reference to the sequence number of the most recent, in decoding order, lowest temporal level picture, so that a receiver can detect loss of a lowest temporal level picture by examining if the picture corresponding to the referenced picture frame index is available at the receiver.
Dependent claims: 41-42
43. A system for decoding compressed digital video that is coded using a technique that provides two or more temporal layers, wherein compressed video pictures are structured into one or more packets, and received over an IP-based network using RTP, the system comprising:
an RTP header extension that includes:
a series number associated with each layer, a sequence number that is associated with each lowest temporal layer picture, and a flag that is used to indicate if a packet contains a picture or picture fragment of the lowest layer temporal picture, wherein the sequence number is referenced by all other pictures that use said lowest temporal layer picture as reference, such that loss of a lowest temporal level picture can be immediately detected by examining if the picture corresponding to the referenced series number and sequence number is available at the system.
Dependent claims: 44
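The three fields named in claim 43 could be carried in a header extension along the following lines. The claim does not give a bit layout, so the 4-byte layout below (one flag byte, an 8-bit series number, a 16-bit sequence number) is purely an assumption for illustration, as are the names `LayerExtension` and `reference_available`. The framing of the RTP extension header itself is omitted.

```python
import struct
from dataclasses import dataclass

# Hypothetical layout: flags (1 byte), series number (1 byte),
# lowest-layer sequence number (2 bytes), network byte order.
_LAYOUT = struct.Struct("!BBH")
_LOWEST_LAYER_FLAG = 0x80


@dataclass(frozen=True)
class LayerExtension:
    is_lowest_layer: bool  # packet carries (part of) a lowest-layer picture
    series: int            # series number associated with the layer
    l0_seq: int            # own or referenced lowest-layer sequence number

    def pack(self) -> bytes:
        flags = _LOWEST_LAYER_FLAG if self.is_lowest_layer else 0
        return _LAYOUT.pack(flags, self.series & 0xFF, self.l0_seq & 0xFFFF)

    @classmethod
    def unpack(cls, data: bytes) -> "LayerExtension":
        flags, series, seq = _LAYOUT.unpack(data[: _LAYOUT.size])
        return cls(bool(flags & _LOWEST_LAYER_FLAG), series, seq)


def reference_available(ext: LayerExtension, received: set[tuple[int, int]]) -> bool:
    """Check that the referenced (series, sequence) lowest-layer picture arrived."""
    return ext.is_lowest_layer or (ext.series, ext.l0_seq) in received


wire = LayerExtension(is_lowest_layer=True, series=1, l0_seq=5).pack()
decoded = LayerExtension.unpack(wire)
dependent = LayerExtension(is_lowest_layer=False, series=1, l0_seq=6)
ok = reference_available(dependent, received={(1, 5)})
```

Here the dependent packet refers to lowest-layer picture (1, 6), which is not in the received set, so `ok` is false and the receiver knows immediately that a lowest temporal level picture is missing.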
45. A system for decoding compressed digital video that is coded using a technique that provides two or more temporal layers, wherein compressed video pictures are structured into one or more packets, and packets of at least the lowest temporal level pictures may be received in a compressed digital video stream from a sender faster than real-time and after their intended presentation time, in case of errors or upon initiating decoding, the system comprising:
a decoder, which in case of errors or upon initiating decoding, decodes the received pictures faster than real-time and, if decoded after their intended presentation time, without displaying them, whereby the decoder can synchronize with the received compressed digital video stream and then after such synchronization is achieved perform normal decoding and presentation.
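The catch-up behavior in claim 45 can be sketched with a simulated clock. This is a simplified illustration under stated assumptions: all pictures are already buffered, decoding one picture costs a fixed `decode_cost` that is shorter than the frame interval, and times are integer milliseconds. The function name `catch_up` and its parameters are hypothetical.

```python
def catch_up(presentation_times: list[int], start: int, decode_cost: int) -> list[int]:
    """Return the indices of pictures that are actually displayed.

    Pictures decoded after their presentation time are decoded only to
    rebuild reference state and are not shown. Once a picture is decoded
    on time, the decoder is synchronized and paces itself normally.
    """
    now = start
    displayed = []
    for idx, pts in enumerate(presentation_times):
        now += decode_cost  # decode immediately, without pacing
        if now > pts:
            continue        # late: used as a reference, never displayed
        now = pts           # on time: wait until the presentation time
        displayed.append(idx)
    return displayed


# Seven pictures at roughly 30 fps; decoding starts 100 ms late.
shown = catch_up([0, 33, 66, 99, 132, 165, 198], start=100, decode_cost=10)
```

In this run the first five pictures are decoded late and skipped for display; the decoder catches up at the sixth picture and presents normally from there.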
46. A method for media communications between a transmitting endpoint and one or more receiving endpoint(s) over a communications network, the network having one or more layered media transport link(s) including a more reliable media transport link to or from each endpoint, the endpoints including:
an encoder which encodes transmitted media as frames in a threaded coding structure having a number of different frame types, the frames including a sequence of reference frames; and
a decoder which decodes received media based on reference frames therein;
the method comprising:
designating a particular select subset of the frames in the threaded coding structure as a type (“R”) for reliable transport to the decoder over the more reliable transport link, wherein the subset of the frames of the type R is particularly selected to include at least the frames of the lowest temporal layer in the threaded coding structure and such that the decoder can decode at least a portion of the received media based on a reliably received frame of the type R after packet loss or error and thereafter is synchronized with the encoder; and
sending the frames of the type R to the decoder.
Dependent claims: 47-53, 62-84, 91
54. A method for media communications between a transmitting endpoint and one or more receiving endpoint(s) or server(s) over a communication network, wherein transmitted media is encoded as frames in a threaded coding structure having a number of different layers including a lowest temporal layer, the method comprising providing data elements that indicate:
for the lowest temporal level pictures, a sequence number identifying said pictures, and for other temporal level pictures, a reference to the sequence number of the most recent, in decoding order, lowest temporal level picture, so that a receiving endpoint or server can detect if a lowest temporal level picture has been lost by examining if the picture corresponding to the referenced picture frame index is available at the receiving endpoint.
Dependent claims: 55-61
85. A method for decoding compressed digital video that is coded using a technique that provides two or more temporal layers, wherein compressed video pictures are structured into one or more packets, the method comprising:
providing data elements in a transmitted packet header to indicate:
for the lowest temporal level pictures, a sequence number identifying the pictures, for other temporal level pictures, a reference to the sequence number of the most recent, in decoding order, lowest temporal level picture, so that a receiver can detect loss of a lowest temporal level picture by examining if the picture corresponding to the referenced picture frame index is available at the receiver.
Dependent claims: 86-87
88. A method for decoding compressed digital video that is coded using a technique that provides two or more temporal layers, wherein compressed video pictures are structured into one or more packets, and received over an IP-based network using RTP, the method comprising:
providing an RTP header extension that includes:
a series number associated with each layer, a sequence number that is associated with each lowest temporal layer picture, and a flag that is used to indicate if a packet contains a picture or picture fragment of the lowest layer temporal picture, wherein the sequence number is referenced by all other pictures that use said lowest temporal layer picture as reference; and
examining the RTP header extension in a received picture to verify availability of the picture corresponding to the referenced series number and sequence number so that loss of a lowest temporal level picture can be detected.
Dependent claims: 89
90. A method for decoding compressed digital video that is coded using a technique that provides two or more temporal layers, wherein compressed video pictures are structured into one or more packets, and packets of at least the lowest temporal level pictures may be received in a compressed digital video stream from a sender faster than real-time and after their intended presentation time, in case of errors or upon initiating decoding, the method comprising:
decoding pictures received at an endpoint faster than real-time and, if decoded after their intended presentation time, without displaying them, whereby the decoder can synchronize with the received compressed digital video stream; and
after such synchronization is achieved, performing normal decoding and presentation.
Specification