Media coding for loss recovery with remotely predicted data units
First Claim
1. A system for decoding video data streamed in encoded form over a network characterized by variable available bandwidth and latency, the system including:
- a processor;
memory; and
computer-readable media storing computer-executable instructions for causing the system to operate as a parser and a decoder;
the parser being adapted for parsing part of a bitstream to determine a type value for a unit of video data in one video image, wherein the type value is one of plural possible type values including an intra type value, a first single-unit single-reference inter-prediction type value, and a second single-unit single-reference inter-prediction type value; and
the decoder being adapted for;
when the type value for the unit is the intra type value, decoding at least some of the video data for the unit using an intra decoding mode including intra decoding;
when the type value for the unit is the first single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using a first inter decoding mode that favors quality at the expense of decoding flexibility for a given bitrate, wherein the decoder uses at most one motion vector and one reference identifier per block in the first inter decoding mode; and
when the type value for the unit is the second single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using a second inter decoding mode different than the first inter decoding mode, wherein the second inter decoding mode favors decoding flexibility at the expense of quality for the given bitrate, and wherein the decoder also uses at most one motion vector and one reference identifier per block in the second inter decoding mode;
thereby enabling the decoder to select between the second inter decoding mode and the first inter decoding mode during decoding.
1 Assignment
0 Petitions
Accused Products
Abstract
An improved loss recovery method for coding streaming media classifies each data unit in the media stream as an independent data unit (I unit), a remotely predicted unit (R unit) or a predicted data unit (P unit). Each of these units is organized into independent segments having an I unit, multiple P units and R units interspersed among the P units. The beginning of each segment is the start of a random access point, while each R unit provides a loss recovery point that can be placed independently of the I unit. This approach separates the random access point from the loss recovery points provided by the R units, and makes the stream more impervious to data losses without substantially impacting coding efficiency. The most important data units are transmitted with the most reliability to ensure that the majority of the data received by the client is usable. The I units are the least sensitive to transmission losses because they are coded using only their own data. While they provide the best coding efficiency, the P units are the most sensitive to data loss because the loss of one P unit renders useless all of the P units that depend on it. The remotely predicted units are dependent on the I unit, or in an alternative implementation, on another R unit.
138 Citations
20 Claims
-
1. A system for decoding video data streamed in encoded form over a network characterized by variable available bandwidth and latency, the system including:
-
a processor; memory; and computer-readable media storing computer-executable instructions for causing the system to operate as a parser and a decoder; the parser being adapted for parsing part of a bitstream to determine a type value for a unit of video data in one video image, wherein the type value is one of plural possible type values including an intra type value, a first single-unit single-reference inter-prediction type value, and a second single-unit single-reference inter-prediction type value; and the decoder being adapted for; when the type value for the unit is the intra type value, decoding at least some of the video data for the unit using an intra decoding mode including intra decoding; when the type value for the unit is the first single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using a first inter decoding mode that favors quality at the expense of decoding flexibility for a given bitrate, wherein the decoder uses at most one motion vector and one reference identifier per block in the first inter decoding mode; and when the type value for the unit is the second single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using a second inter decoding mode different than the first inter decoding mode, wherein the second inter decoding mode favors decoding flexibility at the expense of quality for the given bitrate, and wherein the decoder also uses at most one motion vector and one reference identifier per block in the second inter decoding mode; thereby enabling the decoder to select between the second inter decoding mode and the first inter decoding mode during decoding. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for decoding video data including plural units, wherein each of the plural units is for video data in one video image, the system comprising:
-
a processor; memory; and computer-readable media storing computer-executable instructions for causing the system to operate as a parser and a decoder; the parser being adapted for, for each of the plural units, parsing part of a bitstream to determine a type value for the unit, wherein the type value is one of plural possible type values, the plural possible type values including an intra type value, a first single-unit single-reference inter-prediction type value, and a second single-unit single-reference inter-prediction type value; and the decoder being adapted for, for each of the plural units; when the type value for the unit is the intra type value, decoding at least some of the video data for the unit using an intra decoding mode; when the type value for the unit is the first single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using a first inter decoding mode, wherein the decoder uses at most one motion vector and one reference identifier per block in the first inter decoding mode; and when the type value for the unit is the second single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using a second inter decoding mode, wherein the decoder also uses at most one motion vector and one reference identifier per block in the second inter decoding mode, and wherein the second inter decoding mode differs from the first inter decoding mode; wherein, for motion compensation in the first and second inter decoding modes, the decoder is further adapted to select from among plural reference images based upon one or more reference identifiers parsed from the bitstream in addition to the type values for the plural units. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A system for decoding video data, the system comprising:
-
a processor; memory; and computer-readable media storing computer-executable instructions for causing the system to operate as a parser and a decoder; the parser being adapted for parsing part of a bitstream to determine a type value for a unit of the video data, wherein the type value is one of plural possible type values, the plural possible type values including an intra type value, a first single-unit single-reference inter-prediction type value, and a second single-unit single-reference inter-prediction type value, wherein the type value for the unit is in encoded form in the bitstream, and wherein the parsing comprises entropy decoding the type value for the unit; and the decoder being adapted for; when the type value for the unit is the intra type value, decoding at least some of the video data for the unit using intra decoding; when the type value for the unit is the first single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using first inter decoding, wherein the decoder uses at most one motion vector and one reference identifier per block in the first inter decoding; and when the type value for the unit is the second single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using second inter decoding different than the first inter decoding, wherein the decoder also uses at most one motion vector and one reference identifier per block in the second inter decoding. - View Dependent Claims (17, 18, 19, 20)
-
Specification