Media coding for loss recovery with remotely predicted data units

US 7,734,821 B2
Filed: 03/22/2005
Issued: 06/08/2010
Est. Priority Date: 03/12/1999
Status: Expired due to Term



Abstract

An improved loss recovery method for coding streaming media classifies each data unit in the media stream as an independent data unit (I unit), a remotely predicted unit (R unit) or a predicted data unit (P unit). Each of these units is organized into independent segments having an I unit, multiple P units and R units interspersed among the P units. The beginning of each segment is the start of a random access point, while each R unit provides a loss recovery point that can be placed independently of the I unit. This approach separates the random access point from the loss recovery points provided by the R units, and makes the stream more impervious to data losses without substantially impacting coding efficiency. The most important data units are transmitted with the most reliability to ensure that the majority of the data received by the client is usable. The I units are the least sensitive to transmission losses because they are coded using only their own data. While they provide the best coding efficiency, the P units are the most sensitive to data loss because the loss of one P unit renders useless all of the P units that depend on it. The remotely predicted units are dependent on the I unit, or in an alternative implementation, on another R unit.
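
The dependency structure described in the abstract can be illustrated with a minimal sketch. It assumes that each P unit is predicted from the unit immediately before it and that each R unit is predicted from the I unit of its segment; the alternative in which an R unit depends on another R unit is not modeled. The names `Unit`, `build_segment`, and `decodable` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass
class Unit:
    kind: str            # "I", "R", or "P"
    ref: Optional[int]   # index of the unit this one is predicted from (None for I)


def build_segment(pattern: str) -> List[Unit]:
    """Build one segment from a pattern such as "IPPRPP".

    Assumed rules: a P unit references the unit just before it;
    an R unit references the segment's I unit (index 0).
    """
    if not pattern or pattern[0] != "I" or "I" in pattern[1:]:
        raise ValueError("a segment must start with its single I unit")
    units: List[Unit] = []
    for i, kind in enumerate(pattern):
        if kind == "I":
            units.append(Unit("I", None))
        elif kind == "R":
            units.append(Unit("R", 0))
        elif kind == "P":
            units.append(Unit("P", i - 1))
        else:
            raise ValueError(f"unknown unit kind: {kind!r}")
    return units


def decodable(units: List[Unit], lost: Set[int]) -> List[bool]:
    """Return, per unit, whether it can be decoded given the lost unit indices."""
    ok: List[bool] = []
    for i, unit in enumerate(units):
        received = i not in lost
        ref_ok = unit.ref is None or ok[unit.ref]
        ok.append(received and ref_ok)
    return ok


segment = build_segment("IPPRPP")
print(decodable(segment, lost={1}))
```

Under these assumptions, losing the first P unit also makes the P unit after it unusable, while the R unit and the P units that follow it remain decodable because the R unit refers back to the I unit.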


20 Claims

1. A system for decoding video data streamed in encoded form over a network characterized by variable available bandwidth and latency, the system including:
- a processor;
  
  memory; and
  
  computer-readable media storing computer-executable instructions for causing the system to operate as a parser and a decoder;
  
  the parser being adapted for parsing part of a bitstream to determine a type value for a unit of video data in one video image, wherein the type value is one of plural possible type values including an intra type value, a first single-unit single-reference inter-prediction type value, and a second single-unit single-reference inter-prediction type value; and
  
  the decoder being adapted for:
  
  when the type value for the unit is the intra type value, decoding at least some of the video data for the unit using an intra decoding mode including intra decoding;
  
  when the type value for the unit is the first single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using a first inter decoding mode that favors quality at the expense of decoding flexibility for a given bitrate, wherein the decoder uses at most one motion vector and one reference identifier per block in the first inter decoding mode; and
  
  when the type value for the unit is the second single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using a second inter decoding mode different than the first inter decoding mode, wherein the second inter decoding mode favors decoding flexibility at the expense of quality for the given bitrate, and wherein the decoder also uses at most one motion vector and one reference identifier per block in the second inter decoding mode;
  
  thereby enabling the decoder to select between the second inter decoding mode and the first inter decoding mode during decoding.
- Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
  - 2. The system of claim 1 wherein the unit is a video frame or a macroblock of the video image.
  - 3. The system of claim 1 wherein the unit is less than the entire video image.
  - 4. The system of claim 1 wherein, for motion compensation in the first and second inter decoding modes, the decoder is further adapted to select from among plural reference images based upon one or more reference identifiers parsed from the bitstream in addition to the type value for the unit.
  - 5. The system of claim 4 wherein the decoder is further adapted to update the plural reference images, including adding a current image to the plural reference images and removing a previous image from the plural reference images.
  - 6. The system of claim 1 wherein the bitstream includes plural reference identifiers for plural blocks in addition to the type value for the unit.
  - 7. The system of claim 6 wherein for at least one block the decoder uses one motion vector and one reference identifier and for at least one block the decoder uses one motion vector and no reference identifier.
  - 8. The system of claim 6 wherein the plural blocks are macroblocks.
  - 9. The system of claim 1 wherein the type value for the unit is in encoded form in the bitstream, and wherein the parsing includes entropy decoding the type value for the unit.
  - 10. The system of claim 1 wherein the parser includes means for performing the parsing, and wherein the decoder includes means for performing the decoding.
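
The parse-then-dispatch arrangement recited in claim 1 can be sketched as follows. This is an illustration only: the claim does not specify how the type value is laid out in the bitstream, so the two-bit header field, the numeric codes, and every name here (`UnitType`, `parse_type_value`, `decode_unit`, and the placeholder decoding functions) are assumptions.

```python
from enum import Enum
from typing import Callable, Dict


class UnitType(Enum):
    INTRA = 0
    INTER_FIRST = 1    # first single-unit single-reference inter-prediction type
    INTER_SECOND = 2   # second single-unit single-reference inter-prediction type


def parse_type_value(header_byte: int) -> UnitType:
    """Hypothetical parser: read the type value from the low 2 bits of a header byte."""
    code = header_byte & 0b11
    try:
        return UnitType(code)
    except ValueError:
        raise ValueError(f"reserved type value: {code}") from None


# Placeholder decoding modes; a real decoder would reconstruct pixel data here.
def decode_intra(payload: bytes) -> str:
    return f"intra({len(payload)} bytes)"


def decode_first_inter(payload: bytes) -> str:
    return f"first_inter({len(payload)} bytes)"


def decode_second_inter(payload: bytes) -> str:
    return f"second_inter({len(payload)} bytes)"


DECODERS: Dict[UnitType, Callable[[bytes], str]] = {
    UnitType.INTRA: decode_intra,
    UnitType.INTER_FIRST: decode_first_inter,
    UnitType.INTER_SECOND: decode_second_inter,
}


def decode_unit(header_byte: int, payload: bytes) -> str:
    """Select the decoding mode from the parsed type value and apply it."""
    return DECODERS[parse_type_value(header_byte)](payload)


print(decode_unit(0x01, b"abc"))
```

A lookup table keeps the mode selection separate from the decoding routines, so the two inter modes can differ internally while sharing the constraint of at most one motion vector and one reference identifier per block.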

11. A system for decoding video data including plural units, wherein each of the plural units is for video data in one video image, the system comprising:
- a processor;
  
  memory; and
  
  computer-readable media storing computer-executable instructions for causing the system to operate as a parser and a decoder;
  
  the parser being adapted for, for each of the plural units, parsing part of a bitstream to determine a type value for the unit, wherein the type value is one of plural possible type values, the plural possible type values including an intra type value, a first single-unit single-reference inter-prediction type value, and a second single-unit single-reference inter-prediction type value; and
  
  the decoder being adapted for, for each of the plural units:
  
  when the type value for the unit is the intra type value, decoding at least some of the video data for the unit using an intra decoding mode;
  
  when the type value for the unit is the first single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using a first inter decoding mode, wherein the decoder uses at most one motion vector and one reference identifier per block in the first inter decoding mode; and
  
  when the type value for the unit is the second single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using a second inter decoding mode, wherein the decoder also uses at most one motion vector and one reference identifier per block in the second inter decoding mode, and wherein the second inter decoding mode differs from the first inter decoding mode;
  
  wherein, for motion compensation in the first and second inter decoding modes, the decoder is further adapted to select from among plural reference images based upon one or more reference identifiers parsed from the bitstream in addition to the type values for the plural units.
- Dependent Claims (12, 13, 14, 15)
  - 12. The system of claim 11 wherein at least one of the plural units is less than an entire video image.
  - 13. The system of claim 12 wherein at least one of the plural units is a macroblock of a video image.
  - 14. The system of claim 11 wherein the decoder is further adapted to update the plural reference images, including adding a current image to the plural reference images and removing a previous image from the plural reference images.
  - 15. The system of claim 11 wherein for at least one block the decoder uses one motion vector and one reference identifier and for at least one block the decoder uses one motion vector and no reference identifier.
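
Claims 11 and 14 describe selecting among plural reference images by a reference identifier and updating that set by adding the current image and removing a previous one. A minimal sketch of one way this could work is given below. It assumes a fixed-capacity buffer in which identifier 0 names the most recently added image, represents images as lists of rows, and clamps out-of-range coordinates to the image edge; none of these choices is taken from the claims, and `ReferenceBuffer` and `predict_block` are hypothetical names.

```python
from collections import deque
from typing import Deque, List, Tuple

Image = List[List[int]]


class ReferenceBuffer:
    """Fixed-capacity set of reference images; identifier 0 is the newest."""

    def __init__(self, capacity: int) -> None:
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self._images: Deque[Image] = deque(maxlen=capacity)

    def update(self, image: Image) -> None:
        # Adding at the left drops the oldest image from the right when full.
        self._images.appendleft(image)

    def select(self, ref_id: int) -> Image:
        if not 0 <= ref_id < len(self._images):
            raise IndexError(f"no reference image with identifier {ref_id}")
        return self._images[ref_id]

    def __len__(self) -> int:
        return len(self._images)


def predict_block(buf: ReferenceBuffer, ref_id: int, x: int, y: int,
                  mv: Tuple[int, int], size: int) -> Image:
    """Fetch a size-by-size block using one motion vector and one reference identifier."""
    ref = buf.select(ref_id)
    height, width = len(ref), len(ref[0])
    dx, dy = mv
    return [
        [ref[min(max(y + dy + r, 0), height - 1)][min(max(x + dx + c, 0), width - 1)]
         for c in range(size)]
        for r in range(size)
    ]


older = [[r * 4 + c for c in range(4)] for r in range(4)]
newer = [[v + 100 for v in row] for row in older]
buf = ReferenceBuffer(capacity=2)
buf.update(older)
buf.update(newer)
print(predict_block(buf, ref_id=1, x=0, y=0, mv=(1, 1), size=2))
```

Here the block at position (0, 0) is predicted from the older image (identifier 1) displaced by the motion vector (1, 1). Calling `update` a third time would discard that older image.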

16. A system for decoding video data, the system comprising:
- a processor;
  
  memory; and
  
  computer-readable media storing computer-executable instructions for causing the system to operate as a parser and a decoder;
  
  the parser being adapted for parsing part of a bitstream to determine a type value for a unit of the video data, wherein the type value is one of plural possible type values, the plural possible type values including an intra type value, a first single-unit single-reference inter-prediction type value, and a second single-unit single-reference inter-prediction type value, wherein the type value for the unit is in encoded form in the bitstream, and wherein the parsing comprises entropy decoding the type value for the unit; and
  
  the decoder being adapted for:
  
  when the type value for the unit is the intra type value, decoding at least some of the video data for the unit using intra decoding;
  
  when the type value for the unit is the first single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using first inter decoding, wherein the decoder uses at most one motion vector and one reference identifier per block in the first inter decoding; and
  
  when the type value for the unit is the second single-unit single-reference inter-prediction type value, decoding at least some of the video data for the unit using second inter decoding different than the first inter decoding, wherein the decoder also uses at most one motion vector and one reference identifier per block in the second inter decoding.
- Dependent Claims (17, 18, 19, 20)
  - 17. The system of claim 16 wherein the unit is a video frame or a macroblock of a video image.
  - 18. The system of claim 16 wherein the unit is less than an entire video image.
  - 19. The system of claim 16 wherein, for the motion compensation in the first and second inter decoding, the decoder is further adapted to select from among plural reference images based upon one or more reference identifiers parsed from the bitstream in addition to the type value for the unit.
  - 20. The system of claim 16 wherein for at least one block the decoder uses one motion vector and one reference identifier, and for at least one block the decoder uses one motion vector and no reference identifier.
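
Claim 16 recites that the type value is in encoded form in the bitstream and that parsing comprises entropy decoding it. One common form of entropy coding is a variable-length prefix code, sketched below. The code table is invented for illustration (the claim does not give one), and `VLC_TABLE`, `bits_of`, and `decode_type_value` are hypothetical names.

```python
from typing import Dict, Iterable, Iterator

# Assumed prefix code: shorter codes for type values expected to occur more often.
VLC_TABLE: Dict[str, str] = {
    "0": "inter_first",
    "10": "inter_second",
    "11": "intra",
}


def bits_of(data: bytes) -> Iterator[str]:
    """Yield the bits of a byte string, most significant bit first."""
    for byte in data:
        for shift in range(7, -1, -1):
            yield str((byte >> shift) & 1)


def decode_type_value(bits: Iterable[str]) -> str:
    """Consume bits until they match one code in the table, then return its type."""
    code = ""
    for bit in bits:
        code += bit
        if code in VLC_TABLE:
            return VLC_TABLE[code]
    raise ValueError("bitstream ended inside a type code")


stream = bits_of(bytes([0b11010000]))
print([decode_type_value(stream) for _ in range(3)])
```

Because no code in the table is a prefix of another, the parser can read type values back to back from the same bit iterator without any separator between them.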


Current Assignee
Microsoft Technology Licensing LLC (Microsoft Corporation)
Original Assignee
Microsoft Corporation
Inventors
Wang, Albert Szu-Chi; Lee, Ming-Chieh
Primary Examiner(s)
Nawaz, Asad M

Application Number
US11/088,696
Publication Number
US 20050198346A1
Time in Patent Office
1,904 Days
Field of Search
709/201-202, 709/246, 709/230-237, 348/384.1, 348/390.1, 348/401.1, 348/405.1, 348/402.1, 348/413.1, 348/409.1, 348/416.1, 348/387.1, 725/19, 725/20, 708/1, 708/164-170, 708/203-206, 341/50-107
US Class Current
709/246
CPC Class Codes
G10L 19/005   Correction of errors induce...
H04L 65/70   Media network packetisation
H04N 19/107   between spatial and tempora...
H04N 19/114   Adapting the group of pictu...
H04N 19/124   Quantisation
H04N 19/137   Motion inside a coding unit...
H04N 19/166   concerning the amount of tr...
H04N 19/176   the region being a block, e...
H04N 19/177   the unit being a group of p...
H04N 19/20   using video object coding
H04N 19/46   Embedding additional inform...
H04N 19/50   using predictive coding H04...
H04N 19/503   involving temporal predicti...
H04N 19/513   Processing of motion vectors
H04N 19/573   Motion compensation with mu...
H04N 19/577   Motion compensation with bi...
H04N 19/58   Motion compensation with lo...
H04N 19/593   involving spatial predictio...
H04N 19/61   in combination with predict...
H04N 19/89   involving methods or arrang...
H04N 19/91   Entropy coding, e.g. variab...
H04N 21/6377   directed to server one-way ...
H04N 21/658   Transmission by the client ...
