System and method for thinning of scalable video coding bit-streams
Abstract
A system for videoconferencing that offers, among other features, extremely low end-to-end delay as well as very high scalability. The system accommodates heterogeneous receivers and networks, as well as the best-effort nature of networks such as those based on the Internet Protocol. The system relies on scalable video coding to provide a coded representation of a source video signal at multiple temporal, quality, and spatial resolutions. These resolutions are represented by distinct bitstream components that are created at each end-user encoder. System architecture and processes called SVC Thinning allow the separation of data into data used for prediction in other pictures and data not used for prediction in other pictures. SVC Thinning processes, which can be performed at video conferencing endpoints or at MCUs, can selectively remove or replace with fewer bits the data not used for prediction in other pictures from transmitted bit streams. This separation and selective removal or replacement of data for transmission allows a trade-off between scalability support (i.e. number of decodable video resolutions), error resiliency and coding efficiency.
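The two thinning variants named in the abstract (removing data, or replacing it with fewer bits) can be illustrated with a toy model. This is only a sketch under stated assumptions and does not parse a real SVC bitstream: the `LayerUnit` type, the `thin` and `total_size` functions, and the one-byte `PLACEHOLDER` are hypothetical names introduced here. It also assumes each unit arrives already split into data used for prediction and data not used for prediction.

```python
from dataclasses import dataclass, replace
from typing import List

# Hypothetical stand-in for "information that requires fewer bits".
PLACEHOLDER = b"\x00"


@dataclass(frozen=True)
class LayerUnit:
    """Toy coded unit; not an actual SVC NAL unit."""
    layer: int                   # scalable layer index, 0 = lowest layer
    prediction_data: bytes       # data other pictures or layers predict from
    non_prediction_data: bytes   # data not used for prediction


def thin(units: List[LayerUnit], target_layer: int, mode: str = "replace") -> List[LayerUnit]:
    """Shrink non-prediction data in every layer below target_layer.

    mode="replace" swaps the data for PLACEHOLDER; mode="remove" drops it.
    Units at or above target_layer pass through unchanged.
    """
    if mode not in ("replace", "remove"):
        raise ValueError(f"unknown mode: {mode!r}")
    filler = PLACEHOLDER if mode == "replace" else b""
    thinned = []
    for unit in units:
        # Only touch lower layers, and only when the result is actually smaller.
        if unit.layer < target_layer and len(filler) < len(unit.non_prediction_data):
            unit = replace(unit, non_prediction_data=filler)
        thinned.append(unit)
    return thinned


def total_size(units: List[LayerUnit]) -> int:
    """Total payload size in bytes across all units."""
    return sum(len(u.prediction_data) + len(u.non_prediction_data) for u in units)


units = [
    LayerUnit(0, b"aaaa", b"bbbbbb"),
    LayerUnit(1, b"cc", b"dddd"),
    LayerUnit(2, b"ee", b"ffff"),
]
print(total_size(units))                     # 22
print(total_size(thin(units, 2, "replace")))  # 14
print(total_size(thin(units, 2, "remove")))   # 12
```

In this toy model the prediction data of the lower layers is never touched, so the target layer can still be decoded, while the replacement mode keeps a placeholder in every unit and the removal mode keeps nothing. Whether a real thinned stream remains conforming depends on the actual bitstream syntax, which this sketch does not model.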
20 Claims
1. A method for processing an input digital video signal, encoded in a scalable video coding format that supports one or more of spatial and quality scalability in order to produce an output digital video signal at an intended resolution, the method comprising:
for the output signal, replacing information in a plurality of scalable layers of at least one input video signal lower than a target layer that corresponds to the intended resolution such that information not required to decode the output video signal at the intended resolution is replaced by information that requires fewer bits in the output video signal, and wherein the output video signal with the replaced information is still conforming to the scalable video coding format, wherein a side information input data enables the replacement of the information not required to decode the output video signal at the intended resolution of the side information input data without fully parsing the entire input video signal.
Dependent claims: 2, 5, 6, 7
3. A method for processing an input digital video signal, encoded in a scalable video coding format that supports one or more of spatial and quality scalability in order to produce an output digital video signal at an intended resolution, the method comprising:
for the output signal, removing information in a plurality of scalable layers of at least one input video signal lower than a target layer that corresponds to the intended resolution such that information not required to decode the output video signal at the intended resolution is removed in the output video signal, wherein a side information input data enables the elimination of the information not required to decode the output video signal at the intended resolution of the side information input data without fully parsing the entire input video signal.
Dependent claims: 4, 8, 9, 10
11. A non-transitory computer readable medium comprising a set of instructions to direct a processor to:
process an input digital video signal, encoded in a scalable video coding format that supports one or more of spatial and quality scalability in order to produce an output digital video signal at an intended resolution, by; for the output signal, replacing information in a plurality of scalable layers of at least one input video signal lower than a target layer that corresponds to the intended resolution such that information not required to decode the output video signal at the intended resolution is replaced by information that requires fewer bits in the output video signal, and wherein the output video signal with the replaced information is still conforming to the scalable video coding format, wherein a side information input data enables the replacement of the information not required to decode the output video signal at the intended resolution of the side information input data without fully parsing the entire input video signal.
Dependent claims: 12, 13, 14, 15
16. A non-transitory computer readable medium comprising a set of instructions to direct a processor to:
process an input digital video signal, encoded in a scalable video coding format that supports one or more of spatial and quality scalability in order to produce an output digital video signal at an intended resolution, by; for the output signal, removing information in a plurality of scalable layers of at least one input video signal lower than a target layer that corresponds to the intended resolution such that information not required to decode the output video signal at the intended resolution is removed in the output video signal, wherein a side information input data enables the elimination of the information not required to decode the output video signal at the intended resolution of the side information input data without fully parsing the entire input video signal.
Dependent claims: 17, 18, 19, 20
Specification