Enhanced distortion signaling for MMT assets and ISOBMFF with improved MMT QoS descriptor having multiple QoE operating points
First Claim
1. An apparatus for providing media content in a computer network, the apparatus comprising:
a memory configured to store the media content, the media content including a segment comprising a group of frames; and
at least one processing device configured to:
determine a transmission rate for traffic between the apparatus and a client device;
select a subset of frames to drop from the group of frames based on (i) the transmission rate and (ii) a frame difference distortion (FDIFF) metric of each frame in the subset of frames;
calculate a frame significance (FSIG) value that indicates a relative importance of the frames in the group of frames in a sequence, wherein the FSIG value is defined according to an equation:
$$v_k = d(f_k, f_{k-1}) = (S f_k - S f_{k-1})^T A^T A \, (S f_k - S f_{k-1}),$$
where $v_k$ is a vector representation of the FSIG value, $f_k$ denotes a kth frame, $f_{k-1}$ denotes a previous frame, $S$ denotes a bi-cubicle filtering and down-sampling function, and $A$ denotes a metric;
shape the segment by dropping the selected subset of frames from the group of frames, wherein the shaped segment has a lower bitrate than the segment; and
initiate transmission of the shaped segment to the client device.
Abstract
A method for providing media content in a computer network includes storing the media content, where the media content includes a segment having a group of frames. The method also includes determining a transmission rate for traffic to a client device. The method further includes selecting a subset of frames to drop from the group of frames based on (i) the transmission rate and (ii) a frame difference distortion (FDIFF) metric of each frame in the subset of frames. The method also includes shaping the segment by dropping the selected subset of frames from the group of frames, where the shaped segment has a lower bitrate than the segment. In addition, the method includes transmitting the shaped segment to the client device.
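The selection and shaping steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the greedy ordering, the function names `select_frames_to_drop` and `shape_segment`, and the inputs `frame_bits`, `drop_distortion`, and `duration_s` are all assumptions made for illustration, and decoding dependencies between frames are ignored.

```python
def select_frames_to_drop(frame_bits, drop_distortion, duration_s, rate_bps):
    """Pick frame indices to drop so the segment fits the transmission rate.

    Assumption: frames are dropped greedily in order of increasing
    distortion (a stand-in for the FDIFF metric of each frame).
    """
    total_bits = sum(frame_bits)
    budget_bits = rate_bps * duration_s
    dropped = []
    # Visit frames from least to most distortion when dropped.
    for idx in sorted(range(len(frame_bits)), key=lambda i: drop_distortion[i]):
        if total_bits <= budget_bits:
            break  # the shaped segment already fits the rate
        dropped.append(idx)
        total_bits -= frame_bits[idx]
    return sorted(dropped)


def shape_segment(frames, dropped):
    """Return the group of frames with the selected subset removed."""
    drop = set(dropped)
    return [frame for i, frame in enumerate(frames) if i not in drop]


# Hypothetical 1-second segment of four frames.
bits = [8000, 2000, 3000, 2000]
distortion = [9.0, 1.0, 5.0, 0.5]
to_drop = select_frames_to_drop(bits, distortion, duration_s=1.0, rate_bps=11000)
shaped = shape_segment(["I", "B1", "P", "B2"], to_drop)
```

In this hypothetical input the two low-distortion frames are dropped first, which brings the segment from 15000 bits down to the 11000-bit budget.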
18 Claims
1. An apparatus for providing media content in a computer network, the apparatus comprising:
a memory configured to store the media content, the media content including a segment comprising a group of frames; and
at least one processing device configured to:
determine a transmission rate for traffic between the apparatus and a client device;
select a subset of frames to drop from the group of frames based on (i) the transmission rate and (ii) a frame difference distortion (FDIFF) metric of each frame in the subset of frames;
calculate a frame significance (FSIG) value that indicates a relative importance of the frames in the group of frames in a sequence, wherein the FSIG value is defined according to an equation:
$$v_k = d(f_k, f_{k-1}) = (S f_k - S f_{k-1})^T A^T A \, (S f_k - S f_{k-1}),$$
where $v_k$ is a vector representation of the FSIG value, $f_k$ denotes a kth frame, $f_{k-1}$ denotes a previous frame, $S$ denotes a bi-cubicle filtering and down-sampling function, and $A$ denotes a metric;
shape the segment by dropping the selected subset of frames from the group of frames, wherein the shaped segment has a lower bitrate than the segment; and
initiate transmission of the shaped segment to the client device.
Dependent claims: 2, 3, 4, 5.
6. An apparatus for providing media content in a computer network, the apparatus comprising:
a memory configured to store the media content, the media content including a segment comprising a group of frames; and
at least one processing device configured to:
determine a transmission rate for traffic between the apparatus and a client device;
select a subset of frames to drop from the group of frames based on (i) the transmission rate and (ii) a frame difference distortion (FDIFF) metric of each frame in the subset of frames;
calculate the FDIFF metric according to an equation:
$$d(f_j, f_k) = (S f_j - S f_k)^T A^T A \, (S f_j - S f_k),$$
where $f_j$ denotes a frame actually being displayed, $f_k$ denotes a frame scheduled to be displayed, $S$ denotes a bi-cubicle filtering and down-sampling function, and $A$ denotes a metric;
shape the segment by dropping the selected subset of frames from the group of frames, wherein the shaped segment has a lower bitrate than the segment; and
initiate transmission of the shaped segment to the client device.
Dependent claims: 7.
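The quadratic form recited for the FDIFF metric (and, applied to consecutive frames, for the FSIG value) equals the squared norm of A(S·fj − S·fk). The Python sketch below illustrates that arithmetic only. It rests on stated assumptions: `downsample` uses plain block averaging as a stand-in for the claimed filtering and down-sampling function S, A defaults to the identity, frames are small 2-D lists of luma values, and all function names are hypothetical.

```python
def downsample(frame, factor):
    """Stand-in for S (assumption): average factor x factor blocks, then flatten."""
    height, width = len(frame), len(frame[0])
    out = []
    for r in range(0, height - height % factor, factor):
        for c in range(0, width - width % factor, factor):
            block = [frame[r + i][c + j]
                     for i in range(factor) for j in range(factor)]
            out.append(sum(block) / len(block))
    return out


def frame_distance(f_j, f_k, A=None, factor=2):
    """d(f_j, f_k) = (S f_j - S f_k)^T A^T A (S f_j - S f_k) = ||A (S f_j - S f_k)||^2."""
    s_j, s_k = downsample(f_j, factor), downsample(f_k, factor)
    diff = [a - b for a, b in zip(s_j, s_k)]
    if A is None:
        projected = diff  # assumption: A is the identity when not supplied
    else:
        projected = [sum(a * x for a, x in zip(row, diff)) for row in A]
    return sum(p * p for p in projected)


def frame_significance(frames, A=None, factor=2):
    """Distance of each frame to its predecessor: d(f_k, f_{k-1}) for k >= 1."""
    return [frame_distance(frames[k], frames[k - 1], A, factor)
            for k in range(1, len(frames))]


# Two tiny hypothetical frames: all-black, then uniformly brighter.
f0 = [[0, 0], [0, 0]]
f1 = [[4, 4], [4, 4]]
significance = frame_significance([f0, f1, f1])
```

A repeated frame gets a significance of zero in this sketch, which matches the intuition that dropping it changes little of what the viewer sees.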
8. An apparatus for providing media content in a computer network, the apparatus comprising:
a memory configured to store the media content, the media content including a segment comprising a group of frames; and
at least one processing device configured to:
determine a transmission rate for traffic between the apparatus and a client device;
select a subset of frames to drop from the group of frames based on (i) the transmission rate and (ii) a frame difference distortion (FDIFF) metric of each frame in the subset of frames;
shape the segment by dropping the selected subset of frames from the group of frames, wherein the shaped segment has a lower bitrate than the segment; and
initiate transmission of the shaped segment to the client device, wherein:
a frame loss temporal distortion (FLTD) metric is an FSIG-weighted FLTD metric based on frame significance (FSIG) values; and
the at least one processing device is configured to calculate the FSIG-weighted FLTD metric according to an equation:
Dependent claims: 9.
10. A system for providing media content in a computer network, the system comprising:
a memory configured to store the media content, the media content including a segment comprising a group of frames; and
at least one processing device configured to:
generate multiple operating points of bitrate reduction by performing a gradient search for each of the operating points;
generate a set of Quality of Experience (QoE) parameters for each of the operating points, wherein the set of QoE parameters for each of the operating points includes:
a frame drop index set indicating a list of frames that when dropped from the group of frames yield (i) an aggregate bitrate reduction corresponding to the associated operating point and (ii) a shaped segment containing remaining frames; and
at least one of: an aggregate bitrate resulting from dropping the frames indicated by the frame drop index set, a spatial distortion metric of the segment, and a frame loss temporal distortion (FLTD) metric of the segment;
calculate a frame significance (FSIG) value that indicates a relative importance of the frames in the group of frames in a sequence, the FSIG value defined according to an equation:
$$v_k = d(f_k, f_{k-1}) = (S f_k - S f_{k-1})^T A^T A \, (S f_k - S f_{k-1}),$$
where $v_k$ is a vector representation of the FSIG value, $f_k$ denotes a kth frame, $f_{k-1}$ denotes a previous frame, $S$ denotes a bi-cubicle filtering and down-sampling function, and $A$ denotes a metric; and
initiate transmission of an Asset Delivery Characteristic (ADC) of the media content, the ADC including the operating points and the set of QoE parameters corresponding to each of the operating points.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18.
16. The system of claim 15, wherein the at least one processing device is further configured to calculate the FSIG-weighted FLTD metric based on a number of dependent frames having a decoding dependency from an encoded frame in order to increase the FSIG-weighted FLTD metric for each of the dependent frames.
17. The system of claim 10, further comprising a router configured to:
receive the transmitted ADC;
determine a transmission rate for traffic between the router and a client device;
select a subset of frames to drop from the group of frames based on (i) the transmission rate and (ii) a frame loss temporal distortion (FLTD) metric of each frame in the subset of frames;
shape the segment by dropping the selected subset of frames from the group of frames, wherein the shaped segment has a lower bitrate than the segment; and
initiate transmission of the shaped segment to the client device.
18. The system of claim 17, wherein:
the router is further configured to select the operating point whose aggregate bitrate is closest to the transmission rate; and
the router is configured to select the subset of frames as a list of frames indicated by a frame drop index set associated with the selected operating point.
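The router-side choice recited in claim 18 can be sketched in a few lines of Python. This is an illustration under assumptions, not the ADC wire format: the `OperatingPoint` class, its field names, and `select_operating_point` are hypothetical, and the bitrates and index sets are invented example values.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OperatingPoint:
    """Hypothetical in-memory view of one operating point carried in an ADC."""
    aggregate_bitrate: float      # bitrate after dropping the listed frames
    frame_drop_indices: tuple     # frame drop index set for this point
    fltd: float = 0.0             # frame loss temporal distortion, if signaled


def select_operating_point(points, transmission_rate):
    """Pick the point whose aggregate bitrate is closest to the measured rate."""
    return min(points, key=lambda p: abs(p.aggregate_bitrate - transmission_rate))


# Invented example: three operating points for one segment.
points = [
    OperatingPoint(1000.0, ()),
    OperatingPoint(800.0, (3, 7)),
    OperatingPoint(500.0, (1, 3, 5, 7)),
]
chosen = select_operating_point(points, transmission_rate=760.0)
frames_to_drop = list(chosen.frame_drop_indices)
```

The sketch follows the claim wording literally, so the closest point may sit slightly above the measured rate; a deployment that must never exceed the rate could restrict the candidates to points at or below it before taking the minimum.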
Specification