Method and apparatus for shot detection in video streaming
Abstract
The invention comprises devices and methods for processing multimedia data. Such methods can include obtaining at least one metric indicative of a difference for a selected frame and adjacent frames in a plurality of video frames, the metric comprising bi-directional motion information and luminance difference information; determining a shot event in the selected frame based on the at least one metric; and adaptively encoding the selected frame based on the shot event. An apparatus for processing multimedia data can include a motion compensator configured to obtain metrics indicative of a difference between adjacent frames of a plurality of video frames, said metrics comprising bi-directional motion information and luminance information; a shot classifier configured to determine a shot event in the plurality of video frames based on said metrics; and an encoder configured to adaptively encode the plurality of frames based on the shot event.
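The three stages named in the abstract (obtain metrics, classify the shot event, encode adaptively) can be pictured as a simple per-frame loop. The following is only a sketch under stated assumptions: the function name `process_frames` and the three callables passed to it are hypothetical placeholders, and the patent text here does not specify how the metrics are computed or how the encoder adapts.

```python
from typing import Any, Callable, List, Optional, Sequence


def process_frames(
    frames: Sequence[Any],
    obtain_metrics: Callable[[Optional[Any], Any, Optional[Any]], Any],
    classify_shot_event: Callable[[Any], Any],
    encode: Callable[[Any, Any], Any],
) -> List[Any]:
    """Run the metric -> shot event -> adaptive encode stages on each frame.

    All three callables are hypothetical stand-ins for the motion
    compensator, shot classifier, and encoder described in the abstract.
    """
    encoded = []
    for i, frame in enumerate(frames):
        # Temporally adjacent frames; None at the ends of the sequence.
        prev_frame = frames[i - 1] if i > 0 else None
        next_frame = frames[i + 1] if i + 1 < len(frames) else None

        # Stage 1: metrics for the selected frame and its neighbours.
        metrics = obtain_metrics(prev_frame, frame, next_frame)
        # Stage 2: shot event decided from those metrics.
        event = classify_shot_event(metrics)
        # Stage 3: encoding that depends on the shot event.
        encoded.append(encode(frame, event))
    return encoded
```

Passing the stages in as callables keeps the sketch neutral about the actual metric and encoder design, which this section leaves open.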
36 Claims
1. A method of processing multimedia data, the method comprising:
obtaining at least one metric indicative of a difference between a selected frame and frames temporally adjacent to the selected frame in a plurality of video frames, said at least one metric comprising bi-directional motion information and luminance difference information associated with the selected frame and the frames temporally adjacent to the selected frame;
determining a shot event associated with the selected frame based on said at least one metric, wherein said determining a shot event comprises determining that the selected frame contains at least one camera flashlight if the selected frame is not an abrupt frame or a slow change frame, an average luminance of the selected frame minus an average luminance of a temporally adjacent frame of the selected frame is equal to or exceeds a first threshold value, and a forward difference metric and a backward difference metric associated with the selected frame are both equal to or exceed a second threshold value; and
adaptively encoding the selected frame based on the shot event.
Dependent claims: 2-19.
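The camera-flashlight condition recited in claim 1 is a conjunction of three tests, which may be easier to follow as code. The sketch below is illustrative only: the `FrameMetrics` container, its field names, and the function `contains_camera_flash` are hypothetical, the thresholds are left as parameters because the claim does not give values, and "not an abrupt frame or a slow change frame" is read here as "neither of the two".

```python
from dataclasses import dataclass


@dataclass
class FrameMetrics:
    """Hypothetical container for the quantities named in claim 1."""
    avg_luma: float           # average luminance of the selected frame
    adjacent_avg_luma: float  # average luminance of a temporally adjacent frame
    forward_diff: float       # forward difference metric
    backward_diff: float      # backward difference metric
    is_abrupt: bool           # frame was classified as an abrupt frame
    is_slow_change: bool      # frame was classified as a slow change frame


def contains_camera_flash(m: FrameMetrics,
                          first_threshold: float,
                          second_threshold: float) -> bool:
    """Apply the three-part flashlight test as worded in claim 1."""
    # Condition 1: the frame is neither abrupt nor slow change
    # (assumed reading of the claim language).
    if m.is_abrupt or m.is_slow_change:
        return False
    # Condition 2: the luminance difference meets the first threshold.
    luma_jump = m.avg_luma - m.adjacent_avg_luma
    # Condition 3: both difference metrics meet the second threshold.
    return (luma_jump >= first_threshold
            and m.forward_diff >= second_threshold
            and m.backward_diff >= second_threshold)
```

The luminance difference is signed, following the claim wording ("minus"), so a frame that is darker than its neighbour does not satisfy the test for a non-negative first threshold.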
20. An apparatus for processing a multimedia data, comprising:
a motion compensator configured to obtain at least one metric indicative of a difference between a selected frame and frames temporally adjacent to the selected frame in a plurality of video frames, said at least one metric comprising bi-directional motion information and luminance information;
a shot classifier configured to determine shot events associated with the selected frame based on said at least one metric, wherein the shot classifier is further configured to determine that the selected frame contains at least one camera flashlight if the selected frame is not an abrupt frame or a slow change frame, an average luminance of the selected frame minus an average luminance of a temporally adjacent frame of the selected frame is equal to or exceeds a first threshold value, and a forward difference metric and a backward difference metric associated with the selected frame are both equal to or exceed a second threshold value; and
an encoder configured to adaptively encode the selected frame based on the shot event.
21. An apparatus for processing multimedia data, comprising:
means for obtaining at least one metric indicative of a difference between a selected frame and frames temporally adjacent to the selected frame in a plurality of video frames, said at least one metric comprising bi-directional motion information and luminance difference information associated with the selected frame and the frames temporally adjacent to the selected frame;
means for determining a shot event associated with the selected frame based on said at least one metric, wherein said means for determining a shot event comprises means for classifying the selected frame as a camera flash scene if the selected frame is not an abrupt frame or a slow change frame, an average luminance of the selected frame minus an average luminance of a temporally adjacent frame of the selected frame is equal to or exceeds a first threshold value, and a forward difference metric and a backward difference metric associated with the selected frame are both equal to or exceed a second threshold value; and
means for adaptively encoding the selected frame based on the shot event.
Dependent claims: 22-34.
35. A non-transitory machine readable medium comprising instructions for processing multimedia data, wherein the instructions upon execution cause a machine to:
obtain at least one metric indicative of a difference between a selected frame and frames temporally adjacent to the selected frame in a plurality of video frames, said at least one metric comprising bi-directional motion information and luminance difference information associated with the selected frame and the frames temporally adjacent to the selected frame;
determine a shot event associated with the selected frame based on said at least one metric, wherein the determination of a shot event comprises determining that the selected frame contains at least one camera flashlight if the selected frame is not an abrupt frame or a slow change frame, an average luminance of the selected frame minus an average luminance of a temporally adjacent frame of the selected frame is equal to or exceeds a first threshold value, and a forward difference metric and a backward difference metric associated with the selected frame are both equal to or exceed a second threshold value; and
adaptively encode the selected frame based on the shot event.
36. A processor for processing multimedia data, said processor comprising a configuration to
obtain at least one metric indicative of a difference between a selected frame and frames temporally adjacent to the selected frame in a plurality of video frames, said at least one metric comprising bi-directional motion information and luminance difference information associated with the selected frame and the frames temporally adjacent to the selected frame;
determine a shot event associated with the selected frame based on said at least one metric, wherein said processor comprises a further configuration to determine the shot event based on determining that the selected frame contains at least one camera flashlight if the selected frame is not an abrupt frame or a slow change frame, an average luminance of the selected frame minus an average luminance of a temporally adjacent frame of the selected frame is equal to or exceeds a first threshold value, and a forward difference metric and a backward difference metric associated with the selected frame are both equal to or exceed a second threshold value; and
adaptively encode the selected frame based on the shot event.
Specification