Scene change detection around a set of seed points in media data
First Claim
1. A method for scene change detection in media data, comprising:
deriving a set of filtered values from the media data;
identifying a plurality of seed time points among time points at which the set of filtered values derived from the media data reach extremum values;
determining one or more statistical patterns of media features in a plurality of time-wise intervals around the plurality of seed time points of the media data using one or more types of features extractable from the media data, at least one of the one or more types of features comprising a type of features that captures structural properties, tonality including harmony and melody, timbre, rhythm, loudness, stereo mix, or a quantity of sound sources as related to the media data;
detecting, based on the one or more statistical patterns, a plurality of beginning scene change points and a plurality of ending scene change points in the media data for the plurality of seed time points in the media data;
wherein the method is performed by one or more computing devices.
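The first two steps of the claim (deriving filtered values, then picking seed time points where those values reach extrema) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the input is a hypothetical per-frame loudness curve, the filter is a centered moving average, and "extremum" is read as a local maximum. The names `moving_average` and `seed_time_points` are likewise hypothetical.

```python
from typing import List, Sequence


def moving_average(values: Sequence[float], width: int) -> List[float]:
    """Smooth a per-frame feature curve with a centered moving average.

    Assumption: a moving average stands in for whatever filter is
    actually applied to the media data.
    """
    half = width // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        window = values[lo:hi]
        smoothed.append(sum(window) / len(window))
    return smoothed


def seed_time_points(filtered: Sequence[float]) -> List[int]:
    """Return frame indices where the filtered curve has a local maximum.

    Assumption: only maxima are treated as extrema; minima could be
    handled the same way with the comparisons reversed.
    """
    seeds = []
    for i in range(1, len(filtered) - 1):
        if filtered[i - 1] < filtered[i] >= filtered[i + 1]:
            seeds.append(i)
    return seeds


# Hypothetical loudness values, one per frame.
loudness = [0.1, 0.2, 0.9, 0.3, 0.1, 0.2, 0.8, 1.0, 0.4, 0.1]
filtered = moving_average(loudness, width=3)
seeds = seed_time_points(filtered)
print(seeds)  # [2, 7]
```

With this toy input the two loud passages each yield one seed. A real system would presumably operate on time stamps rather than frame indices and might combine several filtered feature curves.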
Abstract
Techniques for scene change detection around seed points in media data are provided. Media features of many different types may be extracted from the media data. One or more statistical patterns of media features in a plurality of time-wise intervals around a plurality of seed time points of the media data may be determined using one or more types of features extractable from the media data. At least one of the one or more types of features comprises a type of features that captures structural properties, tonality including harmony and melody, timbre, rhythm, loudness, stereo mix, or a quantity of sound sources as related to the media data. A plurality of beginning scene change points and a plurality of ending scene change points in the media data may be detected, based on the one or more statistical patterns, for the plurality of seed time points in the media data.
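As a rough illustration of detecting a beginning and an ending scene change point around each seed, the sketch below grows an interval outward from a seed while a simple statistic of a media feature stays close to its value at the seed. This is only one possible reading under stated assumptions: the statistic is a windowed mean of a single hypothetical feature, and the function names, `half_window`, and `threshold` are all invented for the example rather than drawn from the patent.

```python
from statistics import mean
from typing import List, Sequence, Tuple


def scene_bounds(
    features: Sequence[float],
    seed: int,
    half_window: int = 1,
    threshold: float = 0.3,
) -> Tuple[int, int]:
    """Return (begin, end) frame indices of the scene around one seed.

    Assumption: the "statistical pattern" is the mean of the feature in
    a short window, and a scene change is declared where that mean
    departs from the mean at the seed by more than `threshold`.
    """
    n = len(features)

    def window_mean(center: int) -> float:
        lo = max(0, center - half_window)
        hi = min(n, center + half_window + 1)
        return mean(features[lo:hi])

    reference = window_mean(seed)

    # Walk backwards until the local statistic no longer matches the seed.
    begin = seed
    while begin > 0 and abs(window_mean(begin - 1) - reference) <= threshold:
        begin -= 1

    # Walk forwards in the same way to find the ending point.
    end = seed
    while end < n - 1 and abs(window_mean(end + 1) - reference) <= threshold:
        end += 1

    return begin, end


def detect_scenes(
    features: Sequence[float], seeds: Sequence[int]
) -> List[Tuple[int, int]]:
    """Apply scene_bounds to every seed time point."""
    return [scene_bounds(features, s) for s in seeds]


# Hypothetical feature curve with one sustained high-energy passage.
features = [0.1, 0.1, 0.1, 0.8, 0.9, 0.8, 0.9, 0.1, 0.1, 0.1]
print(detect_scenes(features, [4]))  # [(3, 6)]
```

The abstract lists several feature types (for example timbre, rhythm, or loudness); a fuller version would likely compare multi-dimensional feature statistics rather than a single scalar mean.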
69 Citations
20 Claims
1. A method for scene change detection in media data, comprising:
deriving a set of filtered values from the media data;
identifying a plurality of seed time points among time points at which the set of filtered values derived from the media data reach extremum values;
determining one or more statistical patterns of media features in a plurality of time-wise intervals around the plurality of seed time points of the media data using one or more types of features extractable from the media data, at least one of the one or more types of features comprising a type of features that captures structural properties, tonality including harmony and melody, timbre, rhythm, loudness, stereo mix, or a quantity of sound sources as related to the media data;
detecting, based on the one or more statistical patterns, a plurality of beginning scene change points and a plurality of ending scene change points in the media data for the plurality of seed time points in the media data;
wherein the method is performed by one or more computing devices.
Dependent claims: 2-19.
20. A non-transitory computer readable storage medium, comprising a set of instructions, which when executed by a processing or computing device cause, control or program the device to execute or perform a process, wherein the process comprises the steps of:
deriving a set of filtered values from media data;
identifying a plurality of seed time points among time points at which the set of filtered values derived from the media data reach extremum values;
determining one or more statistical patterns of media features in a plurality of time-wise intervals around the plurality of seed time points of the media data using one or more types of features extractable from the media data, at least one of the one or more types of features comprising a type of features that captures structural properties, tonality including harmony and melody, timbre, rhythm, loudness, stereo mix, or a quantity of sound sources as related to the media data;
detecting, based on the one or more statistical patterns, a plurality of beginning scene change points and a plurality of ending scene change points in the media data for the plurality of seed time points in the media data.
Specification