Systems and methods for semantically classifying and normalizing shots in video
First Claim
1. A method, comprising:
- receiving a video file comprising a plurality of frames, each frame of the plurality of frames associated with a plurality of scene classification scores, each scene classification score of the plurality of scene classification scores representing a probability that a respective frame includes content associated with a scene category of a plurality of scene categories;
determining, based on analyzing the plurality of scene classification scores associated with one or more frames of the plurality of frames, a start time and an end time of each scene of a plurality of scenes;
for at least one scene of the plurality of scenes;
determining that one or more frames from the at least one scene includes content associated with one or more scene categories;
assigning, based on the one or more scene categories associated with the one or more frames, one or more scene categories from the plurality of scene categories to the at least one scene;
storing, in one or more indexes, data indicating (a) the one or more scene categories assigned to the at least one scene of the plurality of scenes, and (b) a start time and an end time for the at least one scene of the plurality of scenes.
10 Assignments
0 Petitions
Accused Products
Abstract
The present disclosure relates to systems and methods for classifying videos based on video content. For a given video file including a plurality of frames, a subset of frames is extracted for processing. Frames that are too dark, blurry, or otherwise poor classification candidates are discarded from the subset. Generally, material classification scores that describe type of material content likely included in each frame are calculated for the remaining frames in the subset. The material classification scores are used to generate material arrangement vectors that represent the spatial arrangement of material content in each frame. The material arrangement vectors are subsequently classified to generate a scene classification score vector for each frame. The scene classification results are averaged (or otherwise processed) across all frames in the subset to associate the video file with one or more predefined scene categories related to overall types of scene content of the video file.
26 Citations
27 Claims
-
1. A method, comprising:
-
receiving a video file comprising a plurality of frames, each frame of the plurality of frames associated with a plurality of scene classification scores, each scene classification score of the plurality of scene classification scores representing a probability that a respective frame includes content associated with a scene category of a plurality of scene categories; determining, based on analyzing the plurality of scene classification scores associated with one or more frames of the plurality of frames, a start time and an end time of each scene of a plurality of scenes; for at least one scene of the plurality of scenes; determining that one or more frames from the at least one scene includes content associated with one or more scene categories; assigning, based on the one or more scene categories associated with the one or more frames, one or more scene categories from the plurality of scene categories to the at least one scene; storing, in one or more indexes, data indicating (a) the one or more scene categories assigned to the at least one scene of the plurality of scenes, and (b) a start time and an end time for the at least one scene of the plurality of scenes. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. One or more non-transitory storage media storing instructions which, when executed by one or more computing devices, cause:
-
receiving a video file comprising a plurality of frames, each frame of the plurality of frames associated with a plurality of scene classification scores, each scene classification score of the plurality of scene classification scores representing a probability that a respective frame includes content associated with a scene category of a plurality of scene categories; determining, based on analyzing the plurality of scene classification scores associated with one or more frames of the plurality of frames, a start time and an end time of each scene of a plurality of scenes; for at least one scene of the plurality of scenes; determining that one or more frames from the at least one scene includes content associated with one or more scene categories; assigning, based on the one or more scene categories associated with the one or more frames, one or more scene categories from the plurality of scene categories to the at least one scene; storing, in one or more indexes, data indicating (a) the one or more scene categories assigned to the at least one scene of the plurality of scenes, and (b) a start time and an end time for the at least one scene of the plurality of scenes. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. An apparatus comprising:
-
a subsystem, implemented at least partially in hardware, that receives a video file comprising a plurality of frames, each frame of the plurality of frames associated with a plurality of scene classification scores, each scene classification score of the plurality of scene classification scores representing a probability that a respective frame includes content associated with a scene category of a plurality of scene categories; a subsystem, implemented at least partially in hardware, that determines, based on analyzing the plurality of scene classification scores associated with one or more frames of the plurality of frames, a start time and an end time of each scene of a plurality of scenes; a subsystem, implemented at least partially in hardware, that for at least one scene of the plurality of scenes; determines, that one or more frames from the at least one scene includes content associated with one or more assigns, based on the one or more scene categories associated with the one or more frames, one or more scene categories from the plurality of scene categories to the at least one scene; a subsystem, implemented at least partially in hardware, that stores, in one or more indexes, data indicating (a) the one or more scene categories assigned to the at least one scene of the plurality of scenes, and (b) a start time and an end time for the at least one scene of the plurality of scenes. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
Specification