SYSTEMS AND METHODS FOR SEMANTICALLY CLASSIFYING SHOTS IN VIDEO
First Claim
1. A system for classifying videos based on video content, comprising:
- a processor;
one or more software modules including one or more classifiers;
a computer program product that includes a computer-readable medium that is usable by the processor, the medium having stored thereon a sequence of instructions associated with the one or more software modules that when executed by the processor causes the execution of the steps of;
receiving a video file, the video file including a plurality of frames;
extracting a subset of frames from the video file;
if one or more frames in the extracted subset of frames comprises a dark frame, discarding the one or more dark frames from the subset;
determining whether each frame in the extracted subset includes content associated with a general content category;
for each frame in the extracted subset that includes content associated with the general content category, generating a scene classification score vector for the frame via one or more scene classifiers, the scene classification score vector including one or more scene classification scores associated with one or more predefined scene categories within the general content category;
determining a representative scene classification score vector for the video file based on the generated scene classification score vectors for each extracted frame in the subset that includes content associated with the general content category; and
associating the video file with the one or more predefined scene categories based on the representative scene classification score vector.
10 Assignments
0 Petitions
Accused Products
Abstract
The present disclosure relates to systems and methods for classifying videos based on video content. For a given video file including a plurality of frames, a subset of frames is extracted for processing. Frames that are too dark, blurry, or otherwise poor classification candidates are discarded from the subset. Generally, material classification scores that describe type of material content likely included in each frame are calculated for the remaining frames in the subset. The material classification scores are used to generate material arrangement vectors that represent the spatial arrangement of material content in each frame. The material arrangement vectors are subsequently classified to generate a scene classification score vector for each frame. The scene classification results are averaged (or otherwise processed) across all frames in the subset to associate the video file with one or more predefined scene categories related to overall types of scene content of the video file.
-
Citations
71 Claims
-
1. A system for classifying videos based on video content, comprising:
-
a processor; one or more software modules including one or more classifiers; a computer program product that includes a computer-readable medium that is usable by the processor, the medium having stored thereon a sequence of instructions associated with the one or more software modules that when executed by the processor causes the execution of the steps of; receiving a video file, the video file including a plurality of frames; extracting a subset of frames from the video file; if one or more frames in the extracted subset of frames comprises a dark frame, discarding the one or more dark frames from the subset; determining whether each frame in the extracted subset includes content associated with a general content category; for each frame in the extracted subset that includes content associated with the general content category, generating a scene classification score vector for the frame via one or more scene classifiers, the scene classification score vector including one or more scene classification scores associated with one or more predefined scene categories within the general content category; determining a representative scene classification score vector for the video file based on the generated scene classification score vectors for each extracted frame in the subset that includes content associated with the general content category; and associating the video file with the one or more predefined scene categories based on the representative scene classification score vector. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method for classifying videos based on video content, comprising the steps of:
-
receiving a video file, the video file including a plurality of frames; extracting a subset of frames from the video file; if one or more frames in the extracted subset of frames comprises a dark frame, discarding the one or more dark frames from the subset; determining whether each frame in the extracted subset includes content associated with a general content category; for each frame in the extracted subset that includes content associated with the general content category, generating a scene classification score vector for the frame via one or more scene classifiers, the scene classification score vector including one or more scene classification scores associated with one or more predefined scene categories within the general content category; determining a representative scene classification score vector for the video file based on the generated scene classification score vectors for each extracted frame in the subset that includes content associated with the general content category; and labeling the video file according to the one or more predefined scene categories based on the representative scene classification score vector. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39)
-
-
40. A method for classifying videos based on video content, comprising the steps of:
-
receiving a video file, the video file including a plurality of frames, wherein each frame includes a plurality of pixels; extracting a set of frames from the video file; for each frame in the extracted set of frames, determining whether the frame comprises a poor classification frame; if one or more frames in the extracted set of frames comprises a poor classification frame, removing the one or more poor classification frames from the extracted set of frames; dividing each frame in the extracted set of frames into one or more segments, wherein each segment includes relatively uniform image content; extracting image features from each segment to form a feature vector associated with each segment; generating a material classification score vector for each segment via one or more material classifiers based on the feature vector associated with each segment, wherein each material classification score vector includes one or more material classification scores associated with one or more predefined material content categories; and assigning each material classification score vector associated with its respective segment to each pixel in each respective segment for each respective frame in the set of frames. - View Dependent Claims (41, 42, 43, 44, 45, 46, 47, 48, 49, 50)
-
-
51. A method for classifying a video file according to one or more scene classes, the video file including a plurality of frames, wherein each frame includes a plurality of pixels, and wherein each pixel is associated with a vector of material classification scores describing the material content in its respective frame, comprising the steps of:
-
(a) dividing each frame into a plurality of grid cells; (b) for each frame, retrieving the vector of material classification scores for each pixel in each grid cell; (c) for each grid cell, averaging the material classification scores across each pixel in the grid cell to form a material occurrence vector for the grid cell; (d) concatenating the material occurrence vectors for the plurality of grid cells in each frame to generate a material arrangement vector for each frame; (e) generating a scene classification score associated with each of the one or more scene classes for each frame in the video file via one or more scene classifiers based on the material arrangement vectors generated for each frame; (f) generating a representative scene classification score for the video file for each of the one or more scene classes based on the scene classification scores generated for each frame; and (g) if one or more of the representative scene classification scores is above a predetermined threshold value, labeling the video file according to the respective scene classes associated with the one or more scene classification scores that are above the predetermined threshold value. - View Dependent Claims (52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65)
-
-
66. A method for labeling videos based on video content, comprising the steps of:
-
receiving a video file, wherein the video file includes a plurality of frames; extracting a set of frames from the plurality of frames in the video file; for each frame in the extracted set of frames, calculating a probability that the frame includes content associated with a predefined scene category; determining a representative probability for the set of frames based on the calculated probabilities for each frame; and if the representative probability exceeds a predetermined threshold, associating the scene category with the video file. - View Dependent Claims (67, 68, 69, 70, 71)
-
Specification