COMPUTERIZED MACHINE LEARNING OF INTERESTING VIDEO SECTIONS
First Claim
1. A method comprising:
- receiving, by one or more computing devices, video data;
extracting, by at least one of the one or more computing devices, a plurality of features from the video data;
determining, by at least one of the one or more computing devices, a first set of feature values associated with the plurality of features, the first set of feature values for training a classifier and a scoring model;
determining, by at least one of the one or more computing devices, a second set of feature values based on applying the classifier to the video data;
training, by at least one of the one or more computing devices, the scoring model based on the first set of feature values and the second set of feature values, wherein the scoring model determines a desirability score associated with the video data.
3 Assignments
0 Petitions
Accused Products
Abstract
This disclosure describes techniques for training models from video data and applying the learned models to identify desirable video data. Video data may be labeled to indicate a semantic category and/or a score indicative of desirability. The video data may be processed to extract low and high level features. A classifier and a scoring model may be trained based on the extracted features. The classifier may estimate a probability that the video data belongs to at least one of the categories in a set of semantic categories. The scoring model may determine a desirability score for the video data. New video data may be processed to extract low and high level features, and feature values may be determined based on the extracted features. The learned classifier and scoring model may be applied to the feature values to determine a desirability score associated with the new video data.
141 Citations
20 Claims
-
1. A method comprising:
-
receiving, by one or more computing devices, video data; extracting, by at least one of the one or more computing devices, a plurality of features from the video data; determining, by at least one of the one or more computing devices, a first set of feature values associated with the plurality of features, the first set of feature values for training a classifier and a scoring model; determining, by at least one of the one or more computing devices, a second set of feature values based on applying the classifier to the video data; training, by at least one of the one or more computing devices, the scoring model based on the first set of feature values and the second set of feature values, wherein the scoring model determines a desirability score associated with the video data. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system comprising:
-
memory; one or more processors; and one or more modules stored in the memory and executable by the one or more processors, the one or more modules including; an extracting module configured for extracting features from video data and determining a first set of feature values based on the extracted features; and a ranking module configured for determining a desirability score for the video data, the ranking module including; a classifying module configured for applying a classifier to the first set of feature values to determine a second set of feature values; and a scoring module configured for applying a scoring model to the first set of feature values and the second set of feature values to determine a desirability score for the video data. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. One or more computer-readable storage media encoded with instructions that, when executed by a processor, perform acts comprising:
-
receiving video data including a plurality of video frames; extracting a plurality of features from individual video frames of the plurality of video frames to determine a first set of feature values associated with the individual video frames; applying a classifier to the first set of feature values to determine a second set of feature values associated with the individual video frames, wherein the second set of feature values represents probabilities that the individual video frames belong to at least one semantic category of a predefined set of semantic categories; and applying a scoring model to the first set of feature values and the second set of feature values to determine a desirability score associated with the individual video frames. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification