Video search engine using joint categorization of video clips and queries based on multiple modalities
First Claim
1. A method comprising:
- generating a first classification model for determining whether a video belongs to a category;
generating a second classification model for determining whether the video belongs to the category, the first classification model being based on a different modality than the second classification model; and
generating a fusion model that uses the results of the first classification model and the second classification model for determining whether the video belongs to the category.
3 Assignments
0 Petitions
Accused Products
Abstract
A method comprises generating a first classification model, e.g., metadata-based, for determining whether a video belongs to a category; generating a second classification model, e.g., content-based, for determining whether the video belongs to a category, the first classification model and second classification model being based on different modalities; and generating a fusion model that blends the categorization results of the models. Each classification model may classify the video to multiple categories. During operation, a method obtains a video; uses the first classification model, the second classification model and the fusion model to determine whether the video belongs to a category; and indexes the video in a video index. The method may enable selection of a category corresponding to the video search results. The category may be identified based on a query profile, which may be learned from users'"'"' query logs or popular queries and click history.
-
Citations
31 Claims
-
1. A method comprising:
-
generating a first classification model for determining whether a video belongs to a category;
generating a second classification model for determining whether the video belongs to the category, the first classification model being based on a different modality than the second classification model; and
generating a fusion model that uses the results of the first classification model and the second classification model for determining whether the video belongs to the category. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system comprising:
-
a first learning engine for generating a first classification model for determining whether a video belongs to a category;
a second leaning engine for generating a second classification model for determining whether the video belongs to the category, the first classification model being based on a different modality than the second classification model; and
a third learning engine for generating a fusion model that uses the results of the first classification model and the second classification model for determining whether the video belongs to the category. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A method comprising:
-
obtaining a video clip;
using a first classification model to determine whether the video belongs to a category;
using a second classification model to determine whether the video belongs to the category, the first classification model being based on a different modality than the second classification model;
using a fusion model that uses the results of the first classification model and the second classification model to determine whether the video clip belongs to the category; and
indexing the video based on the result of the fusion model in a video index. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A system comprising:
-
a first classification model for determining whether a video clip belongs to a category;
a second classification model for determining whether the video clip belongs to the category, the first classification model being based on a different modality than the second classification model;
a fusion model that uses the results of the first classification model and the second classification model for determining whether the video belongs to the category; and
an index building component for indexing the video based on the result of the fusion model in a video index. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31)
-
Specification