Learning multimedia semantics from large-scale unstructured data
First Claim
1. A method comprising:
- extracting, by at least one or more computing devices, visual features from images of a corpus of images;
arranging, by the at least one or more computing devices, the images in clusters based at least in part on similarities of the visual features;
calculating, by the at least one or more computing devices, at least two relevance features, including;
first relevance features representing distribution characteristics of distances between pairs of images in a same cluster; and
second relevance features representing distribution characteristics of distances between different clusters of images; and
refining, by the at least one or more computing devices, the corpus by removing one or more images from the corpus based in part on the at least two relevance features to create a refined corpus.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for learning topic models from unstructured data and applying the learned topic models to recognize semantics for new data items are described herein. In at least one embodiment, a corpus of multimedia data items associated with a set of labels may be processed to generate a refined corpus of multimedia data items associated with the set of labels. Such processing may include arranging the multimedia data items in clusters based on similarities of extracted multimedia features and generating intra-cluster and inter-cluster features. The intra-cluster and the inter-cluster features may be used for removing multimedia data items from the corpus to generate the refined corpus. The refined corpus may be used for training topic models for identifying labels. The resulting models may be stored and subsequently used for identifying semantics of a multimedia data item input by a user.
-
Citations
20 Claims
-
1. A method comprising:
-
extracting, by at least one or more computing devices, visual features from images of a corpus of images; arranging, by the at least one or more computing devices, the images in clusters based at least in part on similarities of the visual features; calculating, by the at least one or more computing devices, at least two relevance features, including; first relevance features representing distribution characteristics of distances between pairs of images in a same cluster; and second relevance features representing distribution characteristics of distances between different clusters of images; and refining, by the at least one or more computing devices, the corpus by removing one or more images from the corpus based in part on the at least two relevance features to create a refined corpus. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method comprising:
-
receiving, by at least one or more computing devices, a corpus of images associated with a set of labels; extracting, by the at least one or more computing devices, visual features from the images; arranging, by the at least one or more computing devices;
the images into a plurality of clusters based at least in part on similarities of the visual features;determining, by the at least one or more computing devices, at least two relevance features associated with individual clusters of the plurality of clusters, wherein; first relevance features of the at least two relevance features are based on pairs of images in a first cluster of the plurality of clusters; the first cluster is associated with a first label of the set of labels; and second relevance features of the at least two relevance features are based on the first cluster and at least one second cluster associated with a second label of the set of labels; processing, by the at least one or more computing devices, the corpus of images to generate a refined corpus of images associated with the set of labels based in part on the at least two relevance features; and training, by the at least one or more computing devices, a set of models for identifying individual labels of the set of labels based at least in part on the extracted visual features. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13)
-
-
14. A system comprising:
-
memory; one or more processors; and one or more modules stored in the memory and executable by the one or more processors, the one or more modules including; a labeling module configured to learn a topic model associated with one or more based at least in part on; extracting visual features from a corpus of images associated with the one or more labels; and processing the corpus of images based in part on at least two relevance features; first relevance features of the at least two relevance features representing distribution characteristics of distances between pairs of images in a same cluster; and second relevance features of the at least two relevance features representing distribution characteristics of distances between different clusters of images. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification