Systems and methods for creating a visual vocabulary
First Claim
1. A method for building a visual vocabulary, the method comprising:
- generating visual words based on a set of features, wherein the visual words are defined in a higher-dimensional space;
projecting the visual words from the higher-dimensional space to a first lower-dimensional space, thereby producing projections of the visual words in the first lower-dimensional space;
generating a first collection of buckets in the first lower-dimensional space based on the projections of the visual words in the first lower-dimensional space;
projecting the visual words from the higher-dimensional space to a second lower-dimensional space, thereby producing projections of the visual words in the second lower-dimensional space;
generating a second collection of buckets in the second lower-dimensional space based on the projections of the visual words in the second lower-dimensional space; and
iteratively selecting a sub-collection of buckets from the first collection of buckets and from the second collection of buckets, wherein bucket selection during any iteration after an initial iteration is based at least in part on feedback from previously selected buckets.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods for generating a visual vocabulary build a plurality of visual words via unsupervised learning on set of features of a given type; decompose one or more visual words to a collection of lower-dimensional buckets; generate labeled image representations based on the collection of lower dimensional buckets and labeled images, wherein labels associated with an image are associated with a respective representation of the image; and iteratively select a sub-collection of buckets from the collection of lower-dimensional buckets based on the labeled image representations, wherein bucket selection during any iteration after an initial iteration is based at least in part on feedback from previously selected buckets.
-
Citations
18 Claims
-
1. A method for building a visual vocabulary, the method comprising:
-
generating visual words based on a set of features, wherein the visual words are defined in a higher-dimensional space; projecting the visual words from the higher-dimensional space to a first lower-dimensional space, thereby producing projections of the visual words in the first lower-dimensional space; generating a first collection of buckets in the first lower-dimensional space based on the projections of the visual words in the first lower-dimensional space; projecting the visual words from the higher-dimensional space to a second lower-dimensional space, thereby producing projections of the visual words in the second lower-dimensional space; generating a second collection of buckets in the second lower-dimensional space based on the projections of the visual words in the second lower-dimensional space; and iteratively selecting a sub-collection of buckets from the first collection of buckets and from the second collection of buckets, wherein bucket selection during any iteration after an initial iteration is based at least in part on feedback from previously selected buckets. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. One or more non-transitory computer-readable media storing instructions that, when executed by one or more computing devices, cause the computing devices to perform operations comprising:
-
clustering features from one or more images to form feature clusters, wherein the features and the feature clusters are defined in a higher-dimensional space; projecting the feature clusters from the higher-dimensional space to a first lower-dimensional space to form projections of the feature clusters in the first lower-dimensional space; projecting the feature clusters from the higher-dimensional space to a second lower-dimensional space to form projections of the feature clusters in the second lower-dimensional space; and generating buckets in the first lower-dimensional space and in the second lower-dimensional space based on the projections of the feature clusters in the first lower-dimensional space and on the projections of the feature clusters in the second lower-dimensional space. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A device for building a visual vocabulary, the device comprising:
-
a computer memory; and one or more processors that are coupled to the computer memory and that are configured to cause the device to generate visual words based on a plurality of features, wherein the visual words are defined in a higher-dimensional space, project the visual words from the higher-dimensional space to a first lower-dimensional space, thereby producing projections of the visual words in the first lower-dimensional space, generate a first collection of buckets in the first lower-dimensional space based on the projections of the visual words in the first lower-dimensional space, project the visual words from the higher-dimensional space to a second lower-dimensional space, thereby producing projections of the visual words in the second lower-dimensional space, generate a second collection of buckets in the second lower-dimensional space based on the projections of the visual words in the second lower-dimensional space, and iteratively select a sub-collection of buckets from the first collection of buckets and from the second collection of buckets, wherein bucket selection during any iteration after an initial iteration is based at least in part on feedback from previously selected buckets. - View Dependent Claims (17, 18)
-
Specification