Generic visual categorization method and system
First Claim
Patent Images
1. A method for assigning one of a plurality of classes to an input image, comprising one or more processors implementing the following actions:
- identifying by a key patch detector a plurality of key-patches in the input image;
computing by a feature description module an unordered feature vector for each of the plurality of key-patches;
computing by a multi-histogram computation module a first histogram of fixed length feature vectors for each of the plurality of classes using the plurality of unordered feature vectors computed;
computing by the multi-histogram computation module a second histogram for each of the plurality of classes using the plurality of unordered feature vectors computed and the first histogram of fixed length feature vectors computed; and
,assigning by a classifier training module at least one of the plurality of classes to the input image using the plurality of histograms computed as input to a classifier;
wherein the first and second histograms are iteratively estimated using an Expectation-Maximization based calculation for mixture weight, mean vector, and covariance matrix parameters of the feature vector, wherein the parameters correspond to relative frequencies, averages, and variations of the visual words.
6 Assignments
0 Petitions
Accused Products
Abstract
Generic visual categorization methods complement a general vocabulary with adapted vocabularies that are class specific. Images to be categorized are characterized within different categories through a histogram indicating whether the image is better described by the general vocabulary or the class-specific adapted vocabulary.
52 Citations
20 Claims
-
1. A method for assigning one of a plurality of classes to an input image, comprising one or more processors implementing the following actions:
-
identifying by a key patch detector a plurality of key-patches in the input image; computing by a feature description module an unordered feature vector for each of the plurality of key-patches; computing by a multi-histogram computation module a first histogram of fixed length feature vectors for each of the plurality of classes using the plurality of unordered feature vectors computed; computing by the multi-histogram computation module a second histogram for each of the plurality of classes using the plurality of unordered feature vectors computed and the first histogram of fixed length feature vectors computed; and
,assigning by a classifier training module at least one of the plurality of classes to the input image using the plurality of histograms computed as input to a classifier; wherein the first and second histograms are iteratively estimated using an Expectation-Maximization based calculation for mixture weight, mean vector, and covariance matrix parameters of the feature vector, wherein the parameters correspond to relative frequencies, averages, and variations of the visual words. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An apparatus for processing an assignment of one of a plurality of classes to an input image, comprising:
-
a memory which stores; a key-patch detector identifying a plurality of key-patches in the input image; a feature description module for computing an unordered feature vector for each of the plurality of key-patches; a multi-histogram computation module computing a first histogram of fixed length feature vectors for each of the plurality of classes using the plurality of unordered feature vectors computed; the multi-histogram computation module computing a second histogram of each of the plurality of classes using the plurality of unordered feature vectors computed and the first histogram of fixed length feature vectors computed; wherein the first and second histograms are iteratively estimated using an Expectation-Maximization based calculation for mixture weight, mean vector, and covariance matrix parameters of the feature vector, wherein the parameters correspond to relative frequencies, averages, and variations of the visual words a classifier for assigning at least one of the plurality of classes to the input image using the plurality of histograms computed as input to the classifier; and a processor in communication with the memory configured for processing the image with the key-patch detector, the feature description module, the multi-histogram computation module, and the classifier. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A method for a computer-based system for training a classifier, comprising one or more processors implementing the following actions:
-
identifying by a key patch detector key-patches in images of a plurality of class training sets; computing by a feature description module feature vectors for the identified key-patches; computing by a multi-histogram computation module a general vocabulary by clustering the computed feature vectors; for each of a plurality of classes, computing by a multi-histogram computation module an adapted vocabulary using the general vocabulary; comparing the adapted vocabulary to the general vocabulary; storing by a memory the adapted vocabulary if it is changed as compared to the general vocabulary; computing by a class-specific histogram calculator a histogram for each of the plurality of classes; training by a classifier training module the classifier using the histograms for each of the plurality of classes; wherein the first and second histograms are iteratively estimated using an Expectation-Maximization based calculation for mixture weight, mean vector, and covariance matrix parameters of the feature vector, wherein the parameters correspond to relative frequencies, averages, and variations of the visual words. - View Dependent Claims (18, 19, 20)
-
Specification