Local feature representation for image recognition
First Claim
1. A computer-implemented method, comprising:
- receiving a digital image including a plurality of image features;
dividing the digital image into image patches, such that each image patch represents only a portion of the received digital image;
generating an image patch vector for each image patch;
comparing each image patch vector to Gaussian mixture components of a Gaussian Mixture Model (GMM), each Gaussian mixture component being a vector, thereby generating a similarity score for each image patch vector;
for each Gaussian mixture component, eliminating one or more image patch vectors associated with a similarity score that is below a given threshold;
concatenating a plurality of remaining image patch vectors of all the Gaussian mixture components to generate a final image feature vector that represents the plurality of image features in the received digital image; and
categorizing the digital image using the final image feature vector.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques are disclosed for image feature representation. The techniques exhibit discriminative power that can be used in any number of classification tasks, and are particularly effective with respect to fine-grained image classification tasks. In an embodiment, a given image to be classified is divided into image patches. A vector is generated for each image patch. Each image patch vector is compared to the Gaussian mixture components (each mixture component is also a vector) of a Gaussian Mixture Model (GMM). Each such comparison generates a similarity score for each image patch vector. For each Gaussian mixture component, the image patch vectors associated with a similarity score that is too low are eliminated. The selectively pooled vectors from all the Gaussian mixture components are then concatenated to form the final image feature vector, which can be provided to a classifier so the given input image can be properly categorized.
16 Citations
19 Claims
-
1. A computer-implemented method, comprising:
-
receiving a digital image including a plurality of image features; dividing the digital image into image patches, such that each image patch represents only a portion of the received digital image; generating an image patch vector for each image patch; comparing each image patch vector to Gaussian mixture components of a Gaussian Mixture Model (GMM), each Gaussian mixture component being a vector, thereby generating a similarity score for each image patch vector; for each Gaussian mixture component, eliminating one or more image patch vectors associated with a similarity score that is below a given threshold; concatenating a plurality of remaining image patch vectors of all the Gaussian mixture components to generate a final image feature vector that represents the plurality of image features in the received digital image; and categorizing the digital image using the final image feature vector. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A non-transient computer program product having instructions encoded thereon that when executed by one or more processors causes a process to be carried out, the process comprising:
-
dividing a digital image into image patches, each of which represents only a portion of the digital image, wherein the digital image includes a plurality of features; generating an image patch vector for each image patch; comparing each image patch vector to Gaussian mixture components of a Gaussian Mixture Model (GMM), thereby generating a similarity score for each image patch vector; for each Gaussian mixture component, eliminating one or more image patch vectors associated with a similarity score that is below a given threshold; and concatenating a plurality of remaining image patch vectors of all the Gaussian mixture components to generate a final image feature vector that represents the plurality of features in the digital image; and categorizing the digital image using the final image feature vector. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A computing system, comprising:
-
an electronic memory for storing executable instructions; and a processor configured to execute the instructions to; divide a digital image into image patches, each of which represents only a portion of the digital image, wherein the digital image includes a plurality of features; generate an image patch vector for each image patch; compare each image patch vector to Gaussian mixture components of a Gaussian Mixture Model (GMM), each Gaussian mixture component being a vector, thereby generating a similarity score for each image patch vector; for each Gaussian mixture component, eliminate one or more image patch vectors associated with a similarity score that is below a given threshold; concatenate a plurality of remaining image patch vectors of all the Gaussian mixture components to generate a final image feature vector that represents the plurality of features in the digital image; and categorize the digital image using the final image feature vector. - View Dependent Claims (15, 16, 17, 18, 19)
-
Specification