Clustering search results based on image composition
First Claim
1. A computer-implemented method, comprising:
- training a computer-operated convolutional neural network to recognize an object in a region of an image as salient using feature descriptor vectors obtained from extracted features of each saliency region of a training image;
for each image in a set of images, determining a compositional vector representing one or more objects and corresponding locations within the image using the trained computer-operated convolutional neural network;
providing each image through a clustering algorithm to produce one or more clusters based on compositional similarity, wherein the clustering algorithm maps each image to a cluster representing one of a plurality of predetermined compositional classes;
providing images from the set of images clustered by composition, the images including a different listing of images for each of the one or more clusters; and
transmitting, from a server to a client device for display by the client device, a set of search results responsive to a user search query, the set of search results including a prioritized listing of the images from each cluster of compositional similarity identified for display for a respective composition.
1 Assignment
0 Petitions
Accused Products
Abstract
Various aspects of the subject technology relate to systems, methods, and machine-readable media for clustering search results based on image composition. A system may, for each image in a set of images, determine a compositional vector representing one or more objects and corresponding locations within the image using a trained computer-operated convolutional neural network. The system may provide each image through a clustering algorithm to produce one or more clusters based on compositional similarity. The system may provide images from the set of images clustered by composition, in which the images include a different listing of images for each of the one or more clusters. The system may provide a prioritized listing of images responsive to a user search query, in which the prioritized listing of images includes a different listing of images for each cluster of compositional similarity based on the metadata of each image associated with the cluster.
26 Citations
20 Claims
-
1. A computer-implemented method, comprising:
-
training a computer-operated convolutional neural network to recognize an object in a region of an image as salient using feature descriptor vectors obtained from extracted features of each saliency region of a training image; for each image in a set of images, determining a compositional vector representing one or more objects and corresponding locations within the image using the trained computer-operated convolutional neural network; providing each image through a clustering algorithm to produce one or more clusters based on compositional similarity, wherein the clustering algorithm maps each image to a cluster representing one of a plurality of predetermined compositional classes; providing images from the set of images clustered by composition, the images including a different listing of images for each of the one or more clusters; and transmitting, from a server to a client device for display by the client device, a set of search results responsive to a user search query, the set of search results including a prioritized listing of the images from each cluster of compositional similarity identified for display for a respective composition. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system, comprising:
-
one or more processors; and a computer-readable storage medium coupled to the one or more processors, the computer-readable storage medium including instructions that, when executed by the one or more processors, cause the one or more processors to; train a computer-operated convolutional neural network to recognize an object in a region of an image as salient using feature descriptor vectors obtained from extracted features of each saliency region of a training image; for each image in a set of images, determine a compositional vector representing one or more objects and corresponding locations within the image using the computer-operated convolutional neural network; provide each image through a clustering algorithm to produce one or more clusters based on compositional similarity, wherein the clustering algorithm maps each image to a cluster representing one of a plurality of predetermined compositional classes; provide images from the set of images clustered by composition, the images including a different listing of images for each of the one or more clusters; store a metadata of each image in an image collection, the metadata of each image indicating a compositional class for the image; and transmit, from a server to a client device for display by the client device, a prioritized listing of images responsive to a user search query, the prioritized listing of images including a different listing of images for each cluster of compositional similarity based on the metadata of each image associated with the cluster identified for display for a respective composition. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
-
20. A computer-implemented method, comprising:
-
receiving, over a transmission at a server from a client device, a user input via an application on the client device to initiate an image search, the user input indicating one or more queries that define a specific composition for an image; generating, in response to the user input, an image search query from the user input; providing, for transmission, the image search query over a connection to the server, the server including an image search service that obtains a set of images responsive to the image search query based on a cosine similarity between a compositional vector associated with the image search query and one or more compositional vectors of corresponding images from an image collection, the image search service clustering the set of images using a trained computer-operated convolutional neural network configured to recognize an object in a region of an image as salient using feature descriptor vectors obtained from extracted features of each saliency region of a training image, and based on composition similarity using a clustering algorithm mapping each image to a cluster representing one of a plurality of predetermined compositional classes; and receiving a set of search results responsive to the image search query from the server, the set of search results including a prioritized listing of images identified for display by the client device for a respective composition.
-
Specification