Clustering queries for image search
First Claim
1. A computer-implemented method comprising:
- obtaining a pair of search queries including (i) a first search query and (ii) a second search query;
obtaining (i) a first set of multiple images that are identified as responsive to the first search query, and (ii) a second set of multiple images that are identified as responsive to the second search query;
determining, for each pair of images that includes (i) a first image from the first set of multiple images that are identified as responsive to the first search query and (ii) a second image from the second set of multiple images that are identified as responsive to the second search query, an image distance between the first image of the pair and the second image of the pair based on information associated with one or more pixels of the first image of the pair and information associated with one or more corresponding pixels of the second image of the pair;
determining a measure of similarity between (i) the first set of multiple images that are identified as responsive to the first search query and (ii)the second set of multiple images that are identified as responsive to the second search query, based at least on the respective image distances for each pair of images that includes (i) a first image from the first set of multiple images that are identified as responsive to the first search query and (ii) a second image from the second set of multiple images that are identified as responsive to the second search query;
determining that the measure of similarity between (i) the first set of multiple images that are identified as responsive to the first search query and (ii) the second set of multiple images that are identified as responsive to the second search query satisfies a similarity threshold; and
in response to determining that the measure of similarity between (i) the first set of multiple images that are identified as responsive to the first search query and (ii) the second set of multiple images that are identified as responsive to the second search query satisfies the similarity threshold, assigning the pair of search queries to a cluster of search queries.
2 Assignments
0 Petitions
Accused Products
Abstract
Aspects of the subject matter described herein relate to functions used for retrieving image results based on search queries. More specifically, image search queries can be pre-grouped or classified based on visual and semantic similarity. For example, a pairwise image similarity value for a pair of queries can be computed based on one or more of the sum of all of the overlapping the image results, the sum of the image distances between all of the pairs of images in the image results, and the rank of each of the images in the image results. The pairwise image similarity values can then be used to generate image query clusters. Each image query clusters can include a set of queries with high pairwise image similarity values. In some examples, a distance function can be determined for each image query cluster. This data can be used to provide image results.
11 Citations
14 Claims
-
1. A computer-implemented method comprising:
-
obtaining a pair of search queries including (i) a first search query and (ii) a second search query; obtaining (i) a first set of multiple images that are identified as responsive to the first search query, and (ii) a second set of multiple images that are identified as responsive to the second search query; determining, for each pair of images that includes (i) a first image from the first set of multiple images that are identified as responsive to the first search query and (ii) a second image from the second set of multiple images that are identified as responsive to the second search query, an image distance between the first image of the pair and the second image of the pair based on information associated with one or more pixels of the first image of the pair and information associated with one or more corresponding pixels of the second image of the pair; determining a measure of similarity between (i) the first set of multiple images that are identified as responsive to the first search query and (ii)the second set of multiple images that are identified as responsive to the second search query, based at least on the respective image distances for each pair of images that includes (i) a first image from the first set of multiple images that are identified as responsive to the first search query and (ii) a second image from the second set of multiple images that are identified as responsive to the second search query; determining that the measure of similarity between (i) the first set of multiple images that are identified as responsive to the first search query and (ii) the second set of multiple images that are identified as responsive to the second search query satisfies a similarity threshold; and in response to determining that the measure of similarity between (i) the first set of multiple images that are identified as responsive to the first search query and (ii) the second set of multiple images that are identified as responsive to the second search query satisfies the similarity threshold, assigning the pair of search queries to a cluster of search queries. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A system comprising:
-
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; obtaining a pair of search queries including (i) a first search query and (ii) a second search query; obtaining (i) a first set of multiple images that are identified as responsive to the first search query, and (ii) a second set of multiple images that are identified as responsive to the second search query; determining, for each pair of images that includes (i) a first image from the first set of multiple images that are identified as responsive to the first search query and (ii) a second image from the second set of multiple images that are identified as responsive to the second search query, an image distance between the first image of the pair and the second image of the pair based on information associated with one or more pixels of the first image of the pair and information associated with one or more corresponding pixels of the second image of the pair; determining a measure of similarity between (i) the first set of multiple images that are identified as responsive to the first search query and (ii) the second set of multiple images that are identified as responsive to the second search query, based at least on the respective image distances for each pair of images that includes (i) a first image from the first set of multiple images that are identified as responsive to the first search query and (ii) a second image from the second set of multiple images that are identified as responsive to the second search query; determining that the measure of similarity between (i) the first set of multiple images that are identified as responsive to the first search query and (ii) the second set of multiple images that are identified as responsive to the second search query satisfies a similarity threshold; and in response to determining that the measure of similarity between (i) the first set of multiple images that are identified as responsive to the first search query and (ii) the second set of multiple images that are identified as responsive to the second search query satisfies the similarity threshold, assigning the pair of search queries to a cluster of search queries. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising
obtaining a pair of search queries including (i) a first search query and (ii) a second search query; -
obtaining (i) a first set of multiple images that are identified as responsive to the first search query, and (ii) a second set of multiple images that are identified as responsive to the second search query; determining, for each pair of images that includes (i) a first image from the first set of multiple images that are identified as responsive to the first search query and (ii) a second image from the second set of multiple images that are identified as responsive to the second search query, an image distance between the first image of the pair and the second image of the pair based on information associated with one or more pixels of the first image of the pair and information associated with one or more corresponding pixels of the second image of the pair; determining a measure of similarity between (i) the first set of multiple images that are identified as responsive to the first search query and (ii) the second set of multiple images that are identified as responsive to the second search query, based at least on the respective image distances for each pair of images that includes (i) a first image from the first set of multiple images that are identified as responsive to the first search query and (ii) a second image from the second set of multiple images that are identified as responsive to the second search query; determining that the measure of similarity between (i) the first set of multiple images that are identified as responsive to the first search query and (ii) the second set of multiple images that are identified as responsive to the second search query satisfies a similarity threshold; and in response to determining that the measure of similarity between (i) the first set of multiple images that are identified as responsive to the first search query and (ii) the second set of multiple images that are identified as responsive to the second search query satisfies the similarity threshold, assigning the pair of search queries to a cluster of search queries. - View Dependent Claims (12, 13, 14)
-
Specification