Clustering queries for image search
Abstract
Aspects of the subject matter described herein relate to functions used for retrieving image results based on search queries. More specifically, image search queries can be pre-grouped or classified based on visual and semantic similarity. For example, a pairwise image similarity value for a pair of queries can be computed based on one or more of the sum of all of the overlapping image results, the sum of the image distances between all of the pairs of images in the image results, and the rank of each of the images in the image results. The pairwise image similarity values can then be used to generate image query clusters. Each image query cluster can include a set of queries with high pairwise image similarity values. In some examples, a distance function can be determined for each image query cluster. This data can be used to provide image results.
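As an illustration of how pairwise image similarity values might be turned into image query clusters, the following is a minimal Python sketch. The abstract does not say how clusters are formed from the pairwise values, so the fixed `threshold`, the single-link grouping via union-find, and the names `cluster_queries` and `similarity` are all assumptions made for this example.

```python
from itertools import combinations


def cluster_queries(queries, similarity, threshold=0.5):
    """Group queries whose pairwise similarity meets the threshold.

    `similarity` maps frozenset({q1, q2}) -> float; missing pairs count as 0.
    Single-link grouping with union-find is an illustrative choice only.
    """
    parent = {q: q for q in queries}

    def find(q):
        # Follow parent links to the cluster representative (path halving).
        while parent[q] != q:
            parent[q] = parent[parent[q]]
            q = parent[q]
        return q

    for a, b in combinations(queries, 2):
        if similarity.get(frozenset((a, b)), 0.0) >= threshold:
            parent[find(a)] = find(b)  # merge the two clusters

    clusters = {}
    for q in queries:
        clusters.setdefault(find(q), []).append(q)
    # Sort for a deterministic result.
    return sorted(sorted(members) for members in clusters.values())


# Hypothetical queries and similarity values, for illustration only.
queries = ["eiffel tower", "paris landmark", "tabby cat", "kitten"]
similarity = {
    frozenset(("eiffel tower", "paris landmark")): 0.9,
    frozenset(("tabby cat", "kitten")): 0.8,
    frozenset(("eiffel tower", "kitten")): 0.1,
}
clusters = cluster_queries(queries, similarity)
```

With these made-up values, the two landmark queries end up in one cluster and the two cat queries in another. Raising `threshold` above every similarity value leaves each query in its own cluster.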
14 Claims
1. A computer-implemented method comprising:
obtaining a pair of search queries including (i) a first search query and (ii) a second search query;
obtaining (i) a first set of ranked images that are identified as responsive to the first search query, and (ii) a second set of ranked images that are identified as responsive to the second search query;
for each of the images of the first set of images, determining a respective image distance between the image of the first set of images and each image of the second set of images;
generating a score for each pair of images that includes an image from the first set of images and an image from the second set of images, wherein the score for each pair of images is based at least on (i) the rank of the image of the first set of images, (ii) the rank of the image of the second set of images, and (iii) the determined image distance between the image of the first set of images and the image of the second set of images;
determining a measure of similarity between (i) the first set of images that are identified as responsive to the first search query, and (ii) the second set of images that are identified as responsive to the second search query based on the scores; and
assigning the pair of search queries to a cluster of search queries based at least on the measure of similarity between (i) the first set of images that are identified as responsive to the first search query, and (ii) the second set of images that are identified as responsive to the second search query.
Dependent claims: 2, 3, 4, 5, 6
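The scoring and similarity steps recited in claim 1 can be sketched in Python as follows. The claim only requires that each pair score depend on both ranks and the image distance; it gives no formula. Everything concrete here is therefore an assumption for illustration: images are represented as numeric feature tuples, distance is Euclidean, ranks are 1-based, each pair is weighted by the reciprocal of both ranks, and the measure of similarity is a rank-weighted average of the pair scores.

```python
import math


def image_distance(features_a, features_b):
    """Euclidean distance between two feature tuples (an assumed metric)."""
    return math.dist(features_a, features_b)  # requires Python 3.8+


def pair_score(rank_a, rank_b, distance):
    """Score one image pair: higher for top-ranked, visually close images.

    The reciprocal-rank weighting and 1 / (1 + distance) falloff are
    hypothetical choices, not taken from the claim.
    """
    return (1.0 / rank_a) * (1.0 / rank_b) / (1.0 + distance)


def result_set_similarity(first_images, second_images):
    """Combine all pair scores into one similarity value in (0, 1].

    Each argument is a list of feature tuples ordered by rank (best first).
    Returns 0.0 if either list is empty.
    """
    total = 0.0
    weight = 0.0
    for rank_a, feat_a in enumerate(first_images, start=1):
        for rank_b, feat_b in enumerate(second_images, start=1):
            distance = image_distance(feat_a, feat_b)
            total += pair_score(rank_a, rank_b, distance)
            weight += 1.0 / (rank_a * rank_b)  # normalizer: score at distance 0
    return total / weight if weight else 0.0


# Hypothetical two-dimensional feature vectors, for illustration only.
near = result_set_similarity([(0.0, 0.0), (1.0, 0.0)], [(0.0, 0.1), (1.0, 0.1)])
far = result_set_similarity([(0.0, 0.0), (1.0, 0.0)], [(9.0, 9.0), (8.0, 9.0)])
```

Under these assumptions, result sets whose top-ranked images are visually close (`near`) receive a higher similarity value than sets whose images are far apart (`far`). That value could then be compared across query pairs when assigning queries to a cluster.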
7. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising:
obtaining a pair of search queries including (i) a first search query and (ii) a second search query;
obtaining (i) a first set of ranked images that are identified as responsive to the first search query, and (ii) a second set of ranked images that are identified as responsive to the second search query;
for each of the images of the first set of images, determining a respective image distance between the image of the first set of images and each image of the second set of images;
generating a score for each pair of images that includes an image from the first set of images and an image from the second set of images, wherein the score for each pair of images is based at least on (i) the rank of the image of the first set of images, (ii) the rank of the image of the second set of images, and (iii) the determined image distance between the image of the first set of images and the image of the second set of images;
determining a measure of similarity between (i) the first set of images that are identified as responsive to the first search query, and (ii) the second set of images that are identified as responsive to the second search query based on the scores; and
assigning the pair of search queries to a cluster of search queries based at least on the measure of similarity between (i) the first set of images that are identified as responsive to the first search query, and (ii) the second set of images that are identified as responsive to the second search query.
Dependent claims: 8, 9, 10, 11, 12
13. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
obtaining a pair of search queries including (i) a first search query and (ii) a second search query;
obtaining (i) a first set of ranked images that are identified as responsive to the first search query, and (ii) a second set of ranked images that are identified as responsive to the second search query;
for each of the images of the first set of images, determining a respective image distance between the image of the first set of images and each image of the second set of images;
generating a score for each pair of images that includes an image from the first set of images and an image from the second set of images, wherein the score for each pair of images is based at least on (i) the rank of the image of the first set of images, (ii) the rank of the image of the second set of images, and (iii) the determined image distance between the image of the first set of images and the image of the second set of images;
determining a measure of similarity between (i) the first set of images that are identified as responsive to the first search query, and (ii) the second set of images that are identified as responsive to the second search query based on the scores; and
assigning the pair of search queries to a cluster of search queries based at least on the measure of similarity between (i) the first set of images that are identified as responsive to the first search query, and (ii) the second set of images that are identified as responsive to the second search query.
Dependent claims: 14
Specification