Object retrieval using visual query context
First Claim
1. A method comprising:
under control of one or more processors specifically configured with executable instructions, receiving, at a search engine, a query associated with a query image, the query image including a plurality of pixels;
categorizing a first portion of the plurality of pixels as foreground pixels;
categorizing a second portion of the plurality of pixels as background pixels;
determining saliency scores for each of the plurality of pixels based on the foreground pixels and the background pixels, the saliency scores comprising contrast-based saliency scores;
normalizing the saliency scores to create normalized saliency scores;
transforming the normalized saliency scores to create a prior probability distribution;
determining a search intent based on the prior probability distribution using a dual-sigmoid approximation; and
identifying, from a set of images, one or more images that match the query image based on the search intent.
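The saliency steps of claim 1 (contrast-based scoring, normalization, transformation into a prior) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented method: the claim does not give the contrast formula or the transformation, so the sketch assumes grayscale intensities, scores each pixel by how much closer it lies to the foreground mean than to the background mean, uses min-max normalization, and divides by the sum to obtain a distribution over pixels. The function names `contrast_saliency`, `normalize`, and `to_prior` are hypothetical.

```python
def contrast_saliency(intensities, is_foreground):
    """Contrast-based score per pixel (assumed formula, not from the claim).

    A pixel scores high when it is far from the background mean and close
    to the foreground mean. Assumes both classes are non-empty.
    """
    fg = [v for v, f in zip(intensities, is_foreground) if f]
    bg = [v for v, f in zip(intensities, is_foreground) if not f]
    fg_mean = sum(fg) / len(fg)
    bg_mean = sum(bg) / len(bg)
    return [abs(v - bg_mean) - abs(v - fg_mean) for v in intensities]


def normalize(scores):
    """Min-max normalize scores into [0, 1] (assumed normalization)."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        # All pixels are equally salient; no contrast information.
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def to_prior(normalized):
    """Turn normalized scores into a distribution that sums to 1 (assumed transform)."""
    total = sum(normalized)
    if total == 0:
        # Fall back to a uniform prior when no pixel stands out.
        return [1.0 / len(normalized)] * len(normalized)
    return [s / total for s in normalized]


# Example: six pixels, the two bright ones categorized as foreground.
intensities = [10, 12, 200, 210, 11, 9]
is_foreground = [False, False, True, True, False, False]
prior = to_prior(normalize(contrast_saliency(intensities, is_foreground)))
print(prior)
```

In this toy input the two bright pixels receive almost all of the prior mass, which is the behavior the later search-intent step would rely on.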
Abstract
Some implementations provide techniques and arrangements to perform image retrieval. For example, some implementations identify an object of interest and a visual context in a first image. In some implementations, a second object of interest and a second visual context in a second image may be compared to the object of interest and the visual context, respectively, to determine whether the second image matches the first image.
20 Claims
1. A method comprising:
under control of one or more processors specifically configured with executable instructions, receiving, at a search engine, a query associated with a query image, the query image including a plurality of pixels;
categorizing a first portion of the plurality of pixels as foreground pixels;
categorizing a second portion of the plurality of pixels as background pixels;
determining saliency scores for each of the plurality of pixels based on the foreground pixels and the background pixels, the saliency scores comprising contrast-based saliency scores;
normalizing the saliency scores to create normalized saliency scores;
transforming the normalized saliency scores to create a prior probability distribution;
determining a search intent based on the prior probability distribution using a dual-sigmoid approximation; and
identifying, from a set of images, one or more images that match the query image based on the search intent.
Dependent claims: 2, 3, 4, 5, 6
7. Computer-readable storage media including instructions executable by one or more processors to perform operations comprising:
receiving data identifying a region of interest associated with a first image, the first image including a plurality of pixels;
determining contrast-based saliency scores for each of the plurality of pixels;
normalizing the contrast-based saliency scores to create normalized saliency scores;
transforming the normalized saliency scores to create a prior probability distribution;
determining a search intent based on a dual-sigmoid approximation in which a first sigmoid approximation is used for a first dimension of the region of interest and a second sigmoid approximation is used for a second dimension of the region of interest; and
performing a first comparison between the first image and a second image based on the search intent.
Dependent claims: 8, 9, 10, 11, 12, 20
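One way to picture a dual-sigmoid approximation with a separate sigmoid per dimension is sketched below. The claim does not give the formula, so the form here is an assumption: along each axis, a rising sigmoid at one edge of the region of interest is combined with a falling sigmoid at the opposite edge by taking the minimum, and the per-axis values are multiplied. The names `search_intent` and `dual_sigmoid_1d`, the box layout `(x_left, y_top, x_right, y_bottom)`, and the steepness parameters `delta_x` and `delta_y` are hypothetical; how the steepness would be derived from the prior probability distribution is not specified in the claim and is left as a free parameter.

```python
import math


def sigmoid(t):
    """Numerically stable logistic function."""
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)


def dual_sigmoid_1d(value, low_edge, high_edge, delta):
    """Soft indicator of low_edge <= value <= high_edge.

    Rises around low_edge, falls around high_edge; delta controls how
    sharply the score drops outside the region (assumed parameter).
    """
    rising = sigmoid(delta * (value - low_edge))
    falling = sigmoid(delta * (high_edge - value))
    return min(rising, falling)


def search_intent(x, y, box, delta_x, delta_y):
    """Search-intent score of pixel (x, y) for a region of interest.

    box is (x_left, y_top, x_right, y_bottom). Each dimension gets its
    own sigmoid pair and its own steepness.
    """
    x_left, y_top, x_right, y_bottom = box
    return (dual_sigmoid_1d(x, x_left, x_right, delta_x)
            * dual_sigmoid_1d(y, y_top, y_bottom, delta_y))


# Example: score three pixels against a region of interest.
box = (20, 20, 80, 80)
for point in [(50, 50), (20, 50), (0, 50)]:
    print(point, search_intent(point[0], point[1], box, 0.5, 0.5))
```

Under this form, pixels well inside the region score close to 1, pixels on an edge score about 0.5, and pixels outside decay smoothly toward 0 instead of being cut off, so nearby visual context still contributes.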
13. A computing device comprising:
one or more processors;
computer-readable storage media accessible to the one or more processors;
a communication interface to receive a query associated with a first image, the query including data identifying a region of interest that includes a portion of the first image;
a saliency detection module to determine contrast-based saliency scores for the first image, normalize the saliency scores to create normalized saliency scores, and determine a prior probability distribution based on the normalized saliency scores;
a search intent detection module to determine a search intent associated with the query based on the prior probability distribution, the search intent determined via a dual-sigmoid approximation, the dual-sigmoid function including a first sigmoid function associated with a first dimension of the region of interest and a second sigmoid function associated with a second dimension of the region of interest;
a visual words identification module to select visual words based on the search intent; and
a context-based object retrieval module to perform a search of a plurality of images based on the visual words to determine whether at least one image of the plurality of images matches the first image.
Dependent claims: 14, 15, 16, 17, 18, 19
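The last two modules of claim 13 (selecting visual words based on the search intent, then searching a plurality of images) might be illustrated as below. This is a sketch under assumptions the claim does not state: each detected feature is taken to be a pair of a visual-word identifier and a search-intent score already computed for its location, the words are accumulated into an intent-weighted histogram, and candidate images are ranked by cosine similarity. The names `weighted_histogram`, `cosine_similarity`, and `rank_images` are hypothetical, and cosine similarity is a stand-in for whatever matching function an implementation would use.

```python
import math
from collections import defaultdict


def weighted_histogram(features):
    """Accumulate (word_id, weight) pairs into a sparse histogram."""
    hist = defaultdict(float)
    for word_id, weight in features:
        hist[word_id] += weight
    return dict(hist)


def cosine_similarity(a, b):
    """Cosine similarity between two sparse histograms stored as dicts."""
    dot = sum(w * b.get(word, 0.0) for word, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_images(query_features, database):
    """Return image names ordered from best to worst match.

    database maps an image name to its own list of (word_id, weight) pairs.
    """
    query_hist = weighted_histogram(query_features)
    scored = [
        (cosine_similarity(query_hist, weighted_histogram(features)), name)
        for name, features in database.items()
    ]
    return [name for _, name in sorted(scored, reverse=True)]


# Example: word 3 appears twice in the query, once with a low intent score.
query = [(3, 0.9), (7, 0.8), (3, 0.1)]
database = {
    "image_a": [(3, 1.0), (7, 1.0)],
    "image_b": [(5, 1.0)],
}
print(rank_images(query, database))
```

Weighting by intent score rather than hard-cropping to the region of interest lets words from the surrounding visual context contribute, with less influence the farther they fall outside the region.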
Specification