DATA DRIVEN LOCALIZATION USING TASK-DEPENDENT REPRESENTATIONS
First Claim
1. A method for object localization in an image comprising:
- for an input image, generating a task-dependent representation of the input image based on relevance scores for an object to be localized, the relevance scores being output by a classifier for a plurality of locations in the input image;
identifying at least one similar image from a set of images, based on the task-dependent representation of the input image and task-dependent representations of images in the set of images; and
identifying a location of the object in the input image based on an object location annotation for at least one of the at least one similar images identified in the set of images,wherein at least one of the generating of the task-dependent representation, identifying of the at least one similar image, and the identifying a location of the object in the input image is performed with a computer processor.
7 Assignments
0 Petitions
Accused Products
Abstract
A computer implemented method for localization of an object, such as a license plate, in an input image includes generating a task-dependent representation of the input image based on relevance scores for the object to be localized. The relevance scores are output by a classifier for a plurality of locations in the input image, such as patches. The classifier is trained on patches extracted from training images and their respective relevance labels. One or more similar images are identified from a set of images, based on a comparison of the task-dependent representation of the input image and task-dependent representations of images in the set of images. A location of the object in the input image is identified based on object location annotations for the similar images.
-
Citations
24 Claims
-
1. A method for object localization in an image comprising:
-
for an input image, generating a task-dependent representation of the input image based on relevance scores for an object to be localized, the relevance scores being output by a classifier for a plurality of locations in the input image; identifying at least one similar image from a set of images, based on the task-dependent representation of the input image and task-dependent representations of images in the set of images; and identifying a location of the object in the input image based on an object location annotation for at least one of the at least one similar images identified in the set of images, wherein at least one of the generating of the task-dependent representation, identifying of the at least one similar image, and the identifying a location of the object in the input image is performed with a computer processor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A system for object localization in images comprising:
-
a patch representation generator which generates patch-based representations of a plurality of patches of an input image; a classifier component which classifies each of the patches with a trained classifier model based on the respective patch-based representation; a signature generator which generates a task-dependent representation of the input image based on the classifications of the patches; a retrieval component configured for retrieving at least one similar image from a set of images, based on a comparison measure between the task-dependent representation of the input image and task-dependent representations of images in the set of images; a segmentation component which segments the input image based on a location of an object in the at least one similar image and identifying a location of an object in the input image based on the segmentation; and a processor which implements the patch representation generator, classifier component, signature generator, and segmentation component. - View Dependent Claims (21, 22, 23)
-
-
24. A method for object localization in an image comprising:
with a processor; for each of a set of test images, generating a patch-based representation of a plurality of patches of the test image with a generative model; classifying each of the test image patches with a trained classifier model based on the respective patch-based representation; generating a task-dependent representation of each of the test images based on the classifications of the patches of the test image; generating patch-based representations of a plurality of patches of an input image with the generative model; classifying each of the patches of the input image with the trained classifier model based on the respective patch-based representation; generating a task-dependent representation of the input image based on the classifications of the patches of the input image; retrieving at least one similar test image from the set of test images, based on a comparison measure between the task-dependent representation of the input image and the task-dependent representations of the test images in the set of test images; and segmenting the input image based on a location of an object in the at least one similar test image and identifying a location of an object in the input image based on the segmentation.
Specification