METHOD AND SYSTEM FOR FAST AND ROBUST IDENTIFICATION OF SPECIFIC PRODUCT IMAGES
First Claim
1. Method of identification of objects in images characterised in that it comprises the following stages:
- (i) a feature extraction stage including the following steps for both;
reference images, i.e. images representing each at least a single reference object, and at least one query image, i.e. an image representing unknown objects to be identified;
(a) identification of key-points, i.e. salient image regions;
(b) post-processing of key-points where key-points that are not useful for the identification process are eliminated;
(c) computation of the descriptors, i.e. representations, of the key-points,(ii) an indexing stage of reference images including the following steps;
(a) key-point extraction;
(b) post-processing of key-points where key-points that are not useful for the identification process are eliminated;
(c) assignment of key-points to visual words of a visual word vocabulary created from a collection of training images, wherein the visual words are centres of clusters of key-point descriptors;
(d) addition of key-points to an inverted file structure, wherein the inverted file structure comprises a hit list for every visual word that stores all occurrences of the word in the reference images and wherein every hit stores an identifier of the reference image where the key-point was detected; and
(iii) a stage of recognition of objects present in the query image including the following steps;
(a) key-point extraction;
(b) post-processing of key-points where key-points that are not useful for the identification process are eliminated;
(c) assignment of key-points to visual words of the visual word vocabulary;
(d) for each pairing of a key-point from the query image and one of the hits assigned to the same visual word aggregating a vote into an accumulator corresponding to the reference image of the hit; and
(e) identification of the matching scores corresponding to the reference images based on the votes of the accumulators.
2 Assignments
0 Petitions
Accused Products
Abstract
Identification of objects in images. All images are scanned for key-points and a descriptor is computed for each region. A large number of descriptor examples are clustered into a Vocabulary of Visual Words. An inverted file structure is extended to support clustering of matches in the pose space. It has a hit list for every visual word, which stores all occurrences of the word in all reference images. Every hit stores an identifier of the reference image where the key-point was detected and its scale and orientation. Recognition starts by assigning key-points from the query image to the closest visual words. Then, every pairing of the key-point and one of the hits from the list casts a vote into a pose accumulator corresponding to the reference image where the hit was found. Every pair key-point/hit predicts specific orientation and scale of the model represented by the reference image.
55 Citations
17 Claims
-
1. Method of identification of objects in images characterised in that it comprises the following stages:
-
(i) a feature extraction stage including the following steps for both;
reference images, i.e. images representing each at least a single reference object, and at least one query image, i.e. an image representing unknown objects to be identified;(a) identification of key-points, i.e. salient image regions; (b) post-processing of key-points where key-points that are not useful for the identification process are eliminated; (c) computation of the descriptors, i.e. representations, of the key-points, (ii) an indexing stage of reference images including the following steps; (a) key-point extraction; (b) post-processing of key-points where key-points that are not useful for the identification process are eliminated; (c) assignment of key-points to visual words of a visual word vocabulary created from a collection of training images, wherein the visual words are centres of clusters of key-point descriptors; (d) addition of key-points to an inverted file structure, wherein the inverted file structure comprises a hit list for every visual word that stores all occurrences of the word in the reference images and wherein every hit stores an identifier of the reference image where the key-point was detected; and (iii) a stage of recognition of objects present in the query image including the following steps; (a) key-point extraction; (b) post-processing of key-points where key-points that are not useful for the identification process are eliminated; (c) assignment of key-points to visual words of the visual word vocabulary; (d) for each pairing of a key-point from the query image and one of the hits assigned to the same visual word aggregating a vote into an accumulator corresponding to the reference image of the hit; and (e) identification of the matching scores corresponding to the reference images based on the votes of the accumulators. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
Specification