Systems and methods for clustering of near-duplicate images in very large image collections

  • US 10,504,002 B2
  • Filed: 07/30/2017
  • Issued: 12/10/2019
  • Est. Priority Date: 07/30/2017
  • Status: Active Grant
First Claim

1. A computer-implemented method for clustering a plurality of images, the computer-implemented method being performed in connection with a computerized system comprising a central processing unit and a memory, the computer-implemented method comprising:

    a. generating a vocabulary of visual words in the plurality of images;

    b. extracting image features for image key points for each of the plurality of images;

    c. based on the extracted image features, creating an index pointing from the visual words in the vocabulary to images from the plurality of images, which contain these visual words;

    d. using the created index to collect all other images of the plurality of images that share at least one visual word with a selected image and determining a number of shared visual words;

    e. performing a geometric verification to verify whether the shared visual words are located at same locations in the selected image and the other images of the plurality of images and taking a fraction of verified shared visual words to all shared visual words as a similarity measure; and

    f. clustering the plurality of images hierarchically based on the similarity measure.
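
Steps c through f of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The toy data in `IMAGES`, the function names, the pixel tolerance `tol`, and the merge `threshold` are all hypothetical. A simple location-distance check stands in for geometric verification, and single-linkage agglomerative merging with a stopping threshold stands in for the hierarchical clustering. Steps a and b (building the vocabulary and extracting key point features) are assumed to have already produced a mapping from visual words to key point locations for each image.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy data: image id -> {visual word id: (x, y) key point location}.
# A real system would obtain this by quantizing local descriptors (steps a and b).
IMAGES = {
    "a": {1: (10, 10), 2: (20, 20), 3: (30, 30), 4: (40, 40)},
    "b": {1: (10, 11), 2: (20, 19), 3: (31, 30), 4: (90, 5)},
    "c": {3: (70, 70), 5: (15, 15), 6: (25, 25)},
    "d": {5: (15, 16), 6: (25, 24), 7: (50, 50)},
}


def build_index(images):
    """Step c: inverted index from each visual word to the images containing it."""
    index = defaultdict(set)
    for image_id, words in images.items():
        for word in words:
            index[word].add(image_id)
    return index


def shared_word_counts(index, images, selected):
    """Step d: for every other image sharing a word with `selected`, count shared words."""
    counts = defaultdict(int)
    for word in images[selected]:
        for other in index[word]:
            if other != selected:
                counts[other] += 1
    return dict(counts)


def similarity(images, a, b, tol=2.0):
    """Step e: fraction of shared words whose locations agree within `tol` pixels.

    Assumption: a per-axis distance threshold stands in for geometric verification.
    """
    shared = images[a].keys() & images[b].keys()
    if not shared:
        return 0.0
    verified = 0
    for word in shared:
        (xa, ya), (xb, yb) = images[a][word], images[b][word]
        if abs(xa - xb) <= tol and abs(ya - yb) <= tol:
            verified += 1
    return verified / len(shared)


def pairwise_similarities(images):
    """Use the index so that only pairs sharing at least one word are compared."""
    index = build_index(images)
    sims = {}
    for a in images:
        for b in shared_word_counts(index, images, a):
            sims[frozenset((a, b))] = similarity(images, a, b)
    return sims


def cluster(images, threshold=0.5):
    """Step f: single-linkage agglomerative merging until no link reaches `threshold`."""
    sims = pairwise_similarities(images)
    clusters = [{image_id} for image_id in sorted(images)]
    while len(clusters) > 1:
        best, pair = -1.0, None
        for i, j in combinations(range(len(clusters)), 2):
            link = max(
                sims.get(frozenset((a, b)), 0.0)
                for a in clusters[i]
                for b in clusters[j]
            )
            if link > best:
                best, pair = link, (i, j)
        if best < threshold:
            break
        i, j = pair
        clusters[i] |= clusters[j]
        del clusters[j]
    return [sorted(c) for c in clusters]


if __name__ == "__main__":
    print(cluster(IMAGES))
```

In this toy data, images "a" and "b" share four visual words, three of which pass the location check, so their similarity is 0.75. Images "a" and "c" share one word at very different locations, so their similarity is 0 and they are not merged at the assumed threshold of 0.5.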
