×

Learning multimedia semantics from large-scale unstructured data

  • US 9,875,301 B2
  • Filed: 04/30/2014
  • Issued: 01/23/2018
  • Est. Priority Date: 04/30/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • extracting, by at least one or more computing devices, visual features from images of a corpus of images;

    arranging, by the at least one or more computing devices, the images in clusters based at least in part on similarities of the visual features;

    calculating, by the at least one or more computing devices, at least two relevance features, including;

    first relevance features representing distribution characteristics of distances between pairs of images in a same cluster; and

    second relevance features representing distribution characteristics of distances between different clusters of images; and

    refining, by the at least one or more computing devices, the corpus by removing one or more images from the corpus based in part on the at least two relevance features to create a refined corpus.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×