×

Techniques for disambiguating clustered occurrence identifiers

  • US 10,803,135 B2
  • Filed: 12/13/2018
  • Issued: 10/13/2020
  • Est. Priority Date: 09/11/2018
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • identifying a collection of digital assets having at least a first digital asset and a second digital asset, the collection of digital assets being stored on a computing device;

    generating a consolidated address for a location cluster of digital assets comprising the first digital asset and the second digital asset;

    determining a first geographic location for the consolidated address based at least in part on location metadata associated with the first digital asset;

    determining a first time for the first digital asset based at least in part on time metadata associated with the first digital asset;

    transmitting a request to a web service for a plurality of event identifiers within a target range of the first geographic location, each of the event identifiers specifying an event geographic location and an event time;

    filtering the plurality of event identifiers, the filtering of the plurality of event identifiers to remove locations visited above a frequency threshold;

    accessing a knowledge graph stored on the computing device to correlate metadata for the collection of digital assets with a category identified in the knowledge graph;

    calculating a distance range between the first geographic location and a second geographic location, the second geographic location based at least in part on location metadata associated with the second digital asset, the second digital asset being within a target range of the event geographic location;

    calculating a time range between the first time and a second time, the second time based at least in part on time metadata associated with the second digital asset;

    calculating a confidence metric, the confidence metric indicating a degree of confidence that the first digital asset was generated at an event corresponding to at least one of the plurality of event identifiers, the confidence metric calculated based at least in part on rules for the category of the first digital asset, the rules specifying that the distance range satisfies a minimum distance, the time range satisfies a minimum duration, and a number of digital assets stored on the computing device that are within the distance range of the event geographic location and within the time range of an event time satisfies a minimum number;

    associating the first digital asset with at least one of the plurality of event identifiers based at least in part on a determination that the confidence metric exceeds a threshold; and

    updating the knowledge graph stored in the computing device to include the association between the first digital asset and the at least one of the plurality of event identifiers.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×