×

SYSTEM FOR CLUSTERING AND AGGREGATING DATA FROM MULTIPLE SOURCES

  • US 20150199744A1
  • Filed: 01/12/2015
  • Published: 07/16/2015
  • Est. Priority Date: 01/10/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method of aggregating entity data from a plurality of sources, the method comprising:

  • obtaining sample data from a plurality of data sources, the sample data corresponding to a plurality of entities, wherein samples from multiple data sources correspond to a same entity;

    processing the samples to identify a plurality of fields corresponding to each sample, the fields including a name and a geographical indicator;

    identifying a first cluster of the samples as corresponding to a first entity based on a first set of rules, the first cluster including a first sample, wherein identifying the first cluster includes;

    determining whether a second sample is in the first cluster by;

    determining a first field distance between a first field of the first sample and the first field of the second sample;

    calculating a first metric based on the first field distance; and

    adding the second sample to the first metric when the first metric is within a first threshold;

    comparing the fields of at least a portion of the samples in the first cluster to determine the name and the geographical indicator for the first entity; and

    storing the name and the geographical indicator of the first entity into a first record of a database.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×