×

DATA CLUSTERING BASED ON CANDIDATE QUERIES

  • US 20130124525A1
  • Filed: 11/15/2012
  • Published: 05/16/2013
  • Est. Priority Date: 11/15/2011
  • Status: Active Grant
First Claim
Patent Images

1. A method, including:

  • receiving data records, the received data records each including one or more values in one or more fields; and

    processing the received data records to identify a matched data cluster to associate with each received data record, the processing including;

    for selected data records from the received data records, generating a query from the one or more values included in the selected data record;

    identifying one or more candidate data records from the received data records using the query;

    determining whether or not the selected data record satisfies a cluster membership criterion for at least one candidate data cluster of one or more existing data clusters containing the candidate records; and

    selecting the matched data cluster from among one or more candidate data clusters based at least in part on a growth criterion for the candidate data clusters, or initializing the matched data cluster with the selected data record if the selected data record does not satisfy a cluster membership criterion for any of the existing data clusters or based on a result of the growth criterion.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×