DATA CLUSTERING BASED ON VARIANT TOKEN NETWORKS

  • US 20130124524A1
  • Filed: 11/15/2012
  • Published: 05/16/2013
  • Est. Priority Date: 11/15/2011
  • Status: Active Grant
First Claim

1. A method, including:

  • receiving data records, the received data records each including one or more values in one or more fields; and

  • processing the received data records to identify one or more data clusters, the processing including:

    identifying tokens that each include at least one value or fragment of a value in a field or a combination of fields;

    generating a network representing the identified tokens, with nodes of the network representing tokens and edges of the network each representing a variant relationship between tokens; and

    generating a graphical representation of the network with different subsets of nodes distinguished based at least in part on values associated with nodes, where a value associated with a particular node quantifies a count of a number of instances of the token represented by that particular node appearing within the received data records.
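The steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation, and several details are assumptions made only for illustration: tokens are taken to be lowercase whitespace-separated words, a "variant relationship" is taken to mean that two tokens differ by a single character edit, and the "graphical representation" is emitted as Graphviz DOT text in which nodes whose count meets a hypothetical threshold are drawn filled. The function names (tokenize, is_variant, build_network, to_dot) and the sample field name are likewise hypothetical.

```python
from collections import Counter
from itertools import combinations


def tokenize(records, fields):
    """Split the chosen fields of each record into lowercase word tokens."""
    tokens = []
    for record in records:
        for field in fields:
            tokens.extend(str(record.get(field, "")).lower().split())
    return tokens


def is_variant(a, b):
    """Assumed variant test: a and b differ by exactly one character edit."""
    if a == b or abs(len(a) - len(b)) > 1:
        return False
    if len(a) > len(b):
        a, b = b, a  # make a the shorter (or equal-length) token
    # Skip the common prefix, then compare what remains.
    i = 0
    while i < len(a) and a[i] == b[i]:
        i += 1
    if len(a) == len(b):
        return a[i + 1:] == b[i + 1:]  # one substitution
    return a[i:] == b[i + 1:]          # one extra character in b


def build_network(records, fields):
    """Return (counts, edges): per-token instance counts and variant edges."""
    counts = Counter(tokenize(records, fields))
    edges = {
        (a, b)
        for a, b in combinations(sorted(counts), 2)
        if is_variant(a, b)
    }
    return counts, edges


def to_dot(counts, edges, threshold=2):
    """Render the network as DOT text; nodes at or above threshold are filled."""
    lines = ["graph variants {"]
    for token, n in sorted(counts.items()):
        style = "filled" if n >= threshold else "solid"
        lines.append(f'  "{token}" [label="{token} ({n})", style={style}];')
    for a, b in sorted(edges):
        lines.append(f'  "{a}" -- "{b}";')
    lines.append("}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical sample records with a single "surname" field.
    records = [
        {"surname": "Smith"},
        {"surname": "Smith"},
        {"surname": "Smyth"},
        {"surname": "Smithe"},
        {"surname": "Jones"},
    ]
    counts, edges = build_network(records, ["surname"])
    print(to_dot(counts, edges))
```

With the sample records, "smith" would be the only filled node because it is the only token seen twice, and it would be linked to "smithe" and "smyth". The pairwise comparison is quadratic in the number of distinct tokens, which is acceptable for a sketch but would likely need a more selective candidate search on large data sets.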

  • 3 Assignments