Internal linking co-convergence using clustering with no hierarchy
First Claim
1. A computer-implemented method comprising:
- clustering database records into a first set of clusters having corresponding first cluster identifications (IDs), each database record comprising one or more field values, wherein the clustering is based at least in part on determining similarity among corresponding field values of the database records;
associating mutually matching database records, wherein the associating comprises performing at least one matching iteration for each of the database records, wherein the matching iteration is based at least in part on the first cluster IDs;
determining similarity among corresponding field values of the database records;
re-clustering at least a portion of the database records into a second set of clusters having corresponding second cluster IDs, the re-clustering based at least in part on the associating mutually matching database records and on the determining similarity among corresponding field values of the database records; and
outputting database record information, based at least in part on the re-clustering.
1 Assignment
0 Petitions
Accused Products
Abstract
Certain implementations of the disclosed technology include systems and methods for linking entities in an internal database by utilizing co-convergence and clustering. The method may include clustering database records into a first set of clusters having corresponding first cluster identifications (IDs). The clustering may be based at least in part on determining similarity among corresponding field values. The method may include associating mutually matching database records, by performing at least one matching iteration for each of the database records. The method may include determining similarity among corresponding field values of the database records, re-clustering at least a portion of the database records into a second set of clusters, the re-clustering based at least in part on the associating mutually matching database records and on the determining similarity among corresponding field values of the database records.
159 Citations
24 Claims
-
1. A computer-implemented method comprising:
-
clustering database records into a first set of clusters having corresponding first cluster identifications (IDs), each database record comprising one or more field values, wherein the clustering is based at least in part on determining similarity among corresponding field values of the database records; associating mutually matching database records, wherein the associating comprises performing at least one matching iteration for each of the database records, wherein the matching iteration is based at least in part on the first cluster IDs; determining similarity among corresponding field values of the database records; re-clustering at least a portion of the database records into a second set of clusters having corresponding second cluster IDs, the re-clustering based at least in part on the associating mutually matching database records and on the determining similarity among corresponding field values of the database records; and outputting database record information, based at least in part on the re-clustering. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
-
at least one memory for storing data and computer-executable instructions; and at least one processor configured to access the at least one memory and further configured to execute the computer-executable instructions for; clustering database records into a first set of clusters having corresponding first cluster identifications (IDs), each database record comprising one or more field values, wherein clustering is based at least in part on a determined similarity among corresponding field values of the database records; associating mutually matching database records by performing at least one matching iteration for each of the database records, wherein the matching iteration is based at least in part on the first cluster IDs; determining similarity among corresponding field values of the database records; re-clustering at least a portion of the database records into a second set of clusters having corresponding second cluster IDs, wherein the re-clustering is based at least in part on the associated mutually matching database records and on the determining the similarity among corresponding field values of the database records; and outputting database record information, based at least in part on the re-clustering. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory computer readable media comprising computer-executable instructions that, when executed by one or more processors, cause the one or more processors to perform a method comprising:
-
clustering database records into a first set of clusters having corresponding first cluster identifications (IDs), each database record comprising one or more field values, wherein the clustering is based at least in part on determining similarity among corresponding field values of the database records; associating mutually matching database records, wherein the associating comprises performing at least one matching iteration for each of the database records, wherein the matching iteration is based at least in part on the first cluster IDs; determining similarity among corresponding field values of the database records; re-clustering at least a portion of the database records into a second set of clusters having corresponding second cluster IDs, the re-clustering based at least in part on the associating mutually matching database records and on the determining similarity among corresponding field values of the database records; and outputting database record information, based at least in part on the re-clustering. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification