×

CROSS-DOMAIN CLUSTERABILITY EVALUATION FOR CROSS-GUIDED DATA CLUSTERING BASED ON ALIGNMENT BETWEEN DATA DOMAINS

  • US 20110167064A1
  • Filed: 01/06/2010
  • Published: 07/07/2011
  • Est. Priority Date: 01/06/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method for evaluating cross-domain clusterability upon a target domain and a source domain, said method comprising:

  • a processor of a computer system receiving the source domain and the target domain, wherein the source domain comprises at least one source data item and the target domain comprises at least one target data item;

    said processor calculating target clusterability as an average of a respective clusterability of said at least one target data item such that the target clusterability quantifies how clusterable the target domain is, wherein the respective clusterability of a target data item of said at least one target data item quantifies how unambiguously the target data item can be assigned to a respective true target centroid associated with the target data item;

    said processor calculating target-side matchability as an average of a respective matchability of each target centroid of the target domain to source centroids of the source domain such that the target-side matchability quantifies how well target centroids of the target domain are aligned with the source centroids;

    said processor calculating source-side matchability as an average of a respective matchability of each source centroid of said source centroids to the target centroids such that the source-side matchability quantifies how well the source centroids are aligned with the target centroids;

    said processor calculating source-target pair matchability as an average of the target-side matchability and the source-side matchability;

    said processor calculating cross-domain clusterability between the target domain and the source domain as a linear combination of the calculated target clusterability and the calculated source-target pair matchability by use of a trade-off parameter that indicates relative contribution of the target clusterability and the source-target pair matchability to the cross-domain clusterability; and

    said processor transferring the calculated cross-domain clusterability to a device selected from an output device of the computer system, a storage device of the computer system, a remote computer system coupled to the computer system, and a combination thereof.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×