High-dimensional data clustering with the use of hybrid similarity matrices
First Claim
1. A computer-based method for computation of similarity matrices of objects in a high-dimensional space of attributes with the purpose of clustering, allowing for fusion of different attributes (parameters) on a dimensionless basis, comprising the steps of:
- a) computation of similarity matrices for each of attributes (parameters) individually, such matrices being monomer similarity matrices;
andb) hybridization of all monomer similarity matrices into one hybrid matrix which is further used in clustering process.
5 Assignments
0 Petitions
Accused Products
Abstract
This invention provides a method, apparatus and algorithm for compact description of objects in high-dimensional space of attributes for the purpose of cluster analysis by method of evolutionary transformation of similarity matrices. The proposed method comprises computation of monomeric similarity matrices based on each of parameters that describe a set of objects and the following hybridization of monomeric matrices into a hybrid similarity matrix, which allows for comparison of different attributes on a dimensionless basis. Individual monomeric matrices may be added to a hybrid matrix in any proportion, thus allowing for evaluation of significance of individual parameters. Two types of metrics are proposed for computation of monomeric matrices, depending on quantitative and qualitative nature of attributes used for description of objects under analysis.
28 Citations
8 Claims
-
1. A computer-based method for computation of similarity matrices of objects in a high-dimensional space of attributes with the purpose of clustering, allowing for fusion of different attributes (parameters) on a dimensionless basis, comprising the steps of:
-
a) computation of similarity matrices for each of attributes (parameters) individually, such matrices being monomer similarity matrices; and b) hybridization of all monomer similarity matrices into one hybrid matrix which is further used in clustering process. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
Specification