Method and system for data mining in high dimensional data spaces
First Claim
1. A computerized data mining method performed by a processor that analyzes a multitude of items in an n-dimensional space Dn, each described by n item features, said method using a mining function f with at least one control parameter Pi controlling a target feature of the data mining function, said method comprising:
- a first step of selecting a transformation function T to reduce dimensions of said n-dimensional space by space-filling curves mapping said n-dimensional space to a in-dimensional space;
a second step of determining a transformed control parameter PT i controlling the target feature of the data mining function in said m-dimensional space, wherein the m-dimensional space comprises fewer dimensions that the n-dimensional space and wherein the transformation function T ensures that all information within the n-dimensional space is mapped onto and maintained in the m-dimensional data space;
a third step of applying said selected transformation function T on said multitude Dn of items to create a transformed multitude Dm of items and executing said mining function f controlled by said transformed control parameter PT i on said transformed multitude of items Dm ; and
a fourth step of storing a result of the third step in memory.
2 Assignments
0 Petitions
Accused Products
Abstract
A computerized method and system for analyzing a multitude of items in a high dimensional (n-dimensional) data space Dn each described by n item features. The method uses a mining function f with at least one control parameter Pi controlling the target of the data mining function. The method selects a transformation function T for reducing dimensions of the n-dimensional space by space-filling curves mapping said n-dimensional space to a m-dimensional space (m<n). The method determines a transformed control parameter PT i controlling the target of the data mining function in the m-dimensional space. The method applies the selected transformation function T on the multitude Dn of items to create a transformed multitude Dm of items, executes the mining function f controlled by the transformed control parameter PT i on the transformed multitude of items Dm, and stores the result.
18 Citations
11 Claims
-
1. A computerized data mining method performed by a processor that analyzes a multitude of items in an n-dimensional space Dn, each described by n item features, said method using a mining function f with at least one control parameter Pi controlling a target feature of the data mining function, said method comprising:
-
a first step of selecting a transformation function T to reduce dimensions of said n-dimensional space by space-filling curves mapping said n-dimensional space to a in-dimensional space; a second step of determining a transformed control parameter PT i controlling the target feature of the data mining function in said m-dimensional space, wherein the m-dimensional space comprises fewer dimensions that the n-dimensional space and wherein the transformation function T ensures that all information within the n-dimensional space is mapped onto and maintained in the m-dimensional data space; a third step of applying said selected transformation function T on said multitude Dn of items to create a transformed multitude Dm of items and executing said mining function f controlled by said transformed control parameter PT i on said transformed multitude of items Dm ; and a fourth step of storing a result of the third step in memory. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
Specification