Data mining platform for bioinformatics and other knowledge discovery
First Claim
1. A computer-implemented data mining platform for generating an output comprising knowledge from analysis of a plurality of data sets, wherein the data sets include heterogeneous data types or the data sets come from heterogeneous data sources, the platform comprising:
- a plurality of modules, each module adapted for processing one data type of the plurality of heterogeneous data types, each module comprising an input data source, a data analysis engine, a data output and a server connection for each of the input data source, the data analysis engine and the data output, wherein the data analysis engine comprises at least one processor for executing one or more support vector machines for generating a plurality of classes of data and at least one margin between classes, and one or more feature subset ranking algorithms;
a server connected to the server connection for communicating with each of the input data source, the data analysis engine and the data output and for providing means for monitoring one or more of the input data source, the data analysis engine, and the data output; and
a combined data analysis engine in communication with the server for combining the data output from the plurality of modules to generate a single output representing results obtained from analyzing the plurality of heterogeneous data types.
5 Assignments
0 Petitions
Accused Products
Abstract
The data mining platform comprises a plurality of system modules (500, 550), each formed from a plurality of components. Each module has an input data component (502, 552), a data analysis engine (504, 554) for processing the input data, an output data component (506, 556) for outputting the results of the data analysis, and a web server (510) to access and monitor the other modules within the unit and to provide communication to other units. Each module processes a different type of data, for example, a first module processes microarray (gene expression) data while a second module processes biomedical literature on the Internet for information supporting relationships between genes and diseases and gene functionality
177 Citations
23 Claims
-
1. A computer-implemented data mining platform for generating an output comprising knowledge from analysis of a plurality of data sets, wherein the data sets include heterogeneous data types or the data sets come from heterogeneous data sources, the platform comprising:
-
a plurality of modules, each module adapted for processing one data type of the plurality of heterogeneous data types, each module comprising an input data source, a data analysis engine, a data output and a server connection for each of the input data source, the data analysis engine and the data output, wherein the data analysis engine comprises at least one processor for executing one or more support vector machines for generating a plurality of classes of data and at least one margin between classes, and one or more feature subset ranking algorithms;
a server connected to the server connection for communicating with each of the input data source, the data analysis engine and the data output and for providing means for monitoring one or more of the input data source, the data analysis engine, and the data output; and
a combined data analysis engine in communication with the server for combining the data output from the plurality of modules to generate a single output representing results obtained from analyzing the plurality of heterogeneous data types. - View Dependent Claims (3, 5, 6, 7, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
2. (canceled)
-
4. (canceled)
-
9. (canceled)
Specification