System for identifying patterns in biological data using a distributed network
First Claim
1. A system for identifying patterns in biological data using a support vector machine comprising:
- a server in communication with a distributed network for receiving a training biological data set, a test biological data set, a live biological data set and a financial account identifier from one or more remote sources, the one or more remote sources also in communication with the distributed network;
one or more storage devices in communication with the server for storing the training biological data set and the test biological data set; and
a processor for executing a support vector machine operable and for;
collecting the training biological data set from the one or more storage devices, pre-processing the training biological data set to add meaning to each of a plurality of training biological data points, inputting the pre-processed training biological data set into the support vector machine so as to train the support vector machine, in response to training of the support vector machine, collecting the test biological data set from the one or more storage devices, pre-processing the test biological data set to add meaning to each of a plurality of test biological data points, inputting the test biological data set into the trained support vector machine in order to test the support vector machine, in response to receiving a test output from the trained support vector machine, collecting the live biological data set from the one or more storage devices, inputting the live biological data set into the tested and trained support vector machine in order to process the live biological data, in response to receiving a live output from the support vector machine, post-processing the live output, and transmitting the post-processed live output to the server;
wherein the server is further operable for;
communicating with a financial institution in order to a financial account identified by the financial account identifier, and in response to securing payment, transmitting the post-processed live output to the one or more remote sources or another remote source.
8 Assignments
0 Petitions
Accused Products
Abstract
System for enhancing knowledge discovery from data using a learning machine in general and a support vector machine in particular in a distributed network environment. A customer transmits training data, test data and live data to a vendor'"'"'s server from a remote source, via a distributed network. The training biological data, test biological data and live biological data is stored in a storage device. Training biological data is then pre-processed in order to add meaning thereto. Pre-processing data involves transforming the biological data points and/or expanding the biological data points. Live biological data is pre-processed and input into the trained and tested learning machine. The live output from the learning machine is then post-processed into a computationally derived alphanumerical classifier for interpretation by a human or computer automated process.
209 Citations
16 Claims
-
1. A system for identifying patterns in biological data using a support vector machine comprising:
-
a server in communication with a distributed network for receiving a training biological data set, a test biological data set, a live biological data set and a financial account identifier from one or more remote sources, the one or more remote sources also in communication with the distributed network;
one or more storage devices in communication with the server for storing the training biological data set and the test biological data set; and
a processor for executing a support vector machine operable and for;
collecting the training biological data set from the one or more storage devices, pre-processing the training biological data set to add meaning to each of a plurality of training biological data points, inputting the pre-processed training biological data set into the support vector machine so as to train the support vector machine, in response to training of the support vector machine, collecting the test biological data set from the one or more storage devices, pre-processing the test biological data set to add meaning to each of a plurality of test biological data points, inputting the test biological data set into the trained support vector machine in order to test the support vector machine, in response to receiving a test output from the trained support vector machine, collecting the live biological data set from the one or more storage devices, inputting the live biological data set into the tested and trained support vector machine in order to process the live biological data, in response to receiving a live output from the support vector machine, post-processing the live output, and transmitting the post-processed live output to the server;
wherein the server is further operable for;
communicating with a financial institution in order to a financial account identified by the financial account identifier, and in response to securing payment, transmitting the post-processed live output to the one or more remote sources or another remote source. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
wherein pre-processing the training biological data set to add meaning to each training biological data point comprises;
determining that the training biological data point is dirty; and
in response to determining that the training biological data point is dirty, cleaning the training biological data point.
-
-
3. The system of claim 2, wherein cleaning the training biological data point comprises deleting, repairing or replacing the biological data point.
-
4. The system of claim 1, wherein each training biological data point comprises a vector having one or more original coordinates;
- and
wherein pre-processing the training biological data set to add meaning to each training biological data point comprises adding dimensionality to each training biological data point by adding one or more new coordinates to the vector.
- and
-
5. The system of claim 4, wherein the one or more new coordinates added to the vector are derived by applying a transformation to one or more of the original coordinates.
-
6. The system of claim 4, wherein the transformation is based on expert knowledge.
-
7. The system of claim 4, wherein the transformation is computationally derived.
-
8. The system of claim 4, wherein the training biological data set comprises a continuous variable;
- and
wherein the transformation comprises optimally categorizing the continuous variable of the training biological data set.
- and
-
9. The system of claim 1, wherein the knowledge to be discovered from the biological data relates to a regression or density estimation;
-
wherein the support vector machine produces a training output comprising a continuous variable; and
wherein the processor is further operable for post-processing the training output by optimally categorizing the training output to derive cutoff points in the continuous variable.
-
-
10. The system of claim 1, wherein the processor is further operable for:
-
in response to comparing each of the test outputs with each other, determining that none of the test outputs is the optimal solution;
adjusting the different kernels of one or more of the plurality of support vector machines; and
in response to adjusting the selection of the different kernels, retraining and retesting each of the plurality of support vector machines.
-
-
11. The system of claim 1, wherein the post-processed live output comprises a computationally derived alpha-numerical classifier adapted for human or automated interpretation.
-
12. A system for diagnosing disease using a support vector machine comprising:
-
a server in communication with a distributed network for receiving a training biological data set, a test biological data set, a live biological data set and a financial account identifier from a one or more remote sources, the one or more remote sources also in communication with the distributed network;
one or more storage devices in communication with the server for storing the training biological data set and the test biological data set; and
a processor for executing a support vector machine and for;
collecting the training biological data set from the one or more storage devices, pre-processing the training biological data set to add meaning to each of a plurality of training biological data points, inputting the pre-processed training biological data set into the support vector machine so as to train the support vector machine, in response to training of the support vector machine, collecting the test biological data set from the database one or more storage devices, pre-processing the test biological data set to add meaning to each of a plurality of test biological data points, inputting the test biological data set into the trained support vector machine in order to test the support vector machine, in response to receiving a test output from the trained support vector machine, collecting the live biological data set from the one or more storage devices, inputting the live biological data set into the tested and trained support vector machine in order to process the live biological data, in response to receiving a live output from the support vector machine, post-processing the live output, and transmitting the post-processed live output to the server;
wherein the server is further operable for;
communicating with a financial institution in order to secure payment through a financial account identified by the financial account identifier, and in response to securing payment, transmitting the post-processed live output to the one or more remote sources or another remote source. - View Dependent Claims (13, 14, 15, 16)
-
Specification