Method and apparatus for identification and optimization of bioactive compounds using a neural network
First Claim
1. A computerized neural network system for predicting the chemical activity of at least one molecule of interest comprising:
- a) an input layer consisting of at least one neuron where input data is sent as a vector value;
b) a weight matrix where every entry in the form of an input vector is multiplied by a set weight and then sent to at least one hidden layer neuron;
c) a hidden layer consisting of at least one neuron such that when said input vector is multiplied by a set weight said hidden layer contains said weight matrix, said weight matrix having the dimensions n by m where n is the length of an input vector and m is the number of hidden layer neurons available;
d) an output layer consisting of at least one neuron where weight matrix data is sent before it is input into a transfer function;
e) a transfer function that is non-linear in form and is capable of taking any value generated by said output layer and returning a number between −
1 and 1 or another predetermined range;
f) a training process for said neural network such that said neural network can accurately approximate a free energy of binding of at least one known training molecule with an output from said output layer; and
g) a test process in which a trained neural network is used to predict a free energy of binding for said at least one molecule of interest;
wherein the physicochemical descriptor of said at least one molecule of interest is the quantum mechanical electrostatic potential of said at least one molecule of interest at the van der Waels surface of said at least one molecule of interest and, wherein said test process Includes the use of at least one adjuster molecule such that after said training process said neural network is used to predict a free energy of binding for said at least one adjuster molecule, said at least one adjuster molecule having a known free energy of binding and having been excluded from the set of molecules comprising the set of said of at least one known training molecule.
0 Assignments
0 Petitions
Accused Products
Abstract
A computational method for the discovery and design of therapeutically valuable bioactive compounds is presented. The method employed has successfully analyzed enzymatic inhibitors for their chemical properties through the use of a neural network and associated algorithms. This method is an improvement over the current methods of drug discovery which often employs a random search through a large library of synthesized chemical compounds or biological samples for bioactivity related to a specific therapeutic use. This time-consuming process is the most expensive portion of current drug discovery methods. The development of computational methods for the prediction of specific molecular activity will facilitate the design of novel chemotherapeutics or other chemically useful compounds. The novel neural network provided in the current invention is “trained” with the bioactivity of known compounds and then used to predict the bioactivity of unknown compounds.
-
Citations
59 Claims
-
1. A computerized neural network system for predicting the chemical activity of at least one molecule of interest comprising:
-
a) an input layer consisting of at least one neuron where input data is sent as a vector value;
b) a weight matrix where every entry in the form of an input vector is multiplied by a set weight and then sent to at least one hidden layer neuron;
c) a hidden layer consisting of at least one neuron such that when said input vector is multiplied by a set weight said hidden layer contains said weight matrix, said weight matrix having the dimensions n by m where n is the length of an input vector and m is the number of hidden layer neurons available;
d) an output layer consisting of at least one neuron where weight matrix data is sent before it is input into a transfer function;
e) a transfer function that is non-linear in form and is capable of taking any value generated by said output layer and returning a number between −
1 and 1 or another predetermined range;
f) a training process for said neural network such that said neural network can accurately approximate a free energy of binding of at least one known training molecule with an output from said output layer; and
g) a test process in which a trained neural network is used to predict a free energy of binding for said at least one molecule of interest;
wherein the physicochemical descriptor of said at least one molecule of interest is the quantum mechanical electrostatic potential of said at least one molecule of interest at the van der Waels surface of said at least one molecule of interest and, wherein said test process Includes the use of at least one adjuster molecule such that after said training process said neural network is used to predict a free energy of binding for said at least one adjuster molecule, said at least one adjuster molecule having a known free energy of binding and having been excluded from the set of molecules comprising the set of said of at least one known training molecule.- View Dependent Claims (2)
-
-
3. A computerized double neural network system for predicting the chemical activity of at least one molecule of interest comprising:
-
a) an outer neural network further comprising;
i) an outer network input layer consisting of at least one neuron where input data is sent as a vector value;
ii) an outer network weight matrix where every entry in the form of an input vector is multiplied by a set weight and then sent to at least one hidden layer neuron;
iii) an outer network hidden layer consisting of at least one neuron such that when said input vector Is multiplied by a set weight said hidden layer contains said weight matrix, said weight matrix having the dimensions n by m where n is the length of an input vector and m is the number of hidden layer neurons available;
iv) an outer network output layer consisting of at least one neuron where weight matrix data is sent before it is input into a transfer function;
v) an outer network transfer function that is non-linear in form and is capable of taking any value generated by said output layer and returning a number between −
1 and 1 or another predetermined range;
vi) a training process for said outer neural network such that said neural network can accurately approximate a free energy of binding of at least one known training molecule with an output from said output layer;
b) an inner neural network capable of receiving data from said outer neural network further comprising;
i) an inner network weight matrix where every entry in the form of an input vector is multiplied by a set weight and then sent to at least one hidden layer neuron;
ii) an inner network hidden layer consisting of at least one neuron such that when said input vector is multiplied by a set weight said hidden layer contains said weight matrix, said weight matrix having the dimensions n by m where n is the length of an input vector and m is the number of hidden layer neurons available;
iii) an inner network output layer consisting of at least one neuron where weight matrix data is sent before it is input into a transfer function said inner network output layer having an output value;
iv) an inner network transfer function that is non-linear in form and is capable of taking said output value generated by said output layer and returning a number between −
1 and 1 or another predetermined range;
v) a training process for said inner neural network such that said neural network can accurately approximate a free energy of binding of at least one known training molecule with an output from said output layer vi) a test process in which a trained neural network is used to predict a free energy of binding for said at least one molecule of interest;
wherein said inner neural network is integrated to function with the data generated from said outer neural network such that the rules for said free energy of binding learned by said outer neural network are utilized by said inner neural network to model a quantum object such that said double neural network is used to predict the chemical characteristics of said quantum object, said quantum object describing a molecule with improved chemical properties of binding relative to said at least one molecule of interest;
wherein said outer network output layer is the input layer of said inner neural network;
wherein said outer network hidden layer includes an error term, said error term being used to calculate the correction terms for said outer network input layer such that the weights and biases of said double neural network are optimized; and
wherein the physicochemical descriptor of said at least one molecule of interest is the quantum mechanical electrostatic potential of said at least one molecule of interest at the van der Waals surface of said at least one molecule of interest. - View Dependent Claims (4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
a) a pharmaceutical;
b) an enzyme;
c) a catalyst;
d) a polypeptide;
e) an amino acid derivative;
f) a carbohydrate;
g) a nucleotide;
h) a macromolecular compound;
i) an organic moiety of an alkyl, cycloalkyl, aryl, aralkyl or alkaryl group or a substituted or heterocyclic derivative thereof;
j) an industrial compound; and
k) a polymer.
-
-
20. The double neural network of claim 19, wherein said at least one molecule of interest is an enzyme.
-
21. The method of claim 20, wherein said at least one molecule of interest is selected from the group consisting of:
-
a) a pharmaceutical;
b) an enzyme;
c) a catalyst;
d) a polypeptide;
e) an amino acid derivative;
f) a carbohydrate;
g) a nucleotide;
h) a macromolecular compound;
i) an organic moiety of an alkyl, cycloalkyl, aryl, aralkyl or alkaryl group or a substituted or heterocyclic derivative thereof; and
j) an industrial compound; and
k) a polymer.
-
-
22. The double neural network of claim 3, wherein said output value is decreased by at least 1Δ
- G/RT.
-
23. The double neural network of claim 3, wherein said output value is decreased by 3Δ
- G/RT.
-
24. A computer implemented method for predicting the chemical activity of at least one molecule of interest by using a neural network comprising:
-
a) inputting data into an input layer consisting of at least one neuron where input data is sent as a vector value;
b) developing a weight matrix wherein every entry in the form of an input vector is multiplied by a set weight and then sent to at least one hidden layer neuron;
c) providing a hidden layer consisting of at least one neuron such that when said input vector is multiplied by a set weight said hidden layer contains said weight matrix, said weight matrix having the dimensions n by m where n is the length of an input vector and m is the number of hidden layer neurons available;
d) constructing an output layer consisting of at least one neuron where weight matrix data is sent before it is input into a transfer function;
e) utilizing a transfer function that is non-linear in form and is capable of taking any value generated by said output layer and returning a number between −
1 and 1;
f) employing a training process for said neural network such that said neural network can accurately approximate a free energy of binding of at least one known training molecule with an output from said output layer; and
employing a test process in which a trained neural network is used to predict a free energy of binding for said at least one molecule of interest wherein the physicochemical descriptor of said at least one molecule of interest is the quantum mechanical electrostatic potential of said at least one molecule of interest at the van der Waals surface of said at least one molecule of interest, wherein said test process includes the use of at least one adjuster molecule such that after said training process said neural network is used to predict a free energy of binding for said at least one adjuster molecule, said at least one adjuster molecule having a known free energy of binding and having been excluded from the set of molecules comprising the set of said of at least one known training molecule.- View Dependent Claims (25)
-
-
26. A computer implemented method for predicting the chemical activity of at least one molecule of interest by using a double neural network comprising:
-
a) utilizing an outer neural network further comprising;
i) an outer network input layer consisting of at least one neuron where input data is sent as a vector value;
ii) an outer network weight matrix where every entry in the form of an input vector is multiplied by a set weight and then sent to at least one hidden layer neuron;
iii) an outer network hidden layer consisting of at least one neuron such that when said input vector is multiplied by a set weight said hidden layer contains said weight matrix, said weight matrix having the dimensions n by m where n is the length of an input vector and m is the number of hidden layer neurons available;
iv) an outer network output layer consisting of at least one neuron where weight matrix data is sent before it is input into a transfer function;
v) an outer network transfer function that is non-linear in form and is capable of b) taking any value generated by said output layer and returning a number between −
1 and 1;
i) an outer network training process for said neural network such that said neural network can accurately approximate a free energy of binding of at least one known training molecule with an output from said output layer;
c) providing an inner neural network capable of receiving data from said outer neural network further comprising;
i) an inner network weight matrix where every entry in the form of an input vector is multiplied by a set weight and then sent to at least one hidden layer neuron;
ii) an inner network hidden layer consisting of at least one neuron such that when said input vector is multiplied by a set weight said hidden layer contains said weight matrix, said weight matrix having the dimensions n by m where n is the length of an input vector and m is the number of hidden layer neurons available;
iii) an inner network output layer consisting of at least one neuron where weight matrix data is sent before it is input into a transfer function said inner network output layer having an output value;
iv) an inner network transfer function that is non-linear in form and is capable of taking any value generated by said output layer and returning a number between −
1 and 1;
v) an inner network training process for said neural network such that said neural network can accurately approximate a free energy of binding of at least one known training molecule with an output from said output layer;
vi) a test process in which a trained neural network is used to predict a free energy of binding for said at least one molecule of interest;
d) integrating said inner neural network to function with the data generated from said outer neural network such that the rules for said free energy of binding learned by said outer neural network are utilized by said inner neural network to model a quantum object such that said double neural network is used to predict the chemical characteristics of said quantum object, said quantum object describing a molecule with improved chemical properties of binding relative to said at least one molecule of interest;
e) constructing said outer network input layer such that said output layer of said outer neural network is the input layer of said inner neural network; and
f) providing said outer network hidden layer with an error term, said error term being used to calculate the correction terms for said outer network input layer such that the weights and biases of said double neural network are optimized;
wherein the physicochemical descriptor of said at least one molecule of interest is the quantum mechanical electrostatic potential of said at least one molecule of interest at the van der Waals surface of said at least one molecule of interest.- View Dependent Claims (27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43)
a) a pharmaceutical;
b) an enzyme;
c) a catalyst;
d) a polypeptide;
e) an amino acid derivative;
f) a carbohydrate;
g) a nucleotide;
h) a macromolecular compound;
i) an organic moiety of an alkyl, cycloalkyl, aryl, aralkyl or alkaryl group or a substituted or heterocyclic derivative thereof;
j) an industrial compound; and
k) a polymer.
-
-
42. The double neural network of claim 26, wherein said output value is decreased by at least 1Δ
- G/RT.
-
43. The double neural network of claim 26, wherein said output value is decreased by 3Δ
- G/RT.
-
44. A computerized neural network system comprising a neural network having a first component trained to recognize binding energy for a first set of molecular descriptors based on geometric and/or electrostatic information and for a given binding energy returning a second set of the molecular descriptors through a second component of the network.
- 45. A computerized double neural network system comprising a trained neural network for predicting binding potency for a chemotherapeutic agent with a target molecule, the network having an input layer, and the network being coupled to an output layer of an outer neural network comprising one or more layers so that the output of the output layer of the outer neural network is the input to the input layer of the inner neural network.
-
47. A computer implemented method comprising providing a neural network having a first component trained to recognize binding energy for a first set of molecular descriptors based on geometric and/or electrostatic information and for a given binding energy returning a second set of the molecular descriptors through a second component of the network.
- 48. A computer implemented method comprising providing a trained neural network for predicting binding potency for a chemotherapeutic agent with a target molecule, the network having an input layer, coupling the network to an output layer of an outer neural network comprising one or more layers so that the output of the output layer of the outer neural network is the input to the input layer of the inner neural network.
-
49. A computer implemented method comprising providing a trained neural network for predicting binding potency for a chemotherapeutic agent with a target molecule, the network having an input layer coupled to an output layer of an outer neural network comprising one or more layers so that the output of the output layer of the outer neural network is the input to the input layer of the inner neural network, and inputting molecular descriptors based on geometric and/or electrostatic information into the input layer from the coupled outer layer.
-
51. A computer implemented method of customizing the binding features of a molecule of interest comprising:
-
providing a neural network comprising a first component trained to recognize binding energy for first set of molecular descriptors based on electrostatic and/or geometrical information, and for a given binding energy returning a second set of the molecular descriptors through a second component of the network;
selecting a molecule of interest and modifying it so that the resulting molecule has a set of molecular descriptors that more closely matches descriptors in the second set of descriptors returned by the second component of the network.
-
-
52. A computer implemented method of determining a set of molecular descriptors:
-
providing a neural network comprising an inner network trained to predict binding energy of a molecule of interest with a target molecule using a set of molecular descriptors based on geometric and/or electrostatic information for the molecule of interest the inner network having an input layer coupled to the output layer of an outer neural network for inputting molecular descriptors, in the inner neural network, setting the binding energy for an unknown molecule of interest to a desired level; and
determining a set of molecular descriptors for an unknown molecule of interest by computing through the network a set of molecular descriptors that if output from the output layer of the outer neural network would yield a binding energy within a desired range of a predetermined binding energy set for the inner neural network. - View Dependent Claims (53, 54, 55, 56, 57, 58, 59)
-
Specification