Molecular interaction predictors
First Claim
Patent Images
1. A method executed on a processing unit, the method comprising:
- providing an optimized set of weighted contact potentials, the set of optimized weighted contact potentials optimized utilizing one or more machine learning algorithms;
choosing a set of distances according to a structural template relating to a contact between a protein and a peptide, the set of distances defining minimum distances of the contact between amino acids of the protein and an amino acids of a peptide;
determining a score that rates the contact between the protein and the peptide, the score determined by evaluating the protein sequence and peptide sequence data according to the set of distances and the optimized set of weighted contact potentials;
inferring a hidden variable representing information about geometry of a protein-peptide complex to facilitate evaluating the contact between the protein and the peptide, wherein the inferring occurs by defining distances for the set of distances from structural data corresponding to the protein sequence; and
utilizing the hidden variable to influence the protein sequence.
3 Assignments
0 Petitions
Accused Products
Abstract
Adaptive threading models for predicting an interaction between two or more molecules such as proteins are provided. The adaptive threading models have one or more learnable parameters that can be learned from all or some of the available data. The available data can include data relating to known interactions between the two or more molecules, the composition of the molecules and the geometry of the molecular complex.
35 Citations
16 Claims
-
1. A method executed on a processing unit, the method comprising:
-
providing an optimized set of weighted contact potentials, the set of optimized weighted contact potentials optimized utilizing one or more machine learning algorithms; choosing a set of distances according to a structural template relating to a contact between a protein and a peptide, the set of distances defining minimum distances of the contact between amino acids of the protein and an amino acids of a peptide; determining a score that rates the contact between the protein and the peptide, the score determined by evaluating the protein sequence and peptide sequence data according to the set of distances and the optimized set of weighted contact potentials; inferring a hidden variable representing information about geometry of a protein-peptide complex to facilitate evaluating the contact between the protein and the peptide, wherein the inferring occurs by defining distances for the set of distances from structural data corresponding to the protein sequence; and utilizing the hidden variable to influence the protein sequence. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
-
a processing unit; a memory coupled to the processing unit; components executed on the processing unit including; a prediction component providing an optimized set of weighted contact potentials, the set of optimized weighted contact potentials optimized utilizing one or more machine learning algorithms; the prediction component choosing a set of distances according to a structural template relating to a contact between a protein and a peptide, the set of distances defining minimum distances of the contact between amino acids of the protein and amino acids of the peptide; the prediction component determining a score that rates the contact between the protein and the peptide, the score determined by evaluating protein sequence and peptide sequence data according to the set of distances and the optimized set of weighted contact potentials; an inference component inferring a hidden variable representing information about geometry of a protein-peptide complex to facilitate evaluating the contact between the protein and the peptide, wherein the inferring occurs by defining distances for the set of distances from structural data corresponding to the protein sequence; and the prediction component utilizing the hidden variable to influence the protein sequence. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
Specification