Methods of constructing and screening libraries of peptide structures
First Claim
1. A method for producing a library of independent protein secondary structures, said method consisting essentially of the following sequential steps:
- (i) executing a computer program to thereby obtain a plurality of amino acid sequences of different proteins from bioinformatic data source(s) wherein the sequences are predicted to form secondary structures and/or assemblies of secondary structures in their native contexts;
(ii) executing a computer program to simultaneously select from (i) a plurality of amino acid sequences of the different proteins, wherein the simultaneously-selected sequences each consist of a single segment of a protein corresponding to a hydrophobic folding unit is predicted to form a secondary structures independent of any other part of the protein from which it is derived when isolated from those other parts of the protein;
(iii) chemically-synthesizing a plurality of peptides that each consist essentially of a sequence selected at (ii) that is predicted to form a secondary structure independent of any other part of the protein from which it is derived when isolated from other parts of the protein; and
(iv) displaying a plurality of chemically-synthesized peptides that each consists essentially of a sequence selected at (iii) that is predicted to independently-form a secondary structure such that the displayed peptides form secondary structures independent of other parts of the proteins from which they are derived, thereby producing a peptide library of independent protein secondary structures.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention provides the means for producing libraries of peptide structures for drug screening applications that are capable of folding or assuming their native conformations independently of artificial scaffolds or flanking sequences in the proteins from which they are derived. The libraries can be highly diverse such that they are representative of the repertoire of protein structures existing in nature. The libraries can also be non-redundant or normalized such that the bias towards specific structures existing in source data sets and/or in nature is/are removed. In a particularly preferred embodiment, the present invention provides 30,000 independent fold structures produced by this method. The present invention also provides computer-readable media and systems comprising structural data in relation to the peptide libraries, and methods for displaying and screening the libraries.
54 Citations
33 Claims
-
1. A method for producing a library of independent protein secondary structures, said method consisting essentially of the following sequential steps:
-
(i) executing a computer program to thereby obtain a plurality of amino acid sequences of different proteins from bioinformatic data source(s) wherein the sequences are predicted to form secondary structures and/or assemblies of secondary structures in their native contexts; (ii) executing a computer program to simultaneously select from (i) a plurality of amino acid sequences of the different proteins, wherein the simultaneously-selected sequences each consist of a single segment of a protein corresponding to a hydrophobic folding unit is predicted to form a secondary structures independent of any other part of the protein from which it is derived when isolated from those other parts of the protein; (iii) chemically-synthesizing a plurality of peptides that each consist essentially of a sequence selected at (ii) that is predicted to form a secondary structure independent of any other part of the protein from which it is derived when isolated from other parts of the protein; and (iv) displaying a plurality of chemically-synthesized peptides that each consists essentially of a sequence selected at (iii) that is predicted to independently-form a secondary structure such that the displayed peptides form secondary structures independent of other parts of the proteins from which they are derived, thereby producing a peptide library of independent protein secondary structures. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A method for producing a library of independent protein secondary structures having low structural redundancy, said method consisting essentially of the following sequential steps:
-
(i) executing a computer program to thereby obtain a plurality of amino acid sequences of different proteins from bioinformatic data source(s) wherein the sequences are predicted to form secondary structures and/or assemblies of secondary structures in their native contexts; (ii) executing a computer program to simultaneously select from (i) a plurality of amino acid sequences of the different proteins, wherein the simultaneously-selected sequences each consist of a single segment of a protein corresponding to a hydrophobic folding unit is predicted to form a secondary structures independent of any other part of the protein from which it is derived when isolated from those other parts of the protein; (iii) executing a computer program to thereby identify sequences in the plurality at (ii) that are predicted to form redundant secondary structures and removing or deleting sufficient sequences predicted to form the redundant secondary structures to thereby select a plurality of amino acid sequences predicted to form non-redundant secondary structures independent of any other part of the proteins from which they are derived when isolated from those other parts of the proteins, wherein no more than five of said sequences are predicted to form the same secondary structure; and (iv) chemically-synthesizing a plurality of peptides that each consist essentially of a sequence selected at (iii) that is predicted to form a non-redundant secondary structure independent of any other part of the protein from which it is derived when isolated from those other parts of the protein; and (v) displaying the plurality of peptides chemically-synthesized at (iv) such that the displayed peptides form secondary structures independent of other parts of the proteins, thereby producing a library of independent protein secondary structures having low structural redundancy. - View Dependent Claims (28, 29, 30)
-
-
31. A method for producing a library of independent protein secondary structures having low structural redundancy, said method consisting essentially of the following sequential steps:
-
(i) executing a computer program to thereby obtain a plurality of amino acid sequences of different proteins from bioinformatic data source(s) wherein the sequences are predicted to form secondary structures and/or assemblies of secondary structures in their native contexts; (ii) executing a computer program to simultaneously select from (i) a plurality of amino acid sequences of the different proteins, wherein the simultaneously-selected sequences each consist of a single segment of a protein corresponding to a hydrophobic folding unit is predicted to form a secondary structures independent of any other part of the protein from which it is derived when isolated from those other parts of the protein; (iii) executing a computer program to identify sequences in the plurality at (ii) which are redundant and removing or deleting the redundant sequences to thereby select for a non-redundant plurality of sequences; (iv) chemically-synthesizing a plurality of peptides that each consist essentially of a sequence selected at (iii) which is non-redundant and that is predicted to form a secondary structure independent of any other part of the protein from which it is derived when isolated from those other parts of the protein; and (v) displaying the plurality of peptides chemically-synthesized at (iv) such that the displayed peptides form secondary structures independent of their native contexts, thereby producing a library of independent protein secondary structures having low structural redundancy.
-
-
32. A method for producing a library of independent protein secondary structures, said method comprising:
-
(i) executing a computer program to thereby identify a plurality of amino acid sequences of different proteins from bioinformatic data source(s) wherein the sequences are capable of folding independently from other parts of the proteins in which they are contained in their native contexts; (ii) executing a computer program to simultaneously select from (i) a plurality of amino acid sequences of different proteins that arc each consist of a single segment of a protein corresponding to a hydrophobic folding unit capable of folding independently from other parts of the proteins from which they are derived when isolated from those other parts of the proteins; (iii) size-selecting those sequences selected at (ii) to thereby select a sub-set of sequences having the average length of an independent protein fold; (iv) executing a computer program to identify redundant sequences in the sub-set of sequences selected at (iii) and removing or deleting redundant sequences to thereby select for a non-redundant plurality of amino acid sequences; (v) chemically-synthesizing a plurality of peptides that each consist essentially of a sequence selected at (iv) which is non-redundant and capable of folding independently of any other part of the protein from which it is derived when isolated from other parts of the protein; and
;(vi) displaying the plurality of peptides chemically-synthesized at (v) such that the displayed peptides form secondary structures and/or folds independent of other parts of the proteins, thereby producing a library of independent protein secondary structures.
-
-
33. A method for producing a library of independent protein secondary structures, said method comprising:
-
(i) executing a computer program to thereby identify a plurality of amino acid sequences of different proteins from bioinformatic data source(s) wherein the sequences are capable of folding independently from other parts of the proteins in which they are contained in their native contexts; (ii) executing a computer program to simultaneously select from (i) a plurality of amino acid sequences of the different proteins that each consist of a single segment of a protein corresponding to a hydrophobic folding unit and are each capable of folding independently from other parts of the proteins from which they are derived when isolated from those other parts of the proteins; (iii) size-selecting those sequences at (ii) to thereby select a sub-set of sequences having the average length of an independent protein fold; (iv) executing a computer program to identify redundant sequences in the sub-set of sequences selected at (iii) and removing or deleting redundant sequences to thereby select for a non-redundant plurality of amino acid sequences; (v) identifying related sequences to the non-redundant plurality of amino acid sequences at (iv) and adding those related sequences to the non-redundant plurality of amino acid sequences to thereby produce a diverse pool of amino acid sequences; (vi) chemically-synthesizing a plurality of peptides that each consist essentially of a sequence selected from the diverse pool of amino acid sequences produced at (v); and (vii) displaying the plurality of peptides chemically-synthesized at (vi) such that the displayed peptides form secondary structures and/or folds independent of any other parts of the protein from which they are derived when isolated from those other parts of the proteins their native contexts, thereby producing a library of independent protein secondary structures.
-
Specification