HIGH THROUGHPUT METHOD FOR DISCOVERY OF GENE CLUSTERS

US 20100016170A1
Filed: 05/14/2007
Published: 01/21/2010
Est. Priority Date: 10/13/2000
Status: Abandoned Application

First Claim

Patent Images

1. A method for detecting genes which act together in a coordinated manner and are clustered together in a genome, said method comprising the steps of:

a) preparing, from isolated genomic DNA, a large-insert library of DNA fragments;

b) determining the DNA sequence of at least part of some of the fragments in the large-insert library to form a plurality of Gene Sequence Tags (GSTs);

c) comparing, under computer control, the DNA sequence of the GST with sequences in a database containing genes, gene fragments, DNA sequences or amino acid sequences known to be part of a cluster of genes that act together in a coordinated manner and that are clustered together on a chromosome to identify a GST that has similar structure to a gene, gene fragment, DNA sequence or amino acid sequence known to be part of a cluster of genes that act together in a coordinated manner; and

d) using the GST having similar structure to a gene, gene fragment, DNA sequence or amino acid sequence known to be part of a cluster of genes that act together in a coordinated manner to detect a DNA fragment from the large-insert library, which DNA fragment from the large insert library contains the GST and genes which act together in a coordinated manner and are clustered together on a chromosome.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method for identifying gene cluster is disclosed. The method may be used for identifying gene clusters involved in the biosynthesis of natural products. A small insert library of DNA fragments of genomic DNA and a large insert library of DNA fragments of genomic DNA are prepared. Fragments in the small insert library are sequenced and compared by homology comparison under computer control to a database containing genes, gene fragments or proteins known to be involved in the biosynthesis of microbial natural products. Fragments having similar structure to genes, gene fragments or proteins known to be involved in the biosynthesis of naturally occurring metabolites are used as probes to screen the large insert library of genomic DNA to detect gene clusters involved in the biosynthesis of microbial natural products.

Citations

25 Claims

1. A method for detecting genes which act together in a coordinated manner and are clustered together in a genome, said method comprising the steps of:
- a) preparing, from isolated genomic DNA, a large-insert library of DNA fragments;
  
  b) determining the DNA sequence of at least part of some of the fragments in the large-insert library to form a plurality of Gene Sequence Tags (GSTs);
  
  c) comparing, under computer control, the DNA sequence of the GST with sequences in a database containing genes, gene fragments, DNA sequences or amino acid sequences known to be part of a cluster of genes that act together in a coordinated manner and that are clustered together on a chromosome to identify a GST that has similar structure to a gene, gene fragment, DNA sequence or amino acid sequence known to be part of a cluster of genes that act together in a coordinated manner; and
  
  d) using the GST having similar structure to a gene, gene fragment, DNA sequence or amino acid sequence known to be part of a cluster of genes that act together in a coordinated manner to detect a DNA fragment from the large-insert library, which DNA fragment from the large insert library contains the GST and genes which act together in a coordinated manner and are clustered together on a chromosome.

2. A method for detecting genes which act together in a coordinated manner and are clustered together in a genome, said method comprising the steps of:
- a) preparing, from isolated genomic DNA, a small insert library of DNA fragments of the genomic DNA and a large insert library of DNA fragments of the genomic DNA;
  
  b) determining the DNA sequence of at least part of some of the fragments in the small insert library to form a plurality of Gene Sequence Tags (GSTs);
  
  c) comparing, under computer control, the DNA sequence of the GSTs or the amino acid sequence corresponding to the DNA sequence of the GSTs with sequences in a database containing genes, gene fragments, DNA, or amino acid sequences known to be part of a cluster of genes that act together in a coordinated manner and are clustered together on a chromosome to identify a GST that has similar structure to a gene, gene fragment, DNA or amino acid sequence known to be part of a cluster of genes that act together in a coordinated manner; and
  
  d) using the GST having similar structure to a gene, gene fragment, DNA or amino acid sequence known to be part of a cluster of genes that act together in a coordinated manner to detect a DNA fragment from the large insert library, which DNA fragment from the large insert library contains the GST and genes which act together in a coordinated manner and are clustered together on a chromosome.
- View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
- - 3. The method of claim 2, wherein step d) involves identifying, from the small insert library, the DNA fragment containing the GST having similar structure to a gene, gene fragment, DNA or amino acid sequence known to be part of a cluster of genes that act together in a coordinated manner, and using the DNA fragment of the small insert library or a portion thereof as a hybridization probe to screen the large insert library to detect a large insert DNA fragment containing genes that act together in a coordinated manner.
  - 4. The method according to claim 2, comprising the further step of:
    - e) determining the sequence of the large insert DNA fragment from step d).
  - 5. The method according to claim 2, wherein step b) further comprises the additional step of translating the DNA sequence of the GSTs to generate corresponding amino acid sequences, and wherein in step c) comparing is done on the basis of the amino acid sequence.
  - 6. The method according to claim 2, wherein in step c) the identification of GSTs that have similar structure to genes, gene fragments, DNA or amino acid sequences known to be part of a cluster of genes that act together in a coordinated manner is done by computer assisted homology analysis.
  - 7. The method according to claim 2, wherein the genomic DNA is obtained from a microorganism.
  - 8. The method according to claim 7, wherein the microorganism is a prokaryotic microorganism.
  - 9. The method according to claim 8, wherein the microorganism is of a genus selected from Nocardia, Geodermatophilus, Actinoplanes, Micromonospora, Nocardioides, Saccharothrix, Amycolatopsis, Kutzneria, Saccharomonospora, Saccharopolyspora, Kitasatospora, Streptomyces, Microbispora, Streptosporangium, and Actinomadura.
  - 10. The method according to claim 8, wherein the microorganism is of a genus selected from Stigmatella, Myxococcus and Polyangium.
  - 11. The method according to claim 2, wherein the genomic DNA is drawn from a population of uncultured microorganisms.
  - 12. The method according to claim 2, wherein the genomic DNA is derived from a cultured microorganism.
  - 13. The method according to claim 2, wherein the DNA fragments in the small insert library are between about 1.5 kilobase pairs (kbp) and about 10 kbp.
  - 14. The method according to claim 13, wherein the DNA fragments in the small insert library are between about 1.5 kbp and about 5 kbp.
  - 15. The method according to claim 13, wherein the DNA fragments in the small insert library are between about 1.5 kbp and about 3 kbp.
  - 16. The method according to claim 2, wherein the DNA fragments in the large insert library are between about 10 kbp and about 300 kbp.
  - 17. The method according to claim 16, wherein the DNA fragments in the large insert library are between about 30 kbp to about 50 kbp.
  - 18. The method according to claim 2, wherein the genes which act together in a coordinated manner and are clustered together in a genome are associated with a pathogenicity island.
  - 19. The method according to claim 2, wherein the genes which act together in a coordinated manner and are clustered together in a genome are associated with degradation of a compound.
  - 20. The method according to claim 2, wherein the genes which act together in a coordinated manner and are clustered together in a genome are associated with conferring resistance to a therapeutic drug.

21. A high-throughput method for identifying a gene or gene cluster involved in the biosynthesis of a microbial natural product comprising:
- a) preparing, from isolated genomic DNA, a large insert library of DNA fragments of about 30 kbp to about 300 kbp;
  
  b) determining the DNA sequence of a least part of some of the fragments in the small insert library to form a plurality of Gene Sequence Tags (GSTs);
  
  c) comparing, under computer control, the DNA sequence of the GSTs or the amino acid sequence corresponding to the GSTs with sequences in a database containing genes, gene fragments, DNA sequences or amino acid sequences known to be involved in the biosynthesis of microbial natural products to identify a GST that has a similar structure to a gene, gene fragment, DNA sequence or amino acid sequence known to be involved in the biosynthesis of microbial natural products; and
  
  d) using the GST having similar structure to a gene, gene fragment, DNA or amino acid sequence known to be involved in the biosynthesis of microbial natural products, or portions thereof, to identify a DNA fragment from the large insert library, which DNA fragment contains the GST and a gene or gene cluster involved in the biosynthesis of a microbial natural product.

22. A method for scanning the genome of a microorganism to identify a gene cluster involved in the biosynthesis of a lipopeptide, said method comprising:
- a) providing genomic DNA from a microorganism;
  
  b) preparing a randomly generated small insert library of DNA fragments of about 1.5 kbp to about 10 kbp of the genomic DNA, and a randomly generated large insert library of DNA fragments of the genomic DNA of about 10 kbp to about 300 kbp;
  
  c) sequencing at least part of some of the fragments in the small insert library to form a plurality of Gene Sequence Tags (GSTs) of about 300 base pairs (bp) to about 700 bp, translating the DNA sequences of the GSTs into the corresponding amino acid sequence and providing the amino acid sequence of the GSTs in computer readable form;
  
  d) comparing, under computer control, the amino acid sequences of the GSTs with sequences in a database containing amino acid sequences known to be involved in the biosynthesis of lipopeptides to identify a GST that has a similar structure to an amino acid sequence known to be involved in the biosynthesis of lipopeptides; and
  
  e) using the GST of step d) as a hybridization probe to screen the large insert library of genomic DNA to detect a DNA fragment containing a gene cluster involved in the biosynthesis of a lipopeptide.

23. A method for scanning the genome of a microorganism to identify a gene cluster involved in the biosynthesis of an enediyne, said method comprising:
- a) providing genomic DNA from a microorganism;
  
  b) preparing a randomly generated small insert library of DNA fragments of about 1.5 kbp to about 10 kbp of the genomic DNA, and a randomly generated large insert library of DNA fragments of the genomic DNA of about 10 kbp to about 300 kbp;
  
  c) sequencing at least part of some of the fragments in the small insert library to form a plurality of gene sequence tags (GSTs) of about 300 bp to about 700 bp, translating the DNA sequence of the GSTs into the corresponding amino acid sequence and providing the amino acid sequences of the GSTs in computer readable form;
  
  d) comparing, under computer control, the amino acid sequences of the GSTs with sequences in a database containing amino acid sequences known to be involved in the biosynthesis of enediynes to identify a GST that has a similar structure to an amino acid sequence known to be involved in the biosynthesis of enediynes; and
  
  e) using the GST of step d) as a hybridization probe to screen the large insert library of genomic DNA to detect a DNA fragment containing a gene cluster involved in the biosynthesis of an enediyne.

24. A method for scanning the genome of a microorganism to identify a gene cluster involved in the biosynthesis of an orthosomycin, said method comprising:
- a) providing genomic DNA from a microorganism;
  
  b) preparing a randomly generated small insert library of DNA fragments of about 1.5 kbp to about 10 kbp of the genomic DNA, and a randomly generated large insert library of DNA fragments of the genomic DNA of about 10 kbp to about 300 kbp;
  
  c) sequencing at least part of the fragments in the small insert library to form a plurality of gene sequence tags (GSTs) of about 300 bp to about 700 bp, translating the DNA sequence of the GSTs into the corresponding amino acid sequence and providing the amino acid sequences of the GSTs in computer readable form; and
  
  d) comparing, under computer control, the amino acid sequences of the GSTs with sequences in a database containing amino acid sequences known to be involved in the biosynthesis of orthosomycins to identify a GST that has a similar structure to an amino acid sequences known to be involved in the biosynthesis of orthosomycins; and
  
  e) using the GST of step d) as a hybridization probe to screen the large insert library of genomic DNA to detect a DNA fragment containing a gene cluster involved in the biosynthesis of an orthosomycin.

25. A method for scanning the genome of a microorganism to identify a polyketide synthase gene or a gene cluster including a polyketide synthase gene, said method comprising:
- a) providing genomic DNA from a microorganism;
  
  b) preparing a randomly generated small insert library of DNA fragments of about 1.5 kbp to about 10 kbp of the genomic DNA, and a randomly generated large insert library of DNA fragments of the genomic DNA of about 10 kbp to about 300 kbp;
  
  c) sequencing at least part of some of the fragments in the small insert library to form a plurality of Gene Sequence Tags (GSTs) of about 300 base pairs (bp) to about 700 bp, translating the DNA sequences of the GSTs into the corresponding amino acid sequence and providing the amino acid sequence of the GSTs in computer readable form;
  
  d) comparing, under computer control, the amino acid sequences of the GSTs with sequences in a database containing amino acid sequences known to be associated with a polyketide synthase to identify a GST that has a similar structure to an amino acid sequence known to be associated with a polyketide synthase; and
  
  e) using the GST of step d) as a hybridization probe to screen the large insert library of genomic DNA to detect a DNA fragment containing a polyketide synthase gene or a gene cluster including polyketide synthase gene.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Thallion Pharmaceuticals Inc. (Sun Pharmaceutical Industries Limited)
Original Assignee
Thallion Pharmaceuticals Inc. (Sun Pharmaceutical Industries Limited)
Inventors
Zazopoulos, Emmanuel, Farnet, Chris M., Staffa, Alfredo

Application Number

US11/803,406
Publication Number

US 20100016170A1
Time in Patent Office

Days
Field of Search
US Class Current

506/4
CPC Class Codes

C07K 14/36   from Actinomyces; from Stre...

C07K 14/365   from Actinoplanes (G)

C12N 15/52   Genes encoding for enzymes ...

C12P 19/62   the hetero ring having eigh...

HIGH THROUGHPUT METHOD FOR DISCOVERY OF GENE CLUSTERS

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

Citations

25 Claims

Specification

Solutions

Use Cases

Quick Links

HIGH THROUGHPUT METHOD FOR DISCOVERY OF GENE CLUSTERS

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

25 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links