×

Identification and comparison of protein-protein interactions that occur in populations

  • US 6,083,693 A
  • Filed: 06/14/1996
  • Issued: 07/04/2000
  • Est. Priority Date: 06/14/1996
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for identifying, classifying, or quantifying one or more nucleic acids encoding one or more proteins of one or more protein--protein interactions, said method comprising:

  • (a) recombinantly expressing within a population of host cells a first population of first fusion proteins, each said first fusion protein comprising a first protein sequence and a DNA binding domain in which the DNA binding domain is the same in each said first fusion protein, in which said host cells comprise at least one polynucleotide operably linked to a promoter driven by one or more DNA binding sites recognized by said DNA binding domain of said first fusion protein such that an interaction of a first fusion protein with a second fusion protein, said second fusion protein comprising a transcriptional regulatory domain, results in regulation of transcription of said at least one polynucleotide by said transcriptional regulatory domain, and in which said first population of first fusion proteins has a complexity of at least 10;

    (b) negatively selecting to eliminate those host cells expressing said first population of first fusion proteins in which said regulation of transcription of said at least one polynucleotide occurs in the absence of a second fusion protein, said negatively selecting being performed by a method comprising incubating said host cells expressing said first population of first fusion proteins in an environment in which substantial death of said host cells occurs if said regulation of transcription occurs;

    (c) after step (b), recombinantly co-expressing within said population of host cells a second population of second fusion proteins, each said second fusion protein comprising a second protein sequence and a transcriptional regulatory domain of a transcriptional regulator, in which the transcriptional regulatory domain is the same in each said second fusion protein, and in which said second population of second fusion proteins has a complexity of at least 10;

    (d) detecting said regulation of transcription of said at least one polynucleotide, thereby detecting cells in which an interaction between a first protein and a second protein has occurred;

    (e) amplifying DNA fragments encoding said first fusion protein or encoding a portion thereof comprising said first protein sequence from a plurality of host cells in which said regulation of transcription has been detected;

    (f) probing a sample comprising said amplified DNA fragments with one or more recognition means, each recognition means causing recognition of a target nucleotide subsequence or a set of target nucleotide subsequences;

    (g) generating one or more output signals from said sample probed by said recognition means, each output signal being produced from a nucleic acid in said sample by recognition of one or more target nucleotide subsequences in said nucleic acid by said recognition means and comprising a representation of (i) the identities of effective subsequences, each said effective subsequence being a nucleotide subsequence comprising a target nucleotide subsequence, or the identities of sets of effective subsequences, each said set having member effective subsequences each of which comprises a different target nucleotide subsequence from one of said sets of target nucleotide sequences, and (ii) the length between occurrences of effective subsequences in said nucleic acid; and

    (h) searching a nucleotide sequence database to determine sequences that are predicted to produce or the absence of any sequences that are predicted to produce said one or more signals produced by said nucleic acid, said database comprising a plurality of known nucleotide sequences of nucleic acids that may be present in the sample, a sequence from said database being predicted to produce said one or more output signals when the sequence from said database has both (i) the same length between occurrences of effective subsequences as is represented by said one or more output signals, and (ii) the same effective subsequences as are represented by said one or more output signals, or effective subsequences that are members of the same sets of effective subsequences as are represented by said one or more output signals, whereby said one or more nucleic acids in said sample are identified or classified.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×