GENOMIC CLASSIFICATION OF MALIGNANT MELANOMA BASED ON PATTERNS OF GENE COPY NUMBER ALTERATIONS

US 20100145897A1
Filed: 10/28/2009
Published: 06/10/2010
Est. Priority Date: 10/31/2008
Status: Active Grant

First Claim

Patent Images

1. A method for obtaining a database of malignant melanoma genomic subgroups, the method comprising the steps of:

(a) obtaining a plurality of m samples comprising at least one MM cell;

(b) acquiring a data set comprising copy number alteration information from at least one locus from each chromosome from each sample obtained in step (a);

(c) identifying in the data set samples contaminated by normal cells and eliminating the contaminated samples from the data set, wherein the identifying and eliminating comprises;

(1) applying a machine learning algorithm tuned to parameters that represent the differences between tumor and normal samples to the data;

(2) assigning a probability score for normal cell contamination to each sample as determined by the machine learning algorithm;

(3) eliminating data from the data set for each sample scoring 50% or greater probability of containing normal cells;

(d) estimating a number of subgroups, r, in the data set by applying an unsupervised clustering algorithm using Pearson linear dissimilarity algorithm to the data set;

(e) assigning each sample in the data set to at least one cluster using a modified genomic Non-negative Matrix Factorization (gNMF) algorithm, wherein the modified gNMF algorithm comprises;

(1) calculating divergence of the algorithm after every 100 steps of multiplicative updating using the formula;

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The invention is directed to methods and kits that allow for classification of malignant melanoma cells according to genomic profiles, and methods of diagnosing, predicting clinical outcomes, and stratifying patient populations for clinical testing and treatment using the same.

Citations

23 Claims

1. A method for obtaining a database of malignant melanoma genomic subgroups, the method comprising the steps of:
- (a) obtaining a plurality of m samples comprising at least one MM cell;
  
  (b) acquiring a data set comprising copy number alteration information from at least one locus from each chromosome from each sample obtained in step (a);
  
  (c) identifying in the data set samples contaminated by normal cells and eliminating the contaminated samples from the data set, wherein the identifying and eliminating comprises;
  
  (1) applying a machine learning algorithm tuned to parameters that represent the differences between tumor and normal samples to the data;
  
  (2) assigning a probability score for normal cell contamination to each sample as determined by the machine learning algorithm;
  
  (3) eliminating data from the data set for each sample scoring 50% or greater probability of containing normal cells;
  
  (d) estimating a number of subgroups, r, in the data set by applying an unsupervised clustering algorithm using Pearson linear dissimilarity algorithm to the data set;
  
  (e) assigning each sample in the data set to at least one cluster using a modified genomic Non-negative Matrix Factorization (gNMF) algorithm, wherein the modified gNMF algorithm comprises;
  
  (1) calculating divergence of the algorithm after every 100 steps of multiplicative updating using the formula;
- View Dependent Claims (3, 4, 5, 6, 7, 8)
- - 3. The method of claim 1 or 2, wherein the unsupervised clustering algorithm is a hierarchical clustering.
  - 4. The method of claim 1 or 2, wherein Cophenetic correlation is used to provide a final number of clusters from the data set.
  - 5. The method of claim 1 or 2, wherein Bayesian Information Criterion is used to provide a final number of clusters from the data set.
  - 6. The method of claim 1 or 2, wherein Cophenetic correlation and Bayesian Information Criterion are used to provide a final number of clusters from the data set.
  - 7. The method of claim 1 or 2, wherein the plurality of samples, m, comprises a first, second, third, fourth, fifth and sixth cell lines, whereinthe first cell line is selected from the group consisting of SKMEL119, HS944, WM1366, and WM88;
    - the second cell line is WM3248;
      
      the third cell line is 1205LU;
      
      the fourth cell line is selected from the group consisting of 451LU, SKMEL19, SKMEL28, SKMEL30, SKMEL63, WM35, WM983, and WM983C;
      
      the fifth cell line is selected from the group consisting of WM3211, M14, MEWO, SKMEL2, SKMEL5, UACC257, UACC62, WM122, WM13662, WM239A, WM32112, WM32482, WM793B, and 501MEL, andthe sixth cell line is MALME3M or WM882.
  - 8. The method of claim 1 or 2, wherein the plurality of samples, m, consists of SKMEL119, HS944, WM1366, WM88;
    - WM3248;
      
      1205LU;
      
      451LU, SKMEL19, SKMEL28, SKMEL30, SKMEL63;
      
      WM35, WM983, WM983C, WM3211, M14, MEWO, SKMEL2, SKMEL5, UACC257, UACC62, WM122, WM13662, WM239A, WM32112, WM32482, WM793B, 501MEL, MALME3M and WM882.

2. A method of classifying a MM tumor or cell line, comprising:
- (a) providing a database, developed through a method comprising;
  
  (i) obtaining a plurality of m samples comprising at least one MM tumor or cell line;
  
  (ii) acquiring a first data set comprising copy number alteration information from at least one locus from each chromosome from each sample obtained in step (i);
  
  (iii) identifying in the first data set samples contaminated by normal cells and eliminating the contaminated samples from the first data set, wherein the identifying and eliminating comprises;
  
  (1) applying a machine learning algorithm tuned to parameters that represent the differences between tumor and normal samples to the data;
  
  (2) assigning a probability score for normal cell contamination to each sample as determined by the machine learning algorithm;
  
  (3) eliminating data from the first data set for each sample scoring 50% or greater probability of containing normal cells;
  
  (iv) estimating a number of subgroups, r, in the data set by applying an unsupervised clustering algorithm using Pearson linear dissimilarity algorithm to the data set;
  
  (v) assigning each sample in the data set to at least one cluster using a modified genomic Non-negative Matrix Factorization (gNMF) algorithm, wherein the modified gNMF algorithm comprises;
  
  (1) calculating divergence of the algorithm after every 100 steps of multiplicative updating using the formula;

9. A method of classifying a therapeutic intervention for arresting or killing malignant melanoma (MM) cells, comprising:
- (a) from a panel of MM cells classified according to genomic subgroups, selecting at least one MM cell line from each subgroup, wherein the panel is assembled from a method comprising;
  
  (i) obtaining a plurality of m samples comprising MM cells;
  
  (ii) acquiring a first data set comprising copy number alteration information from at least one locus from each chromosome from each sample obtained in step (i);
  
  (iii) identifying in the first data set samples contaminated by normal cells and eliminating the contaminated samples from the first data set, wherein the identifying and eliminating comprises;
  
  (1) applying a machine learning algorithm tuned to parameters that represent the differences between tumor and normal samples to the data;
  
  (2) assigning a probability score for normal cell contamination to each sample as determined by the machine learning algorithm;
  
  (3) eliminating data from the first data set for each sample scoring 50% or greater probability of containing normal cells;
  
  (iv) estimating a number of subgroups, r, in the data set by applying an unsupervised clustering algorithm using Pearson linear dissimilarity algorithm to the data set;
  
  (v) assigning each sample in the data set to at least one cluster using a modified genomic Non-negative Matrix Factorization (gNMF) algorithm, wherein the modified gNMF algorithm comprises;
  
  (1) calculating divergence of the algorithm after every 100 steps of multiplicative updating using the formula;
- View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
- - 10. The method of claim 9, wherein the unsupervised clustering algorithm is a hierarchical clustering.
  - 11. The method of claim 9, wherein Cophenetic correlation is used to provide a final number of clusters from the data set.
  - 12. The method of claim 9, wherein Bayesian Information Criterion is used to provide a final number of clusters from the data set.
  - 13. The method of claim 9, wherein Cophenetic correlation and Bayesian Information Criterion are used to provide a final number of clusters from the data set.
  - 14. The method of claim 9, wherein the MM cells are from a cell line.
  - 15. The method of claim 9, wherein the plurality of samples, m, comprises a first, second, third, fourth, fifth and sixth cell line, whereinthe first cell line is selected from the group consisting of SKMEL119, HS944, WM1366, and WM88;
    - the second cell line is WM3248;
      
      the third cell line is 1205LU;
      
      the fourth cell line is selected from the group consisting of 451LU, SKMEL19, SKMEL28, SKMEL30, SKMEL63, WM35, WM983, and WM983C;
      
      the fifth cell line is selected from the group consisting of WM3211, M14, MEWO, SKMEL2, SKMEL5, UACC257, UACC62, WM122, WM13662, WM239A, WM32112, WM32482, WM793B, and 501MEL, andthe sixth cell lines is MALME3M or WM882.
  - 16. The method of claim 9, wherein the plurality of samples, m, consists of SKMEL119, HS944, WM1366, WM88;
    - WM3248;
      
      1205LU;
      
      451LU, SKMEL19, SKMEL28, SKMEL30, SKMEL63;
      
      WM35, WM983, WM983C, WM3211, M14, MEWO, SKMEL2, SKMEL5, UACC257, UACC62, WM122, WM13662, WM239A, WM32112, WM32482, WM793B, 501MEL, MALME3M and WM882.
  - 17. The method of claim 9, wherein the therapeutic intervention comprises chemotherapy, biological response modifiers, vaccine immunotherapy, or biochemotherapy.
  - 18. The method of claim 17, wherein the therapeutic intervention is a biological response modifier, and the biological response modifier comprises administering at least one pharmaceutical composition comprising an active agent selected from the group consisting of interferon, interleukin-2, monoclonal antibodies, and tumor necrosis factor-alpha.
  - 19. The method of claim 18, wherein the biological response modifier comprises administering two or more active agents.

20. A method of assembling a probe panel for classifying a MM cell from a sample, comprising:
- (a) assembling a database, comprising;
  
  (i) obtaining a plurality of m samples comprising at least one MM cell;
  
  (ii) acquiring a first data set comprising copy number alteration information from at least one locus from each chromosome from each sample obtained in step (i);
  
  (iii) identifying in the first data set samples contaminated by normal cells and eliminating the contaminated samples from the first data set, wherein the identifying and eliminating comprises;
  
  (1) applying a machine learning algorithm tuned to parameters that represent the differences between tumor and normal samples to the data;
  
  (2) assigning a probability score for normal cell contamination to each sample as determined by the machine learning algorithm;
  
  (3) eliminating data from the first data set for each sample scoring 50% or greater probability of containing normal cells;
  
  (iv) estimating a number of subgroups, r, in the data set by applying an unsupervised clustering algorithm using Pearson linear dissimilarity algorithm to the data set;
  
  (v) assigning each sample in the data set to at least one cluster using a modified genomic Non-negative Matrix Factorization (gNMF) algorithm, wherein the modified gNMF algorithm comprises;
  
  (1) calculating divergence of the algorithm after every 100 steps of multiplicative updating using the formula;
- View Dependent Claims (21, 22)
- - 21. A kit comprising the probe panel of claim 20.
  - 22. The kit of claim 21, wherein each probe is a FISH probe.

23. A kit for classifying a MM tumor sample or a cell line, comprising:
- (a) instructions to assemble a database, comprising instructions for;
  
  (i) obtaining a plurality of m samples comprising at least one MM cell;
  
  (ii) acquiring a first data set comprising copy number alteration information from at least one locus from each chromosome from each sample obtained in step (i);
  
  (iii) identifying in the first data set samples contaminated by normal cells and eliminating the contaminated samples from the first data set, wherein the identifying and eliminating comprises;
  
  (1) applying a machine learning algorithm tuned to parameters that represent the differences between tumor and normal samples to the data;
  
  (2) assigning a probability score for normal cell contamination to each sample as determined by the machine learning algorithm;
  
  (3) eliminating data from the first data set for each sample scoring 50% or greater probability of containing normal cells;
  
  (iv) estimating a number of subgroups, r, in the data set by applying an unsupervised clustering algorithm using Pearson linear dissimilarity algorithm to the data set;
  
  (v) assigning each sample in the data set to at least one cluster using a modified genomic Non-negative Matrix Factorization (gNMF) algorithm, wherein the modified gNMF algorithm comprises;
  
  (1) calculating divergence of the algorithm after every 100 steps of multiplicative updating using the formula;

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Abbvie Incorporated
Original Assignee
Abbott Laboratories
Inventors
Zhang, Ke, Lu, Xin, Semizarov, Dimitri, Lesniewski, Rick R.

Granted Patent

US 8,498,821 B2
Time in Patent Office

Days
Field of Search
US Class Current

706/13
CPC Class Codes

C12Q 1/6886   for cancer immunoassay for ...

C12Q 2600/106   Pharmacogenomics, i.e. gene...

C12Q 2600/112   Disease subtyping, staging ...

C12Q 2600/156   Polymorphic or mutational m...

G16B 20/00   ICT specially adapted for f...

G16B 20/10   Ploidy or copy number detec...

G16B 20/20   Allele or variant detection...

G16B 40/00   ICT specially adapted for b...

G16B 40/30   Unsupervised data analysis

G16H 20/10   relating to drugs or medica...

G16H 50/20   for computer-aided diagnosi...

Y02A 90/10   Information and communicati...

GENOMIC CLASSIFICATION OF MALIGNANT MELANOMA BASED ON PATTERNS OF GENE COPY NUMBER ALTERATIONS

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

Citations

23 Claims

Specification

Solutions

Use Cases

Quick Links

GENOMIC CLASSIFICATION OF MALIGNANT MELANOMA BASED ON PATTERNS OF GENE COPY NUMBER ALTERATIONS

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

23 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links