METHODS FOR CELL LABEL CLASSIFICATION
First Claim
1. A method for identifying a signal cell label, comprising:
- (a) barcoding a plurality of targets in a plurality of cells using a plurality of barcodes to create a plurality of barcoded targets, wherein each of the plurality of barcodes comprises a cell label and a molecular label;
(b) obtaining sequencing data of the plurality of barcoded targets;
(c) determining the number of molecular labels with distinct sequences associated with each of the cell labels of the plurality of barcodes;
(d) determining a rank of each of the cell labels of the plurality of barcodes based on the number of molecular labels with distinct sequences associated with each of the cell labels;
(e) generating a cumulative sum plot based on the number of molecular labels with distinct sequences associated with each of the cell labels determined in (c) and the rank of each of the cell labels determined in (d);
(f) generating a second derivative plot of the cumulative sum plot;
(g) determining a minimum of the second derivative plot of the cumulative sum plot, wherein the minimum of the second derivative plot corresponds to a cell label threshold; and
(h) identifying each of the cell labels as a signal cell label or a noise cell label based on the number of molecular labels with distinct sequences associated with each of the cell labels determined in (c) and the cell label threshold determined in (g).
2 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are methods and systems for classifying cell labels, for example identifying a signal cell label. In some embodiments, the method comprises: obtaining sequencing data of barcoded targets created using targets in cells barcoded using barcodes, wherein a barcode comprises a cell label and a molecular label. After ranking the cell labels, a minimum of a second derivative plot of a cumulative sum plot can be determined. Using the methods, a cell label can be classified as a signal cell label or a noise cell label based on the number of molecular labels with distinct sequences associated with the cell label and a cell label threshold.
-
Citations
97 Claims
-
1. A method for identifying a signal cell label, comprising:
-
(a) barcoding a plurality of targets in a plurality of cells using a plurality of barcodes to create a plurality of barcoded targets, wherein each of the plurality of barcodes comprises a cell label and a molecular label; (b) obtaining sequencing data of the plurality of barcoded targets; (c) determining the number of molecular labels with distinct sequences associated with each of the cell labels of the plurality of barcodes; (d) determining a rank of each of the cell labels of the plurality of barcodes based on the number of molecular labels with distinct sequences associated with each of the cell labels; (e) generating a cumulative sum plot based on the number of molecular labels with distinct sequences associated with each of the cell labels determined in (c) and the rank of each of the cell labels determined in (d); (f) generating a second derivative plot of the cumulative sum plot; (g) determining a minimum of the second derivative plot of the cumulative sum plot, wherein the minimum of the second derivative plot corresponds to a cell label threshold; and (h) identifying each of the cell labels as a signal cell label or a noise cell label based on the number of molecular labels with distinct sequences associated with each of the cell labels determined in (c) and the cell label threshold determined in (g). - View Dependent Claims (2, 3)
-
-
4-19. -19. (canceled)
-
20. A method for determining a signal cell label, comprising:
-
(a) obtaining sequencing data of a plurality of barcoded targets, wherein the plurality of barcoded targets is created from a plurality of targets in a plurality of cells that are barcoded using a plurality of barcodes, and wherein each of the plurality of barcodes comprises a cell label and a molecular label; (b) determining a rank of each of the cell labels of the plurality of barcodes based on the number of molecular labels with distinct sequences associated with each of the cell labels of the plurality of barcodes; (c) determining a cell label threshold based on the number of molecular labels with distinct sequences associated with each of the cell labels and the rank of each of the cell labels of the plurality of barcodes determined in (b); and (d) identifying each of the cell labels as a signal cell label or a noise cell label based on the number of molecular labels with distinct sequences associated with each of the cell labels and the cell label threshold determined in (c). - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42)
-
-
29. (canceled)
-
30. (canceled)
-
43. A method for identifying a signal cell label, comprising:
-
(a) obtaining sequencing data of a plurality of targets of cells, wherein each target is associated with a number of molecular labels with distinct sequences associated with each cell label of a plurality of cell labels; (b) determining a cell label threshold based on the number of molecular labels with distinct sequences associated with each of the cell labels; and (c) identifying each of the cell labels as a signal cell label or a noise cell label based on the number of molecular labels with distinct sequences associated with each of the cell labels and the cell label threshold.
-
-
44-65. -65. (canceled)
-
66. A method for identifying a signal cell label, comprising:
-
(a) barcoding a plurality of targets in a plurality of cells using a plurality of barcodes to create a plurality of barcoded targets, wherein each of the plurality of barcodes comprises a cell label and a molecular label, wherein barcoded targets created from targets of different cells of the plurality of cells have different cell labels, and wherein barcoded targets created from targets of the same cell of the plurality of cells have different molecular labels; (b) obtaining sequencing data of the plurality of barcoded targets; (c) determining a feature vector of each cell label of the plurality of barcoded targets, wherein the feature vector comprise numbers of molecular labels with distinct sequences associated with the each cell label; (d) determining a cluster for the each cell label of the plurality of barcoded targets based on the feature vector; and (e) identifying the each cell label of the plurality of barcoded targets as a signal cell label or a noise cell label based on a number of cell labels in the cluster and a cluster size threshold.
-
-
67-80. -80. (canceled)
-
81. A method for identifying a signal cell label, comprising:
-
(a) obtaining sequencing data of a plurality of barcoded targets, wherein the plurality of barcoded targets is create from a plurality of targets in a plurality of cells that are barcoded using a plurality of barcodes, wherein each of the plurality of barcodes comprises a cell label and a molecular label, wherein barcoded targets created from targets of different cells of the plurality of cells have different cell labels, and wherein barcoded targets created from targets of the same cell of the plurality of cells have different molecular labels; (b) determining a feature vector of each cell label of the plurality of barcoded targets, wherein the feature vector comprise numbers of molecular labels with distinct sequences associated with the each cell label; (c) determining a cluster for the each cell label of the plurality of barcoded targets based on the feature vector; and (d) identifying the each cell label of the plurality of barcoded targets as a signal cell label or a noise cell label based on a number of cell labels in the cluster and a cluster size threshold. - View Dependent Claims (82, 83)
-
-
84-95. -95. (canceled)
-
96. A method for identifying a signal cell label, comprising:
-
(a) obtaining sequencing data of a plurality of first targets of cells, wherein each first target is associated with a number of molecular labels with distinct sequences associated with each cell label of a plurality of cell labels; (b) identifying each of the cell labels as a signal cell label or a noise cell label based on the number of molecular labels with distinct sequences associated with each of the cell labels and an identification threshold; and (c) re-identifying at least one of the plurality of cell labels as a signal cell label identified as a noise cell label in (b) or re-identifying at least one of the cell label as a noise cell label identified as a signal cell label in (b).
-
-
97-106. -106. (canceled)
Specification