Method for the identification of gene transcripts with improved efficiency in the treatment of errors
First Claim
Patent Images
1. A method for identification of gene transcripts comprising the steps of:
- a) generating at least a first set of raw sequences by sequencing at least a first type of biological material;
b) isolating first ditags from said at least first set of raw sequences;
c) isolating first tags from said isolated at least first ditags;
d) determining abundance of said first tags; and
e) identifying said first tags, further comprising a step off) rejecting said isolated first tags that are wrongly sequenced by means of a statistical model for sequencing errors to be applied to said isolated first tags, said statistical model being defined by a probability function F(a,b), wherein said function F is intended to modelize the probability that a given tag a can be sequenced as b.
3 Assignments
0 Petitions
Accused Products
Abstract
A method for identification of gene transcripts comprises the steps of: generating a set of raw sequences by sequencing of biological material; isolating ditags from the set of raw sequences; isolating tags from the ditags; determining abundance of the tags; and identifying the tags, the method providing a step of reducing the amount of sequencing errors by using a statistical model for sequencing errors.
-
Citations
29 Claims
-
1. A method for identification of gene transcripts comprising the steps of:
-
a) generating at least a first set of raw sequences by sequencing at least a first type of biological material; b) isolating first ditags from said at least first set of raw sequences; c) isolating first tags from said isolated at least first ditags; d) determining abundance of said first tags; and e) identifying said first tags, further comprising a step of f) rejecting said isolated first tags that are wrongly sequenced by means of a statistical model for sequencing errors to be applied to said isolated first tags, said statistical model being defined by a probability function F(a,b), wherein said function F is intended to modelize the probability that a given tag a can be sequenced as b. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29)
-
Specification