SEQUENCE IDENTIFICATION AND ANALYSIS
First Claim
1. A method for identifying a sequence of interest in a data series, comprising:
- generating a data structure that stores characteristics about a plurality of sequences present in the data series;
identifying one or more sequences based upon the contents of the data structure;
evaluating one or more exit criteria to determine if an exit point has been reached;
updating the data structure to reflect the most recent identification of a sequence and repeating the identification process if the exit criteria are not met; and
terminating if the exit criteria are met.
3 Assignments
0 Petitions
Accused Products
Abstract
The present technique provides for the analysis of a data series to identify sequences of interest within the series. Specifically, in accordance with one embodiment of the present technique, a method is provided comprising generating a data structure that stores characteristics about a plurality of sequences present in a data series. One or more sequences are identified based upon the contents of the data structure. In accordance with other aspects of the invention, more than one heuristic is calculated for each sequence under review. The plurality of heuristics associated with each sequence are evaluated to identify a sequence of interest.
-
Citations
30 Claims
-
1. A method for identifying a sequence of interest in a data series, comprising:
-
generating a data structure that stores characteristics about a plurality of sequences present in the data series; identifying one or more sequences based upon the contents of the data structure; evaluating one or more exit criteria to determine if an exit point has been reached; updating the data structure to reflect the most recent identification of a sequence and repeating the identification process if the exit criteria are not met; and terminating if the exit criteria are met. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method for identifying a sequence of interest, comprising:
-
evaluating a plurality of heuristics for a candidate sequence from a plurality of candidate sequences present in a data series; identifying a sequence in the plurality of candidate sequences based on the evaluation of the heuristics; evaluating one or more exit criteria to determine if an exit point has been reached; updating the data series or a corresponding data structure to reflect the most recent identification of a sequence and repeating the steps of evaluating the plurality of heuristics and identifing the sequences if the exit criteria are not met; and terminating if the exit criteria are met. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. A method for identifying a biological sequence of interest, comprising:
-
generating a data structure that stores characteristics about potential biological sequences of interest present in a biological polymer; identifying one or more biological sequences based upon the contents of the data structure; evaluating one or more exit criteria to determine if an exit point has been reached; updating the data structure to reflect the most recent identification of a biological sequence and repeating the identification process if the exit criteria are not met; and terminating if the exit criteria are met. - View Dependent Claims (15, 16, 17, 18)
-
-
19. One or more tangible, machine-readable media, comprising:
-
code adapted to generate a data structure that stores characteristics about a plurality of sequences present in a data series; code adapted to identify one or more sequences based upon the contents of the data structure; code adapted to evaluate one or more exit criteria to determine if an exit point has been reached; code adapted to update the data structure to reflect the most recent identification of a sequence and repeating the identification process if the exit criteria are not met; and code adapted to terminate if the exit criteria are met. - View Dependent Claims (20, 21, 22, 23)
-
-
24. One or more tangible, machine-readable media, comprising:
-
code adapted to evaluate a plurality of heuristics for a candidate sequence from a plurality of candidate sequences present in a data series; code adapted to identify a sequence in the plurality of candidate sequences based on the evaluation of the heuristics; code adapted to evaluate one or more exit criteria to determine if an exit point has been reached; code adapted to update the data series or a corresponding data structure to reflect the most recent identification of a sequence and repeating the steps of evaluating the plurality of heuristics and identifing the sequences if the exit criteria are not met; and code adapted to terminate if the exit criteria are met. - View Dependent Claims (25, 26, 27, 28, 29, 30)
-
Specification