Probe designing method and information processing apparatus
First Claim
1. A method of determining and outputting a probe candidate that is utilized for designing a base sequence to be used as a probe which is hybridized with a nucleic acid fragment to perform analysis, comprising:
- a generation step of generating a tree in which a plurality of partial base sequences obtained on the basis of a plurality of target base sequences are arranged on nodes, said target base sequences including a base sequence to be examined;
an extraction step of extracting a partial base sequence indicated by nodes present on a path from one of the nodes to a root node on the tree, the partial base sequence being a portion of the base sequence to be examined;
an evaluation step of calculating specificity of the extracted partial base sequence among the plurality of target sequences, evaluating suitability as a probe of the extracted base sequence based on the calculated specificity and obtaining an evaluation result thereof;
a determination step of determining a partial base sequence as a probe candidate that is utilized for designing a base sequence to be used as a probe which is hybridized with a nucleic acid fragment to perform analysis on the basis of the evaluation result in the evaluation step; and
wherein the evaluation step comprises;
calculating the entropy of each node present on the path as the specificity on the basis of the number of times of appearance of a partial base sequence corresponding to the node in the target base sequences;
introducing an evaluation function which multiplies a change in the entropy between nodes by a weight which reduces in response to the increase of distance between the center of a partial base sequence and a node; and
obtaining an evaluation result based on a calculation result of the evaluation function.
1 Assignment
0 Petitions
Accused Products
Abstract
In automatically designing a base sequence to be used as a probe which is hybridized with an unknown nucleic acid fragment to perform gene analysis, a discrimination tree in which a plurality of partial base sequences obtained from target base sequence data are arranged on nodes is generated. The suitability as a probe of a partial base sequence represented by a desired node is evaluated. A plurality of partial base sequences are grouped on the basis of specificity, and an appropriate group combination is selected. A partial base sequence to be used as a probe is determined, from partial base sequences belonging to the selected groups, on the basis of the evaluation result. In this manner, a probe appropriate for analysis can be automatically selected in accordance with a target base sequence to be analyzed. This effectively supports probe designing.
-
Citations
21 Claims
-
1. A method of determining and outputting a probe candidate that is utilized for designing a base sequence to be used as a probe which is hybridized with a nucleic acid fragment to perform analysis, comprising:
-
a generation step of generating a tree in which a plurality of partial base sequences obtained on the basis of a plurality of target base sequences are arranged on nodes, said target base sequences including a base sequence to be examined; an extraction step of extracting a partial base sequence indicated by nodes present on a path from one of the nodes to a root node on the tree, the partial base sequence being a portion of the base sequence to be examined; an evaluation step of calculating specificity of the extracted partial base sequence among the plurality of target sequences, evaluating suitability as a probe of the extracted base sequence based on the calculated specificity and obtaining an evaluation result thereof; a determination step of determining a partial base sequence as a probe candidate that is utilized for designing a base sequence to be used as a probe which is hybridized with a nucleic acid fragment to perform analysis on the basis of the evaluation result in the evaluation step; and wherein the evaluation step comprises; calculating the entropy of each node present on the path as the specificity on the basis of the number of times of appearance of a partial base sequence corresponding to the node in the target base sequences; introducing an evaluation function which multiplies a change in the entropy between nodes by a weight which reduces in response to the increase of distance between the center of a partial base sequence and a node; and obtaining an evaluation result based on a calculation result of the evaluation function. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 21)
-
-
19. An information processing apparatus for performing the method of determining a probe candidate that is utilized for designing a base sequence to be used as a probe which is hybridized with a nucleic acid fragment to perform analysis, said apparatus comprising:
-
generation means for generating a tree in which a plurality of partial base sequences obtained on the basis of a plurality of target base sequences are arranged on nodes, said target base sequences including a base sequence to be examined; extraction means for extracting a partial base sequence indicated by nodes present on a path from one of the nodes to a root node on the tree, the partial base sequence being a portion of the base sequence to be examined; evaluation means for calculating specificity of the extracted partial base sequence among the plurality of target sequences, evaluating suitability as a probe of the extracted base sequence based on the calculated specificity and obtaining an evaluation result thereof; determination means for determining a partial base sequence as a probe candidate that is utilized for designing a base sequence to be used as a probe which is hybridized with a nucleic acid fragment to perform analysis on the basis of the evaluation result in said evaluation means; and wherein said evaluation means calculates the entropy of each node present on the path as the specificity on the basis of the number of times of appearance of a partial base sequence corresponding to the node in the target base sequences, introduces an evaluation function which multiplies a change in the entropy between nodes by a weight which reduces in response to the increase of distance between the center of a partial base sequence and a node, and obtains an evaluation result based on a calculation result of the evaluation function.
-
Specification