Method and apparatus for searching pattern in sequence data
Abstract
A method of searching a pattern in sequence data includes setting a plurality of interest pattern models each including an interest pattern length, an allowed mismatch value, and a minimum support; calculating a support of a candidate pattern that is generable within the interest pattern length based on the allowed mismatch value of at least one of the plurality of interest pattern models; and determining whether the support of the candidate pattern fulfills a condition of the minimum support of at least one of the plurality of interest pattern models.
14 Citations
29 Claims
1. A method of searching a pattern in sequence data, the method comprising:
setting a plurality of interest pattern models each comprising an interest pattern length, an allowed mismatch value, and a minimum support;
determining a candidate pattern that is within an interest pattern length of at least one of the plurality of interest pattern models;
controlling a processor to access a predetermined structure of the sequence data in a memory and calculate a support for the candidate pattern in the sequence data based on an allowed mismatch value of the at least one of the plurality of interest pattern models; and
determining whether the support fulfills a condition of the minimum support of the at least one of the plurality of interest pattern models,
wherein the determining of whether the support fulfills the condition of the minimum support of the at least one of the plurality of interest pattern models comprises determining, for each of the plurality of interest pattern models, whether a support sum of similar patterns in a set fulfilling a condition of the allowed mismatch value of the interest pattern model fulfills the condition of the minimum support of the interest pattern model.
(Dependent claims: 2, 3, 4, 5, 6, 7, 8)
9. A method of searching a pattern in sequence data, the method comprising:
setting a plurality of interest pattern models each comprising an interest pattern length, an allowed mismatch value, and a minimum support;
controlling a processor to calculate a support of a parent pattern in the sequence data through an accessing of a predetermined structure of the sequence data in a memory; and
in response to determining whether the support of the parent pattern is greater than or equal to the minimum support, calculating a support of a child pattern, of the parent pattern, based on the allowed mismatch value of the at least one interest pattern model and determining whether the support of the child pattern fulfills a condition of the minimum support of the at least one interest pattern model,
wherein the determining of whether the support of the child pattern fulfills the condition of the minimum support of the at least one interest pattern model comprises determining, for each of the at least one interest pattern model, whether a support sum of similar patterns in a set fulfilling a condition of the allowed mismatch value of the interest pattern model fulfills a condition of the minimum support of the interest pattern model.
(Dependent claims: 10, 11, 12, 13)
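The parent and child steps of claim 9 can be read as level-wise pruning: a child pattern is examined only when its parent already meets the minimum support. The Python sketch below is one illustrative reading, not the patented implementation. It assumes a single DNA string, Hamming-distance mismatches, a support defined as the number of windows within the allowed mismatch, and a k-mer count table standing in for the "predetermined structure". The names `kmer_counts`, `support`, and `grow` are hypothetical.

```python
from collections import Counter


def kmer_counts(seq, length):
    """Count every window of the given length (stand-in for the stored structure)."""
    return Counter(seq[i:i + length] for i in range(len(seq) - length + 1))


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def support(seq, pattern, allowed_mismatch):
    """Assumed definition: windows within allowed_mismatch of the pattern."""
    counts = kmer_counts(seq, len(pattern))
    return sum(n for kmer, n in counts.items()
               if hamming(kmer, pattern) <= allowed_mismatch)


def grow(seq, max_length, allowed_mismatch, min_support, alphabet="ACGT"):
    """Extend a parent by one unit only if the parent meets min_support."""
    frequent = []
    frontier = list(alphabet)
    while frontier:
        pattern = frontier.pop()
        if support(seq, pattern, allowed_mismatch) < min_support:
            continue  # under this definition a child never exceeds its parent
        frequent.append(pattern)
        if len(pattern) < max_length:
            frontier.extend(pattern + unit for unit in alphabet)
    return sorted(frequent)
```

Under the assumed definition, any window that matches a child within the allowed mismatch also matches the parent prefix within it, so skipping the children of an infrequent parent loses no frequent pattern.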
14. A method of searching a pattern in sequence data, the method comprising:
setting an interest pattern model comprising an interest pattern length, an allowed mismatch value, and a minimum support;
controlling a processor to calculate a support of a parent pattern, including accessing a predetermined structure of the sequence data in a memory; and
in response to determining whether the support of the parent pattern is greater than or equal to the minimum support, calculating a support of a child pattern of the parent pattern using mismatch values of similar patterns of the parent pattern,
wherein the calculating of the support of the child pattern comprises:
generating a first set of patterns by appending a unit pattern to similar patterns of the parent pattern having a mismatch value less than the allowed mismatch value;
generating a second set of patterns by appending a unit pattern to similar patterns of the parent pattern having a mismatch value equal to the allowed mismatch value;
calculating a support of each of the patterns in the first set and the second set; and
calculating the support of the child pattern using the support of each of the patterns in the first set and the second set.
(Dependent claims: 15, 16)
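The two-set construction in claim 14 can be illustrated with a short Python sketch. It is an interpretation under stated assumptions rather than the claimed method itself: mismatches are Hamming distances, the alphabet is DNA, each pattern's support is its exact occurrence count in one string, and the second set appends only the child's own unit, since any other unit would exceed the allowed mismatch. The function names and the `parent_similar` mapping (similar pattern to mismatch value, assumed non-empty) are hypothetical.

```python
from collections import Counter


def extend_similar(parent_similar, unit, allowed_mismatch, alphabet="ACGT"):
    """Build the first and second sets of extended similar patterns."""
    first, second = {}, {}
    for pattern, mismatch in parent_similar.items():
        if mismatch < allowed_mismatch:
            # Room for one more mismatch: any appended unit is acceptable.
            for u in alphabet:
                first[pattern + u] = mismatch + (u != unit)
        elif mismatch == allowed_mismatch:
            # Mismatch budget already spent: only the child's unit may follow.
            second[pattern + unit] = mismatch
    return first, second


def child_support(seq, parent_similar, unit, allowed_mismatch):
    """Sum the exact supports of every pattern in both sets."""
    first, second = extend_similar(parent_similar, unit, allowed_mismatch)
    length = len(next(iter(parent_similar))) + 1
    counts = Counter(seq[i:i + length] for i in range(len(seq) - length + 1))
    return sum(counts[p] for p in first) + sum(counts[p] for p in second)
```

For parent `"A"` with one allowed mismatch and unit `"C"`, the two sets together contain exactly the seven 2-mers within Hamming distance 1 of `"AC"`, so the mismatch values computed for the parent are reused instead of re-enumerating every similar pattern of the child.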
17. A method of searching a pattern in sequence data, the method comprising:
setting an interest pattern model comprising an interest pattern length, an allowed mismatch value, and a minimum support;
controlling a processor to calculate a support of a parent pattern, including accessing a predetermined structure of the sequence data in a memory;
in response to determining whether the support of the parent pattern is greater than or equal to the minimum support, calculating a support of a child pattern of the parent pattern using mismatch values and supports of similar patterns of the parent pattern; and
determining of whether the support of the child pattern fulfills the condition of the one minimum support of the at least one interest pattern model comprises determining, for each of the at least one interest pattern model, whether a support sum of similar patterns in the set fulfilling a condition of the allowed mismatch value of the interest pattern model fulfills a condition of the minimum support of the interest pattern model.
(Dependent claims: 18, 19, 20)
21. An apparatus for searching a pattern in sequence data, the apparatus comprising:
an interest pattern model setter configured to set a plurality of interest pattern models each comprising an interest pattern length, an allowed mismatch value, and a minimum support;
a support calculator comprising a processor configured to calculate a support for a candidate pattern which is within the interest pattern length of at least one of the plurality of interest pattern models, for the sequence data, through an accessing of a predetermined structure of the sequence data in a memory and based on the allowed mismatch value of at least one of the plurality of interest pattern models; and
a determiner configured to determine whether the support fulfills a condition of the minimum support of at least one of the plurality of interest pattern models,
wherein, in response to the support of the candidate pattern fulfilling the condition of the minimum support of the at least one of the plurality of interest pattern models and the candidate pattern having a length less than a maximum interest pattern length of the at least one of the plurality of interest pattern models,
the support calculator is further configured to calculate a support of a child pattern that is generable from the candidate pattern using a mismatch value of each pattern of similar patterns of the candidate pattern; and
the determiner is further configured to determine whether the child pattern fulfills conditions of the at least one of the plurality of interest pattern models of which the condition of the minimum support is fulfilled by the candidate pattern.
(Dependent claims: 22, 23, 24, 25)
26. A method of searching a pattern in sequence data, the method comprising:
setting a plurality of interest pattern models each comprising an interest pattern length, an allowed mismatch value, and a minimum support, wherein the allowed mismatch value is different in each of the plurality of interest pattern models;
generating all possible similar patterns of a candidate pattern within the interest pattern length;
controlling a processor to access a predetermined structure of the sequence data in a memory and calculate a support and a mismatch value of each of the similar patterns with respect to the sequence data; and
determining, for each of the plurality of interest pattern models, whether the candidate pattern fulfills conditions of the interest pattern model based on the support of only those similar patterns having a mismatch value less than or equal to the mismatch value of the interest pattern model,
wherein the determining comprises:
calculating a support sum of only those similar patterns having a mismatch value less than or equal to the mismatch value of the interest pattern model;
comparing the support sum with the minimum support of the interest pattern model;
in response to the support sum being greater than or equal to the minimum support of the interest pattern model, determining that the candidate pattern fulfills the conditions of the interest pattern model; and
in response to the support sum being less than the minimum support of the interest pattern model, determining that the candidate pattern does not fulfill the conditions of the interest pattern model.
(Dependent claims: 27)
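The per-model comparison recited in claim 26 can be sketched in Python as follows. This is a minimal illustration under assumptions the claim does not state: Hamming-distance mismatches, a DNA alphabet, exact occurrence counts in a single string as each similar pattern's support, and the interest pattern length treated as a maximum candidate length. `InterestPatternModel`, `similar_patterns`, and `fulfils` are hypothetical names.

```python
from collections import Counter
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class InterestPatternModel:
    length: int            # assumed to be the maximum pattern length of interest
    allowed_mismatch: int
    min_support: int


def similar_patterns(candidate, alphabet="ACGT"):
    """Map every same-length string to its Hamming distance from the candidate.

    Exhaustive enumeration, so only practical for short candidates.
    """
    result = {}
    for letters in product(alphabet, repeat=len(candidate)):
        pattern = "".join(letters)
        result[pattern] = sum(x != y for x, y in zip(pattern, candidate))
    return result


def fulfils(seq, candidate, models):
    """Return {model: bool} for each model whose length covers the candidate."""
    n = len(candidate)
    counts = Counter(seq[i:i + n] for i in range(len(seq) - n + 1))
    similar = similar_patterns(candidate)
    verdicts = {}
    for model in models:
        if n > model.length:
            continue  # candidate lies outside this model's interest length
        # Sum supports of only those similar patterns within the model's mismatch.
        support_sum = sum(counts[p] for p, mm in similar.items()
                          if mm <= model.allowed_mismatch)
        verdicts[model] = support_sum >= model.min_support
    return verdicts
```

Because the supports and mismatch values are computed once and filtered per model, the same candidate can be tested against several models without rescanning the sequence.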
28. A method of searching a pattern in sequence data, the method comprising:
setting a plurality of interest pattern models each comprising an interest pattern length, an allowed mismatch value, and a minimum support;
determining a candidate pattern, which is within an interest pattern length of at least one of the plurality of interest pattern models, and determining similar patterns of the candidate pattern;
controlling a processor to access a predetermined structure of the sequence data, respectively calculating support of each of the similar patterns in the sequence data, based on a minimum allowed mismatch value of the plurality of interest pattern models, and to calculate a sum of the respectively calculated supports of the similar patterns; and
determining whether the candidate pattern fulfills conditions of all of the plurality of interest pattern models based at least on the calculated sum,
wherein the calculating of the sum of the respectively determined supports of the similar patterns comprises:
calculating the sum as a first sum of respectively determined supports of similar patterns corresponding to a first interest pattern model having the minimum allowed mismatch value of the mismatch values of the plurality of interest pattern models; and
calculating a second sum of the first sum and respectively determined supports of similar patterns of the candidate pattern corresponding to a mismatch value, greater than the minimum mismatch value, of a second interest pattern model of the plurality of interest pattern models.
(Dependent claims: 29)
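The first-sum and second-sum language of claim 28 suggests a running total: the sum for the smallest allowed mismatch is computed once, and each larger allowed mismatch adds only the supports of the similar patterns not yet counted. The Python sketch below shows that idea under assumptions of its own. The inputs are precomputed dictionaries of support and mismatch value per similar pattern, and models are reduced to `(allowed_mismatch, min_support)` pairs. Both function names are hypothetical.

```python
from collections import Counter


def cumulative_support_sums(supports, mismatches, allowed_values):
    """Return {allowed_mismatch: support sum}, reusing each smaller sum."""
    per_mismatch = Counter()
    for pattern, sup in supports.items():
        per_mismatch[mismatches[pattern]] += sup

    sums, running, previous = {}, 0, -1
    for k in sorted(set(allowed_values)):
        # Add only the mismatch levels not covered by the previous sum.
        running += sum(per_mismatch[m] for m in range(previous + 1, k + 1))
        sums[k] = running
        previous = k
    return sums


def fulfils_all(models, sums):
    """True when every (allowed_mismatch, min_support) pair is satisfied."""
    return all(sums[k] >= min_support for k, min_support in models)
```

With supports `{"AG": 0, "AC": 2, "CG": 2, "GT": 2, "TA": 1}` and mismatch values `{"AG": 0, "AC": 1, "CG": 1, "GT": 2, "TA": 2}`, the allowed values `[1, 2]` give a first sum of 4 and a second sum of 4 + 3 = 7.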
Specification