Method of generating an optimized, diverse population of variants
First Claim
1. A computer program product comprising a non-transitory computer useable medium having a computer readable medium having program instructions for selecting an optimized, diverse set of molecular variants from a plurality of molecular variants, wherein the program instructions comprise:
- (a) code for receiving an objective data set representing two or more properties of each molecular variant in a plurality of molecular variants;
(b) code for generating a first Pareto front membership of all molecular variants based on the one or more objectives for optimization;
(c) code for setting optimization parameters, wherein the optimization parameters comprise;
(i) number nvar of molecular variants to create;
(ii) a molecular variant population size popSize;
(iii) a crossover rate crossrate;
(iv) a mutation rate mutrate;
(v) a fitness function comprising a penalty fitness function and an overall fitness function, wherein the penalty fitness is based on niche count and the overall fitness function is based on location in a descending Pareto front divided by the shared number of molecular variants within each front; and
(vi) specific molecular variants to be included in the nvar of molecular variants; and
(d) code for identifying a search space of acceptable molecular variants;
(e) code for generating a random population of genomes from the search space of acceptable molecular variants by applying a selection operator; and
(f) code for applying a set of operators comprising a crossover operator, a mutation operator, a fitness operator, and a repair operator on the random plurality of genomes to select the nvar set of optimized, diverse molecular variants.
2 Assignments
0 Petitions
Accused Products
Abstract
The present disclosure relates to methods of rapidly and efficiently searching biologically-related data space to identify a population set maximally diverse and optimized for sets of desired properties. More specifically, the disclosure provides methods of identifying a diverse, evolutionary separated bio-molecules with desired properties from complex bio-molecule libraries. The disclosure additionally provides digital systems and software for performing these methods.
58 Citations
4 Claims
-
1. A computer program product comprising a non-transitory computer useable medium having a computer readable medium having program instructions for selecting an optimized, diverse set of molecular variants from a plurality of molecular variants, wherein the program instructions comprise:
-
(a) code for receiving an objective data set representing two or more properties of each molecular variant in a plurality of molecular variants; (b) code for generating a first Pareto front membership of all molecular variants based on the one or more objectives for optimization; (c) code for setting optimization parameters, wherein the optimization parameters comprise; (i) number nvar of molecular variants to create; (ii) a molecular variant population size popSize; (iii) a crossover rate crossrate; (iv) a mutation rate mutrate; (v) a fitness function comprising a penalty fitness function and an overall fitness function, wherein the penalty fitness is based on niche count and the overall fitness function is based on location in a descending Pareto front divided by the shared number of molecular variants within each front; and (vi) specific molecular variants to be included in the nvar of molecular variants; and (d) code for identifying a search space of acceptable molecular variants; (e) code for generating a random population of genomes from the search space of acceptable molecular variants by applying a selection operator; and (f) code for applying a set of operators comprising a crossover operator, a mutation operator, a fitness operator, and a repair operator on the random plurality of genomes to select the nvar set of optimized, diverse molecular variants. - View Dependent Claims (2)
-
-
3. A computer program product comprising a non-transitory computer useable medium a computer readable medium having program instructions for selecting an optimized, diverse set of molecular variants from a plurality of molecular variants, wherein the program instructions comprise:
-
(a) code for receiving an objective data set representing two or more properties of each molecular variant in a plurality of molecular variants; (b) code for generating a first Pareto front membership of all molecular variants based on the one or more objectives for optimization; (c) code for setting optimization parameters, wherein the optimization parameters comprise; (i) number nvar of molecular variants to create; (ii) a molecular variant population size popSize; (iii) a crossover rate crossrate; (iv) number of generations to create nGen; (v) a mutation rate mutrate; (vi) a fitness function comprising a penalty fitness function and an overall fitness function, wherein the penalty fitness is based on niche count and the overall fitness function is based on the location of the molecular variant in a descending Pareto front divided by the shared number of molecular variants within each front; and (vii) specific molecular variants to be included in the nvar of molecular variants; and (d) code for identifying a search space of acceptable molecular variants; (e) code for generating a random population of genomes from the search space of acceptable molecular variants; (f) code for selecting a first set of genomes of size popSize from the random population, wherein each genome consisting of nvar molecular variants is created by applying a selection operator, a crossover operator, a mutation operator, a repair operator, and a fitness operator to the random population of genomes; and (g) code for returning the genome with the highest fitness as the final, optimized, diverse nvar set of molecular variants. - View Dependent Claims (4)
-
Specification