Systems and methods for engineering nucleic acid constructs using scoring techniques
First Claim
1. A method of defining an engineered nucleic acid construct for integration into a genomic locus L of a target organism or a host cell, the method comprising:
- (A) receiving a plurality of nucleic acid requests {NR1, . . . , NRn}, wherein n is a positive integer greater than 1, each nucleic acid request NRi in {NR1, . . . , NRn} specifying a genetic change to L;
(B) expanding each NRi in {NR1, . . . , NRn} into a corresponding component polynucleotide, thereby forming a plurality of component polynucleotides;
(C) arranging the plurality of component polynucleotides into a contiguous arrangement ARi, wherein the arranging uses linker nucleic acid sequences from a predetermined library of linker nucleic acid sequences to combine component polynucleotides in the plurality of component polynucleotides into the contiguous arrangement ARi;
(D) repeating the arranging (C) until a set of {AR1, . . . , ARm} contiguous arrangements are formed, wherein m is a positive integer greater than 1, the set of {AR1, . . . , ARm} contiguous arrangements representing a plurality of different contiguous arrangements of the component polynucleotides in the plurality of component polynucleotides;
(E) determining a score Si for each respective contiguous arrangement ARi in {AR1, . . . , ARm}, wherein, for each respective contiguous arrangement ARi, a contribution to the score Si is made when one or more source constructs are identified as being physically present in a freezer store, wherein each of the one or more physically present source constructs encodes one or more of the component polynucleotides, and wherein a 3′
or 5′
terminus, or both the 3′ and
5′
termini, of each respective component polynucleotide in the one or more component polynucleotides encoded by the one or more physically present source constructs is bound to a corresponding linker nucleic acid sequence that was used for the corresponding component polynucleotide in the arranging (C) to form the ARi;
(F) selecting a final contiguous arrangement ARf in {AR1, . . . , ARm} having a score Si that meets a selection criterion as an optimal contiguous arrangement; and
(G) calculating, in response to selection of the ARf, one or more primer pairs based upon the ARf, wherein each primer pair in the one or more primer pairs is capable of amplifying a portion of the ARf not represented in any identified one or more physically present source constructs identified for the ARf, wherein the portions of the contiguous arrangement ARf amplified by the one or more primer pairs and the one or more component polynucleotides in any identified one or more physically present source constructs identified for the ARf, in the order specified in the ARf, collectively define the engineered nucleic acid construct,wherein at least one of the receiving (A), expanding (B), arranging (C), repeating (D), determining (E), selecting (F), and calculating (G) is performed using one or more suitably programmed computers.
20 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are provided for defining a nucleic acid construct for integration at locus L of an organism. Nucleic acid requests are received, each such request specifying a genetic change to L. The request are expanded into component polynucleotides which are then arranged into {AR1, . . . , ARm} different arrangements, each ARi in {AR1, . . . , ARm} defining a different arrangement of the component polynucleotides. A score Si for each ARi in {AR1, . . . , ARm} is determined based on whether source constructs encoding a portion ofARi are physically present. An ARf in {AR1, . . . , ARm} is selected based on the score for ARf. Primer pairs are calculated to amplify the portions of ARf not represented in the source constructs. The portions of ARf amplified by the primer pairs and the portions of ARf in the source constructs, ordered by ARf, define the nucleic acid construct.
-
Citations
30 Claims
-
1. A method of defining an engineered nucleic acid construct for integration into a genomic locus L of a target organism or a host cell, the method comprising:
-
(A) receiving a plurality of nucleic acid requests {NR1, . . . , NRn}, wherein n is a positive integer greater than 1, each nucleic acid request NRi in {NR1, . . . , NRn} specifying a genetic change to L; (B) expanding each NRi in {NR1, . . . , NRn} into a corresponding component polynucleotide, thereby forming a plurality of component polynucleotides; (C) arranging the plurality of component polynucleotides into a contiguous arrangement ARi, wherein the arranging uses linker nucleic acid sequences from a predetermined library of linker nucleic acid sequences to combine component polynucleotides in the plurality of component polynucleotides into the contiguous arrangement ARi; (D) repeating the arranging (C) until a set of {AR1, . . . , ARm} contiguous arrangements are formed, wherein m is a positive integer greater than 1, the set of {AR1, . . . , ARm} contiguous arrangements representing a plurality of different contiguous arrangements of the component polynucleotides in the plurality of component polynucleotides; (E) determining a score Si for each respective contiguous arrangement ARi in {AR1, . . . , ARm}, wherein, for each respective contiguous arrangement ARi, a contribution to the score Si is made when one or more source constructs are identified as being physically present in a freezer store, wherein each of the one or more physically present source constructs encodes one or more of the component polynucleotides, and wherein a 3′
or 5′
terminus, or both the 3′ and
5′
termini, of each respective component polynucleotide in the one or more component polynucleotides encoded by the one or more physically present source constructs is bound to a corresponding linker nucleic acid sequence that was used for the corresponding component polynucleotide in the arranging (C) to form the ARi;(F) selecting a final contiguous arrangement ARf in {AR1, . . . , ARm} having a score Si that meets a selection criterion as an optimal contiguous arrangement; and (G) calculating, in response to selection of the ARf, one or more primer pairs based upon the ARf, wherein each primer pair in the one or more primer pairs is capable of amplifying a portion of the ARf not represented in any identified one or more physically present source constructs identified for the ARf, wherein the portions of the contiguous arrangement ARf amplified by the one or more primer pairs and the one or more component polynucleotides in any identified one or more physically present source constructs identified for the ARf, in the order specified in the ARf, collectively define the engineered nucleic acid construct, wherein at least one of the receiving (A), expanding (B), arranging (C), repeating (D), determining (E), selecting (F), and calculating (G) is performed using one or more suitably programmed computers. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. An apparatus comprising one or more memories and one or more processors, wherein the one or more memories and the one or more processors are in electronic communication with each other, the one or more memories tangibly encoding a set of instructions for defining an engineered nucleic acid construct for integration into a genomic locus L of a target organism or a host cell using the one or more processors, the set of instructions comprising:
-
(A) instructions for receiving a plurality of nucleic acid requests {NR1, . . . , NRn}, wherein n is a positive integer greater than 1, each nucleic acid request NRi in {NR1, . . . , NRn} specifying a genetic change to L; (B) instructions for expanding each NRi in {NR1, . . . , NRn} into a corresponding component polynucleotide, thereby forming a plurality of component polynucleotides; (C) instructions for arranging the plurality of component polynucleotides into a contiguous arrangement ARi, wherein the arranging (C) uses linker nucleic acid sequences from a predetermined library of linker nucleic acid sequences to combine component polynucleotides in the plurality of component polynucleotides into a contiguous arrangement ARi; (D) instructions for repeating the instructions for arranging (C) until a set of {AR1, . . . , ARm} contiguous arrangements are formed, wherein m is a positive integer greater than 1, the set of {AR1, . . . , ARm} contiguous arrangements representing a plurality of different contiguous arrangements of the component polynucleotides in the plurality of component polynucleotides; (E) instructions for determining a score Si for each respective contiguous arrangement ARi in {AR1, . . . , ARm}, wherein, for each respective contiguous arrangement ARi, a contribution to the score Si is made when one or more source constructs are identified as being physically present in a freezer store, wherein each of the one or more physically present source constructs encodes one or more of the component polynucleotides, and wherein a 3′
or 5′
terminus, or both the 3′ and
5′
termini, of each respective component polynucleotide in the one or more component polynucleotides encoded by the one or more physically present source constructs is bound to a corresponding linker that was used for the corresponding component polynucleotide in the arranging (C) to form ARi;(F) instructions for selecting a final contiguous arrangement ARf in {AR1, . . . , ARm} having a score Si that meets a selection criterion as an optimal contiguous arrangement; and (G) instructions for calculating, in response to completion of the instructions for selecting, one or more primer pairs based upon the final ARf, wherein each primer pair in the one or more primer pairs is capable of amplifying a portion of the ARf not represented in any identified one or more physically present source constructs identified for the ARf, wherein the portions of the contiguous arrangement amplified by the one or more primer pairs and the one or more component polynucleotides in any identified one or more physically present source constructs identified for ARf, in the order specified in the contiguous arrangement ARf, collectively define the engineered nucleic acid construct.
-
-
29. A method of defining an engineered nucleic acid construct for integration into a genomic locus L of a target organism or a host cell, the method comprising:
-
(A) receiving a plurality of nucleic acid requests {NR1, . . . , NRn}, wherein n is a positive integer greater than 1, each nucleic acid request NRi in {NR1, . . . , NRn} specifying a genetic change to L; (B) expanding each NRi in {NR1, . . . , NRn} into a corresponding component polynucleotide, thereby forming a plurality of component polynucleotides; (C) arranging the plurality of component polynucleotides into a contiguous arrangement ARi, wherein the arranging (C) uses linker nucleic acid sequences from a predetermined library of linker nucleic acid sequences to combine component polynucleotides in the plurality of component polynucleotides into the ARi; (D) selecting, in response to the arranging, one or more source constructs from a plurality of source constructs physically present in a freezer store, wherein each of the one or more physically present source constructs encode one or more of the component polynucleotides, and wherein a 3′
or 5′
terminus, or both the 3′ and
5′
termini, of each respective component polynucleotide in the one or more component polynucleotides encoded by the one or more physically present source constructs is bound to a corresponding linker that was used for the corresponding component polynucleotide in the arranging (C) to form the ARi; and(E) calculating one or more primer pairs based upon ARi, wherein each primer pair is capable of amplifying a portion of ARi not represented in the one or more physically present source constructs identified for ARi, wherein the portions of the ARi amplified by the one or more primer pairs and the one or more component polynucleotides in the one or more physically present source constructs identified for the ARi, in the order specified by the ARi, collectively define the engineered nucleic acid construct, wherein at least one of the expanding (B), arranging (C), selecting (D), and calculating (E) is performed using one or more suitably programmed computers.
-
-
30. A non-transitory computer readable storage medium storing one or more programs configured for execution by one or more processors of a system, the one or more programs for defining an engineered nucleic acid construct for integration into a genomic locus L of a target organism or a host cell, the one or more programs comprising:
-
(A) instructions for receiving a plurality of nucleic acid requests {NR1, . . . , NRn}, wherein n is a positive integer greater than 1, each nucleic acid request NRi in {NR1, . . . , NRn} specifying a genetic change to L; (B) instructions for expanding each NRi in {NR1, . . . , NRn} into a corresponding component polynucleotide, thereby forming a plurality of component polynucleotides; (C) instructions for arranging the plurality of component polynucleotides into a contiguous arrangement ARi, wherein the arranging (C) uses linker nucleic acid sequences from a predetermined library of linker nucleic acid sequences to combine component polynucleotides in the plurality of component polynucleotides into a contiguous arrangement ARi; (D) instructions for repeating the instructions for arranging (C) until a set of {AR1, . . . , ARm} contiguous arrangements are formed, wherein m is a positive integer greater than 1, the set of {AR1, . . . , ARm} contiguous arrangements representing a plurality of different contiguous arrangements of the component polynucleotides in the plurality of component polynucleotides; (E) instructions for determining a score Si for each respective contiguous arrangement ARi in {AR1, . . . , ARm}, wherein, for each respective contiguous arrangement ARi, a contribution to the score Si is made when one or more source constructs are identified as being physically present in a freezer store, wherein each of the one or more physically present source constructs encodes one or more of the component polynucleotides, and wherein a 3′
or 5′
terminus, or both the 3′ and
5′
termini, of each respective component polynucleotide in the one or more component polynucleotides encoded by the one or more physically present source constructs is bound to a corresponding linker that was used for the corresponding component polynucleotide in the arranging (C) to form ARi;(F) instructions for selecting a final contiguous arrangement ARf in {AR1, . . . , ARm} having a score Si that meets a selection criterion as an optimal contiguous arrangement; and (G) instructions for calculating, in response to completion of the instructions for selecting, one or more primer pairs based upon the final ARf, wherein each primer pair in the one or more primer pairs is capable of amplifying a portion of the ARf not represented in any identified one or more physically present source constructs identified for the ARf, wherein the portions of the contiguous arrangement amplified by the one or more primer pairs and the one or more component polynucleotides in any identified one or more physically present source constructs identified for ARf, in the order specified in the contiguous arrangement ARf, collectively define the engineered nucleic acid construct.
-
Specification