phage-assisted continuous evolution

Phage-assisted continuous evolution (PACE) is a phage-based technique for the automated directed evolution of proteins. It relies on relating the desired activity of a target protein with the fitness of an infectious bacteriophage which carries the protein's corresponding gene. Proteins with greater desired activity hence confer greater infectivity to their carrier phage. More infectious phage propagate more effectively, selecting for advantageous mutations. Genetic variation is generated using error-prone polymerases on the phage vectors, and over time the protein accumulates beneficial mutations. This technique is notable for performing hundreds of rounds of selection with minimal human intervention.

Principle

The central component of PACE is a fixed-volume vessel known as the “lagoon”. The lagoon contains M13 bacteriophage vectors carrying the gene of interest (known as the selection plasmid, or SP), as well as host E. coli cells that allow the phage to replicate. The lagoon is constantly diluted via the addition and draining of liquid media containing E. coli cells. The liquid flow rate is set such that the dilution rate is faster than the rate of E. coli reproduction but slower than the rate of phage reproduction. Hence, a fresh supply of E. coli cells is constantly present in the lagoon, but phage can only be retained via sufficiently fast replication.{{cite journal |last1=Esvelt |first1=K. |last2=Carlson |first2=J. |last3=Liu |first3=D.R. |date=2011 |title=A system for the continuous directed evolution of biomolecules |journal=Nature |volume=472 |issue=7344 |pages=499–503 |doi=10.1038/nature09929|pmid=21478873 |pmc=3084352 |bibcode=2011Natur.472..499E }}

Phage replication requires E. coli infection, which, for M13 phage, relies on protein III (pIII).{{cite journal |last1=Riechmann |first1=L. |last2=Holliger |first2=P. |date=1997 |title=The C-terminal domain of TolA is the coreceptor for filamentous phage infection of E. coli. |journal=Cell |volume=90 |issue=2 |pages=351–360 |doi=10.1016/s0092-8674(00)80342-6|pmid=9244308 |doi-access=free }} When using PACE, the phage vectors lack the gene to produce pIII. Instead, the production of pIII is tied with the activity of the protein of interest via a mechanism that varies per use case, oftentimes involves an extra plasmid containing the pIII-expressing gene III (gIII) known as the accessory plasmid, or AP. Notably, production of infectious phage scales with the production of pIII.{{cite journal |last1=Rakonjac |first1=J. |last2=Model |first2=P. |date=1998 |title=Roles of pIII in filamentous phage assembly |journal=J. Mol. Biol. |volume=282 |issue=1 |pages=25–41 |doi=10.1006/jmbi.1998.2006|pmid=9733639 }} Hence, the better the activity of the protein, the higher the rate of pIII production, and the more infectious phage are generated for that particular gene.

Using error-prone polymerases (encoded on the mutagenesis plasmid, or MP), genetic variation is introduced into the protein gene portion of the phage vectors. Due to the selective pressures applied by the constant draining of the lagoon, only phages that can replicate fast enough can be retained in the lagoon, so over time beneficial mutations accumulate in phage replicating in the lagoon. In this manner, rounds of evolution are continuously performed, allowing hundreds of rounds to elapse with little human intervention.

[[File:PACE Overview.png|thumb|upright=3.0|center|alt=PACE overview|A general PACE scheme. MP stands for mutagenesis plasmid, and encodes the necessary proteins for introducing mutations into SP. SP stands for selection plasmid, and encodes the gene of the M13 bacteriophage minus gIII, as well as the gene of interest. AP stands for accessory plasmid, and contains gIII, as well as a method for inducing transcription of gIII.

]]

Applications

= Polymerase promoter specificity =

In the initial paper pioneering this technique, T7 RNA polymerases were evolved to recognize different promoters, such as the T3 or SP6 promoters.{{cite journal |last1=Lane |first1=M.D. |last2=Seelig |first2=B. |date=2014 |title=Advances in the directed evolution of proteins |journal=Curr. Opin. Chem. Biol. |volume=22 |pages=129–136 |doi=10.1016/j.cbpa.2014.09.013|pmid=25309990 |pmc=4253873 |bibcode=2014COCB...22..129L }} This was done by making the target promoter the sole promoter for gIII.{{cite journal |last1=Lemire |first1=S. |last2=Yehl |first2=K.M. |last3=Lu |first3=T.K. |journal=Annu. Rev. Virol. |date=2018 |volume=5 |pages=453–476 |title=Phage-Based Applications in Synthetic Biology |issue=1 |doi=10.1146/annurev-virology-092917-043544|pmid=30001182 |pmc=6953747 }} Hence, mutant polymerases with greater specificity for the desired promoter caused greater pIII production. This resulted in polymerases with ~3-4 orders of magnitude greater activity for the target promoter than the original T3 promoter.

While this original PACE system only performed positive selection, a variant was developed that allowed for negative selection as well. This is done by linking undesired activity to the production of non-functional pIII, which decreases the amount of infectious phage made.{{cite journal |last1=Brödel |first1=A.K. |last2=Isalan |first2=M. |last3=Jaramillo |first3=A. |date=2018 |title=Engineering of biomolecules by bacteriophage directed evolution |journal=Curr. Opin. Biotech. |volume=51 |pages=32–38 |doi=10.1016/j.copbio.2017.11.004|pmid=29175708 |doi-access=free |hdl=10261/184372 |hdl-access=free }}

[[File:Promoter pace.png|thumb|upright=2.0|center|alt=PACE on polymerases|A scheme for evolving polymerase promoter specificity using PACE. T7 RNAP containing mutations unfavorable to T3 promoter binding results in no gIII transcription. However, mutations that allow for the binding to a T3 promoter lead to increased gIII transcription.

]]

= Protease substrate specificity =

Proteases have been evolved to cut different peptides using PACE. In these systems, the desired protease cut site is used to link a T7 RNA polymerase and a T7 lysozyme. The T7 lysozyme prevents the T7 polymerase from transcribing gIII. When the peptide linker is cleaved, the T7 polymerase is activated, allowing for the transcription of the pIII gene. This method was used to create a TEV protease with a significantly different peptide substrate.{{cite journal |last1=Kim |first1=J.Y. |last2=Yoo |first2=H.W. |last3=Lee |first3=P.G. |last4=Lee |first4=S.G. |last5=Seo |first5=J.H. |last6=Kim |first6=B.G. |date=2019 |title=In vivo Protein Evolution, Next Generation Protein Engineering Strategy: from Random Approach to Target-specific Approach |journal=Biotechnol. Bioproc. E. |volume=24 |pages=85–94 |doi=10.1007/s12257-018-0394-2|s2cid=91687131 }}

File:Protease PACE.png

= Orthogonal Aminoacyl-tRNA Synthetases =

Using PACE, aminoacyl-tRNA synthetases (aaRSs) were evolved for noncanonical amino acids as well. Activity of an aaRS is linked to pIII production by the addition of a TAG stop codon in the middle of gIII. Synthetases that aminoacylate the TAG codon's suppressor tRNA prevents stop codon activity, allowing for production of functional pIII. Using this system, aaRSs were evolved that utilize non-canonical amino acids p-nitro-phenyalanine, iodophenylalanine, and Boc-lysine.{{cite journal |last1=Vargas-Rodriguez |first1=O. |last2=Sevostyanova |first2=A. |last3=Söll |first3=D. |last4=Crnković |first4=A. |date=2018 |title=Upgrading aminoacyl-tRNA synthetases for genetic code expansion |journal=Curr. Opin. Chem. Biol. |volume=46C |pages=115–122 |doi=10.1016/j.cbpa.2018.07.011|pmid=30056281 |pmc=7083171 }}

File:AaRS PACE.png

= Protein-Protein Interactions =

Protein-protein interactions have been evolved using PACE as well. Under this scheme, the target protein is fused with a DNA binding protein, which binds to a target sequence placed upstream of the gIII promoter. The protein undergoing evolution is fused with an RNA polymerase. The better the protein-protein interaction, the more transcription of pIII occurs, allowing the evolution of the protein-protein interaction under PACE conditions. This method was used to evolve Bacillus thuringiensis endotoxin variants that can overcome insect toxin resistance.{{cite journal |last1=Simon |first1=A.J. |last2=d'Oelsnitz |first2=S. |last3=Ellington |first3=A.D. |date=2018 |title=Synthetic Evolution |journal=Nat. Biotechnol. |volume=37 |issue=7 |pages=730–743 |doi=10.1038/s41587-019-0157-4|pmid=31209374 |s2cid=189927244 }}

File:Protein-protein interaction PACE.png

= Base Editors =

PACE was used to evolve APOBEC1 for greater soluble expression. APOBEC1 is a cytidine deaminase that has found use in base editors to catalyze the single nucleotide edit C-->T.{{cite journal |last1=Gaudelli |first1=N.M. |last2=Komor |first2=A.C. |last3=Rees |first3=H.A. |last4=Packer |first4=M.S. |last5=Badran |first5=A.H. |last6=Bryson |first6=D.I. |last7=Liu |first7=D.R. |date=2017 |title=Programmable base editing of A·T to G·C in genomic DNA without DNA cleavage |journal=Nature |volume=551 |issue=7681 |pages=464–471 |doi=10.1038/nature24644|pmid=29160308 |pmc=5726555 }} In E. coli, APOBEC1 usually falls out of solution into the insoluble fraction.{{cite journal |last1=Wang |first1=T. |last2=Badran |first2=A.H. |last3=Huang |first3=T.P. |last4=Liu |first4=D.R. |date=2018 |title=Continuous directed evolution of proteins with improved soluble expression |journal=Nat. Chem. Biol. |volume=14 |issue=10 |pages=972–980 |doi=10.1038/s41589-018-0121-5|pmid=30127387 |pmc=6143403 }} To evolve APOBEC1 for better soluble expression, the N-terminus of a T7 polymerase was fused to APOBEC1, with the remaining portion of the polymerase separately expressed. The T7 polymerase can only function when the N-terminus portion can bind to the rest of the polymerase. Since APOBEC1 must be properly folded for the N-terminus portion to be exposed properly, T7 polymerase activity is correlated to APOBEC1 folding. As follows, pIII transcription and production is linked with APOBEC1 soluble expression via the T7 polymerase. Using this approach, the soluble expression of APOBEC1 was increased by 4 fold with no change in function.

File:Soluble Expression PACE.png

PACE was also used to create a more catalytically active deoxyadenosine deaminase. Deoxyadenosine deaminase is used in base editors to perform the single nucleotide edit A-->T. This was done by placing adenosine-containing stop codons in the gene for T7 polymerase. If the base editor is able to correct the error, functional T7 polymerase is produced, allowing production of pIII. Using this system, they evolved a deoxyadenosine deaminase with 590 fold activity compared to wild type.{{cite journal |last1=Richter |first1=M.F. |last2=Zhao |first2=K.T. |last3=Eton |first3=E. |last4=Lapinaite |first4=A. |last5=Newby |first5=G.A. |last6=Thuronyi |first6=B.W. |last7=Wilson |first7=C. |last8=Koblan |first8=L.W. |last9=Zeng |first9=J. |last10=Bauer |first10=D.E. |last11=Doudna |first11=J.A. |last12=Liu |first12=D.R. |date=2020 |title=Phage-Assisted Evolution of an Adenine Base Editor with Enhanced Cas Domain Compatibility and Activity |journal=Nat. Biotechnol. |volume=38 |issue=7 |pages=883–891 |doi=10.1038/s41587-020-0453-z|pmid=32433547 |pmc=7357821 }}

File:A to G PACE.png

References