Skip Navigation
Skip to contents

Journal of Microbiology : Journal of Microbiology

OPEN ACCESS
SEARCH
Search

Articles

Page Path
HOME > J. Microbiol > Volume 63(3); 2025 > Article
Review
Expanding the genetic code: In vivo approaches for incorporating non-proteinogenic monomers
Dongheon Lee1, Suk Min Yun2, Jong-il Choi1,*
Journal of Microbiology 2025;63(3):e2501005.
DOI: https://doi.org/10.71150/jm.2501005
Published online: March 28, 2025

1Department of Biotechnology and Bioengineering, Chonnam National University, Gwangju 61186, Republic of Korea

2National Institute of Nakdong Basin Biological Resources, Sangju 37242, Republic of Korea

*Correspondence Jong-il Choi 'choiji01@jnu.ac.kr'
• Received: January 6, 2025   • Revised: February 20, 2025   • Accepted: February 21, 2025

© The Microbiological Society of Korea

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

  • 157 Views
  • 4 Download
  • 1 Crossref
  • The application of genetic code expansion has enabled the incorporation of non-canonical amino acids (ncAAs) into proteins, introducing novel functional groups and significantly broadening the scope of protein engineering. Over the past decade, this approach has extended beyond ncAAs to include non-proteinogenic monomers (npMs), such as β-amino acids and hydroxy acids. In vivo incorporation of these monomers requires maintaining orthogonality between endogenous and engineered aminoacyl-tRNA synthetase (aaRS)/tRNA pairs while optimizing the use of the translational machinery. This review introduces the fundamental principles of genetic code expansion and highlights the development of orthogonal aaRS/tRNA pairs and ribosomal engineering to incorporate npMs. Despite these advancements, challenges remain in engineering aaRS/tRNA pairs to accommodate npMs, especially monomers that differ significantly from L-α-amino acids due to their incompatibility with existing translational machinery. This review also introduces recent methodologies that allow aaRSs to recognize and aminoacylate npMs without reliance on the ribosomal translation system, thereby unlocking new possibilities for synthesizing biopolymers with chemically diverse monomers.
The canonical genetic code, composed of 64 codons for 20 standard amino acids, restricts the diversity of biopolymers achievable through ribosomal synthesis. To overcome these limitations, scientists have developed strategies to expand the genetic code, enabling the incorporation of non-proteinogenic monomers (npMs) into biopolymers (Costello et al., 2024). These monomers are chemical entities not typically incorporated by the canonical genetic code or used in standard ribosomal synthesis. npMs are promising in biological systems due to their diverse chemical functionalities, which allow for the creation of biopolymers with enhanced or novel properties. Unlike the 20 standard amino acids, npMs offer unique attributes, such as enhanced stability (Humpola et al., 2023; Li et al., 2018), modified electronic properties (Faraldos et al., 2011), studies of post-translational modification (Chen & Tsai, 2022; Gan & Fan, 2024), and additional reactive groups (Saleh et al., 2019; Switzer et al., 2023), expanding the range of chemical modifications possible. These distinct characteristics make npMs valuable in synthetic biology (Krahn et al., 2020; Niu & Guo, 2024), drug development (Ding et al., 2020; Huang & Liu, 2018), biocatalysis (Birch-Price et al., 2024; Lugtenburg et al., 2023) and biomaterials engineering (Chemla et al., 2024; Sisila et al., 2023), allowing researchers to design novel biomolecules with improved stability, pharmacokinetics, or catalytic functions that are difficult to achieve with standard amino acids.
npMs can be categorized into non-amino acid exotic monomers (exMs) and non-proteinogenic amino acids (npAAs) (Sigal et al., 2024). npAAs, such as non-canonical L-amino acids, D-amino acids, and β-amino acids, offer new functional groups and expand the stereochemical and structural landscape, while exMs, including α-hydroxy acids and α-thio acids, introduce novel backbones and interactions between monomers that are not achievable using amino acid structures (Fig. 1). This categorization provides a structured approach to capture the diversity of non-canonical monomers, aligning with the overall strategy of expanding the genetic code to incorporate these novel substrates.
Genetic code expansion involves reprogramming translation mechanisms to recognize new codons or synthetic nucleotide pairs, along with engineering translation machinery such as aminoacyl-tRNA synthetases (aaRS), tRNAs, ribosomes, and elongation factors (EFs) to accommodate non-canonical substrates (de la Torre & Chin, 2021; Hammerling et al., 2020; Rodnina, 2018). Expanding the genetic code allows researchers to explore the unique properties of npMs and to create biopolymers with functions and characteristics that exceed those of naturally occurring proteins. Incorporating npMs in vivo presents specific challenges, such as maintaining translation fidelity, ensuring the bioavailability of npMs, and optimizing orthogonal translation systems to function effectively in the cellular environment (Jin et al., 2019). Key components include designing orthogonal aaRS/tRNA pairs that do not interfere with the native machinery of the host and engineering translation components such as ribosomes and elongation factors to handle non-canonical substrates while maintaining cellular fitness (Arranz-Gibert et al., 2019). Addressing these challenges requires both rational design and directed evolution to ensure the expanded genetic code operates efficiently in living cells.
Advances in genetic code expansion have led to several key developments, including orthogonal translation systems, codon expansion techniques, and the development of genomically recoded organisms (GROs) (Chemla et al., 2018; Guo & Niu, 2022; Shandell et al., 2021). These advancements have enabled the successful incorporation of a diverse range of npMs into in vivo systems, which have broadened the range of npMs that can be incorporated through recent improvements in efficiency and specificity.
This review provides an overview of the strategies and challenges associated with incorporating npMs through genetic code expansion, with a focus on in vivo approaches. It explores the use of diverse orthogonal aaRS/tRNA pairs for incorporating npMs, while excluding non-canonical L-amino acids. Additionally, this review introduces methods for engineering orthogonal aaRS capable of recognizing novel npMs independently of the ribosomal translation mechanism.
Expanding the genetic code to incorporate npMs involves modifying and optimizing the translation mechanism and its core components, as well as developing orthogonal translation systems (OTS). The translation mechanism focuses on how engineered components, such as aaRSs, tRNAs, ribosomes, and elongation factors, are adapted to incorporate npMs (Kim et al., 2022). OTS are designed to function independently of the natural translation machinery of the host, providing a robust framework for incorporating npMs with high specificity and minimal competition with standard amino acids (Fu et al., 2022).
The successful incorporation of npMs relies on engineering and optimizing core components involved in the translation process. The translation mechanism begins with aaRS, which charges the tRNA with npMs (Fig. 2). Once charged, the modified tRNA recognizes specific codons, delivering the monomer to the ribosome. The ribosome then facilitates translation by incorporating the monomer into the growing polypeptide chain. Throughout this process, elongation factors, such as EF-Tu, ensure that the npM-charged tRNA is positioned correctly and that translation proceeds efficiently (Xu et al., 2021). These components work together to expand the genetic code and incorporate npMs into proteins.
Orthogonal Translation Systems (OTS)
Orthogonal translation systems are engineered to function independently of the natural translation machinery of the host. They use orthogonal aaRS and tRNA pairs that specifically recognize and incorporate npMs without interfering with endogenous translational components. OTS are an effective approach for incorporating npMs, ensuring high specificity and reducing competition with natural amino acids.
Various aaRS/tRNA pairs have been developed to incorporate npMs into proteins. To avoid interference with endogenous aaRS or tRNA, orthogonal aaRS/tRNA pairs from various foreign species have been introduced and manipulated. A representative strategy involves utilizing variations in recognition mechanisms among tRNA homologs from different domains to create orthogonal aaRS/tRNA pairs (Melnikov & Söll, 2019). This strategy has been successfully implemented by transplanting aaRS/tRNA pairs from archaea into different domains, including bacteria, eukaryotes, and mammalian cells. Notable examples include the use of Methanococcus jannaschii TyrRS/tRNATyr and Methanosarcina species PylRS/tRNAPyl pairs. For instance, M. jannaschii TyrRS/tRNATyr (Amiram et al., 2015; Wang et al., 2001), Methanosarcina barkeri PylRS/tRNAPyl (Blight et al., 2004; Neumann et al., 2008), Methanosarcina mazei PylRS/tRNAPyl (Neumann et al., 2008; Yanagisawa et al., 2008b), and Methanomethylophilus alvus PylRS/tRNAPyl (Beránek et al., 2019; Willis & Chin, 2018) are widely used. Additionally, orthogonal aaRS/tRNA pairs such as Saccharomyces cerevisiae PheRS/tRNAPhe (Furter, 1998; Kwon et al., 2006), TrpRS/tRNATrp (Chatterjee et al., 2013), and Pyrococcus horikoshii LysRS/tRNALys (Anderson et al., 2004) have been studied. These examples demonstrate the growing repertoire of tools available for expanding the genetic code, which facilitate the incorporation of increasingly complex npMs.
Stop codon, quadruplet codon, and sense codon suppression in genetic code expansion
Expanding the genetic code to incorporate npMs involves reassigning existing codons or introducing new coding strategies to overcome the limitations of the standard genetic code. Two widely used strategies for this purpose are stop codon suppression and the introduction of quadruplet codons. Stop codon suppression reassigns one of the three natural stop codons—amber (UAG), ochre (UAA), and opal (UGA)—for the purpose of encoding npMs. Among these, the amber codon is most commonly used, particularly in Escherichia coli, as it is the least frequently utilized stop codon in this model organism (Nakamura et al., 2000).
An early study investigated the M. jannaschii TyrRS/tRNATyr pair, in which the anticodon of tRNATyr (GUA), originally recognizing a tyrosine codon, was replaced with CUA to recognize the amber codon UAG (Wang & Schultz, 2001). This modification enabled the incorporation of O-methyl-L-tyrosine 1 at amber codon sites in E. coli proteins (Wang et al., 2001). Since then, numerous studies have refined amber suppression by developing enhanced orthogonal aaRS/tRNA pairs and improving translation efficiency (Kato, 2019). Another example utilizes the PylRS/tRNAPyl pair, which naturally recognizes the UAG codon (Srinivasan et al., 2002). The PylRS/tRNAPyl pair achieves orthogonality through unique structural features, such as a compacted binding site in the synthetase and distinctive structural elements in tRNAPyl, including a shortened variable loop and elongated anticodon stem (Nozawa et al., 2009; Srinivasan et al., 2002). These adaptations prevent recognition by the endogenous aminoacyl-tRNA synthetases and tRNAs, to ensure precise interaction exclusively with the amber codon.
Despite its success, stop codon suppression is inherently limited by the small number of available nonsense codons. To address this limitation, quadruplet codons have been explored as a strategy to increase the coding capacity and to enable the simultaneous incorporation of multiple npMs into a single protein. Early research in E. coli demonstrated the potential of quadruplet codons for genetic code expansion. For example, one study showed that mutant tRNALeu with engineered anticodon loops could decode specific quadruplet codons such as UAGA, enabling the incorporation of additional amino acids into proteins (Anderson et al., 2004). Another study achieved ncAA incorporation in response to quadruplet codons by engineering tRNALys anticodon loops to target specific frameshift codons, such as AGGA, enabling dual ncAA incorporation into proteins (Moore et al., 2000). A recent study demonstrated the synthesis of protein containing four distinct ncAAs at quadruplet codon sites using four mutually orthogonal aaRS/tRNA pairs. This achievement underscores the growing potential of quadruplet codons for expanding the repertoire of encoded monomers (Dunkelmann et al., 2021).
In addition to nonsense and quadruplet codon strategies, other strategies have been developed to explore the reassignment of sense codons to ncAAs. Researchers modified the anticodon loop of M. jannaschii tRNATyr and its cognate aaRS, reassigning the AGG codon, which encodes arginine, to tyrosine. This approach enhanced the efficiency of tyrosine incorporation and showed similar improvements for ncAA incorporation, highlighting the versatility of sense codon reassignment as a complementary strategy for genetic code expansion (Biddle et al., 2022). Another study focused on incorporating monomers into serine codons. By utilizing engineered M. mazei PylRS/tRNAPyl pairs in combination with genomically recoded organism (GRO), ncAAs were successfully incorporated into the serine codons TCG and TCA (Robertson et al., 2021). GRO provides a versatile platform for enhancing the efficiency of amber codon incorporation and facilitating sense codon reassignment, which will be discussed further in the next chapter. This process involves removing the tRNAs that decode these codons and leveraging orthogonal translational machinery to achieve high specificity in ncAA incorporation.
Genomically recoded organisms
Recent advancements in genetic code expansion have enabled the study of reassigning the genetic code within the entire genome. These efforts have resulted in the creation of genomically recoded organisms (GROs), which are designed and synthesized to use an alternate genetic code for enhanced flexibility and novel functionalities in protein synthesis. GROs are engineered by systematically replacing codons across the genome and manipulating translational components, enabling precise control over genetic coding and allowing the incorporation of npMs with minimal competition from endogenous machinery.
A notable example is the C321 strain, derived from Escherichia coli MG1655 through conjugative assembly genome engineering (CAGE) (Isaacs et al., 2011). This method enables the systematic transfer of synthesized chromosomal regions from a donor to a recipient, resulting in a fully recoded genome (Lajoie et al., 2013). In the C321 strain, all 321 UAG stop codons were replaced with UAA codons. To further enhance amber suppression, the prfA gene, which encodes release factor 1 responsible for UAG recognition during translational termination, was deleted to produce the C321.ΔA strain. However, this process introduced 355 off-target mutations, resulting in a 60% increase in doubling time compared to the original MG1655 strain. To address this issue, researchers used multiplex automated genome engineering (MAGE), a technique for simultaneous multiple genetic modifications, combined with whole-genome sequencing to identify and correct fitness-related mutations (Wang et al., 2009). This approach successfully modified six critical mutations, leading to the development of the C321.ΔA opt strain, which recovered 59% of the fitness observed in the C321.ΔA strain (Kuznetsov et al., 2017).
As another example, GRO research has expanded to include the systematic reassignment of sense codons. One notable example is the E. coli MDS42 strain, characterized by a simplified genome with the removal of non-essential genes. In this strain, the serine codons TCG and TCA were systematically replaced with their synonymous counterparts, AGC and AGT, respectively, in the genome. Subsequently, the prfA gene, which is responsible for terminating translation at the amber codon, and the serU and serT tRNA genes, which decode TCG and TCA, were deleted. This led to the creation of the Syn61 strain, a streamlined organism that uses only 61 codons, excluding the amber codon and two serine codons, thereby expanding the potential for blank codon usage and novel synthetic biology applications (Fredens et al., 2019).
These advancements in GRO development demonstrate the potential of genome-wide codon reassignment in genetic code expansion. By creating organisms with optimized genetic codes, researchers can introduce novel functionalities, improve translation fidelity, and provide unique platforms for incorporating non-proteinogenic monomers into proteins.
The incorporation of npMs into proteins is important for expanding the chemical and functional diversity of biomolecules. To achieve this, engineering of translational components is crucial for maintaining specificity, efficiency, and fidelity. Advancements in aaRSs, tRNAs, and ribosomes have expanded the repertoire of npMs that can be incorporated. In this section, we focus on the in vivo approaches for incorporating npMs, excluding non-canonical L-α-amino acids, which are extensively covered in reviews (Ishida et al., 2024; Rezhdo et al., 2019; Tang et al., 2022).
Engineered aminoacyl-tRNA synthetase/tRNA pair
The genetic incorporation of npMs in vivo often relies on the engineering aaRS and their cognate tRNAs to achieve substrate specificity and translational fidelity. For instance, the α-hydroxy acid, p-hydroxy-L-phenyllactic acid 2, was incorporated into proteins in E. coli using an engineered M. jannaschii tyrosyl-tRNA synthetase (Guo et al., 2008). To achieve this, five residues within the active site were randomized. Positive selection was then implemented to identify mutants capable of activating the desired substrate and negative selection to remove those that recognize endogenous amino acids (Wang et al., 2006). After screening, a synthetase variant capable of incorporating the hydroxy acid at amber codon sites in the myoglobin protein was identified.
Similarly, a study utilizing a polysubstrate-specific aminoacyl-tRNA synthetase, pCNFRS, developed from the same M. jannaschii derived TyrRS/tRNATyr pair, successfully demonstrated the site-specific incorporation of eight D-AAs, including D-Phe, D-Asn, D-Ile, D-Met, D-Arg, D-Ala, D-Pro, and D-Val, into GFPuv at residue 18, which had been mutated to an amber stop codon (Liu et al., 2012; Young et al., 2011). Additionally, D-Phe 3 was specifically introduced at the fluorophore-forming Tyr66 residue of GFPuv, resulting in a mutant protein with shifted emission and excitation wavelengths and enhanced thermal stability compared to its L-isomer counterpart.
The PylRS from M. mazei has also shown remarkable versatility in incorporating npMs by leveraging its loose recognition of the α-amino group (Kobayashi et al., 2009; Yanagisawa et al., 2008a). PylRS successfully acylated α-hydroxy acids such as Boc-LysOH 4 onto tRNAPyl, demonstrating high efficiency, with yields surpassing those of Boc-lysine. Sterically hindered substrates like NMe-Boc-lysine and D-Boc-lysine showed reduced activity, likely due to substrate-binding pocket clashes. The successful incorporation of Boc-LysOH 4 into proteins at amber codon sites in E. coli further established the utility of this system for site-specific introduction of ester bonds.
Additionally, PylRS/tRNAPyl systems have been employed to streamline therapeutic peptide production through the incorporation of hydroxy acids. For instance, Boc-LysOH 4, HO-Phe(3-Br)-OH 5, and HO-Tyr(propargyl)-OH 6 were site-specifically introduced into lanthipeptides like lacticin 481 and nukacin ISK-1 in E. coli using an amber codon suppression system (Bindman et al., 2015). This enabled the formation of ester bonds, facilitating the efficient removal of leader peptides under alkaline conditions without proteolytic enzymes, which are often limited by sequence specificity.
Similarly, recent advancements have enabled the in vivo incorporation of α-hydroxy acids into proteins using wild-type and engineered pyrrolysyl-tRNA synthetase systems from M. alvus (MaPylRS), achieving site-specific incorporation of monomers such as Boc-LysOH 4 and α-OH-m-CF3-Phe 7, respectively, into superfolder GFP at amber codon sites in E. coli (Fricke et al., 2023; Wang et al., 2011, 2012). Additionally, engineered derivatives were also shown to selectively acylate npMs in vitro, including α-hydroxy acids, α-thio acids, N-formyl-L-α-amino acids, and α-carboxy acids. Structural analysis of the engineered derivative MaFRSA bound to the prochiral m-CF3-2-benzylmalonate, an α-carboxy acid, revealed that PylRS adapts to the expanded size of the α-substituent by forming new stabilizing hydrogen bonds with the α-carboxy group. It also employs distinct binding modes for the pro-S and pro-R configurations of the substrate (Fricke et al., 2023). Based on these findings, subsequent studies demonstrated the in vivo incorporation of multiple (S)-β2-hydroxy acids into proteins using the orthogonal MaPylRS/MatRNAPyl system (Hamlish et al., 2024). The MaPylRS enzyme demonstrated a preference for (S)-β2-hydroxy acids as substrates in cellulo, a finding attributed to their absolute configuration aligning with that of D-α-amino acids. In vivo synthesis of protein containing two (S)-β2-OH 8, a β2-hydroxy acid whose backbone was extended to a β2 configuration from Boc-LysOH 4, was confirmed. This work represents the first successful in vivo synthesis of proteins containing multiple β2-hydroxy acids, and thus expanded the chemical diversity of engineered proteins.
Additionally, two distinct non-proteinogenic monomers, non-canonical amino acids and hydroxy acids, were incorporated into macrocyclic peptides in E. coli (Spinck et al., 2023). These macrocycles were synthesized in a Syn61-derived strain (Robertson et al., 2021), a GRO with reduced codon redundancy. This strain enables the suppression of the UAG stop codon as well as the UCG and UCA sense codons, facilitating the efficient incorporation of multiple monomers. For example, this was achieved using mutually orthogonal pairs, such as the M. mazei PylRS/tRNAPyl for the UCG codon to incorporate AlkynK-OH 9 and the Archaeoglobus fulgidus TyrRS/tRNATyr for the UAG codon to incorporate para-azido-L-phenylalanine 10. The resulting macrocycles featured ester bonds and novel side chains, demonstrating the potential for incorporating multiple chemically diverse npMs and expanding the range of non-canonical polymers.
Ribosome engineering
A strategy for incorporating npMs into proteins involves engineering ribosomes to accommodate non-canonical backbones. This is often achieved by increasing the flexibility of the peptidyltransferase center (PTC) within the ribosome, enabling the accommodation of npMs. Antibiotics such as erythromycin and puromycin, which mimic substrates and are recognized at the PTC, can be employed to facilitate selection for desired traits during engineering (Hecht, 2022).
Building on the 040329 ribosome, which was optimized for β-amino acid incorporation in vitro through specific mutations in the 23S rRNA (Dedkova et al., 2012), a more advanced ribosome named P7A7 was developed (Melo Czekster et al., 2016). Targeted mutations in the A-site-adjacent region of the 23S rRNA (positions 2502–2507) enhanced its efficiency in incorporating β3-(p-Br)Phe 11. When paired with the E. coli PheRS/tRNAPhe system and the wild-type elongation factor Tu (EF-Tu), the P7A7 ribosome demonstrated a threefold improvement in β3-Phe incorporation efficiency compared to the 040329 ribosome, while simultaneously achieving a more favorable incorporation ratio of β3-(p-Br)Phe to α-Phe. However, further analysis revealed that the P7A7 ribosome faced structural and assembly challenges, with disordered regions in the PTC and adjacent helices leading to inefficient 70S ribosome formation (Ward et al., 2019). These structural instabilities also led to subunit association deficiencies, requiring higher magnesium concentrations to stabilize the ribosome complex. These findings emphasize the importance of balancing functionality and stability in the design of ribosomes for non-canonical translation systems.
The expansion of the genetic code to incorporate npMs in vivo has been driven by OTS, which use engineered aaRS/tRNA pairs and modified ribosomes. However, traditional methods for developing OTS to incorporate exotic monomers or non-proteinogenic amino acids such as positive and negative selection methods (Guo et al., 2008; Wang & Schultz, 2001) often rely on translation-dependent readouts. This dependence restricts the range of compatible monomers and complicates screening procedures, as the strong coupling between ribosomal translation efficiency and the specificity of the synthetase for monomers can create bottlenecks, particularly with structurally diverse, less compatible substrates exhibiting low fidelity. To address these challenges, recent studies have introduced more flexible and efficient strategies for OTS development. These approaches aim to expand the chemical diversity accessible through in vivo genetic code expansion while reducing the constraints imposed by traditional, translation-dependent methodologies.
The tRNA Extension (tREX) methodology introduces an efficient, scalable approach to identify and evaluate orthogonal aaRS/tRNA pairs for the genetic incorporation of npMs (Cervettini et al., 2020). At its core, tREX leverages a two-step biochemical strategy to determine the aminoacylation status of candidate tRNAs (Fig. 3). First, tRNA is subjected to selective oxidation of its 3′ terminal hydroxyl group. This treatment prevents extension by polymerase unless the tRNA is aminoacylated, preserving the attachment of the amino acid. Second, a Cy5-labeled DNA probe complementary to the tRNA acceptor stem is hybridized, and the resulting hybrid is extended using a 3′-to-5′ exonuclease-deficient Klenow fragment of DNA polymerase I. Only aminoacylated tRNAs resist oxidation and undergo extension, enabling precise differentiation between charged and uncharged tRNAs via native PAGE. Using this method, tREX successfully identified eight mutually orthogonal aaRS-tRNA pairs that function independently of endogenous E. coli pairs.
Expanding on the earlier fluoro-tREX method, the bio-mREX approach was specifically developed to improve the efficiency of synthetase mutant screening, allowing for the identification of variants capable of efficiently acylating tRNAs (Dunkelmann et al., 2024). This progression included the introduction of split tRNA (stRNA) systems, initially employing trans configurations where tRNA halves divided at the anticodon were expressed separately and assembled in vivo to enable modular testing of tRNA functionality and flexibility in structural design (Fig. 3). This process was further refined in the cis stRNA approach, by circularly permuting the tRNA gene and using intervening sequences processed by endogenous RNases, ensuring efficient assembly and tRNA reconstitution. Building on this foundation, stmRNA were developed to integrate one half of the cis stRNA with the synthetase mRNA, creating a system that co-expressed both components and enabled aminoacylation via the synthetase encoded on its linked mRNA transcript. Bio-mREX utilized biotin-labeled probes rather than fluorescent probes, and streptavidin beads to selectively capture acylated tRNAs, which were then converted into cDNA for targeted synthetase mutant screening. This process, combined with next-generation sequencing (NGS), allowed for high-throughput screening and detailed analysis of synthetase-tRNA pairs, facilitating the identification of synthetases compatible with structurally diverse non-proteinogenic monomers. Furthermore, the tRNA display technology enabled by bio-mREX expanded the scope of genetic code incorporation to include not only L-α-amino acids but also a wide range of structurally diverse non-proteinogenic monomers. These included β-amino acids such as (S)-3-amino-3-(3-bromophenyl)propanoic acid 12, (S)-3-Amino-3-(4-bromophenyl)propanoic acid 13, and (S)-3-Amino-3-[3-(trifluoromethyl)phenyl]propanoic acid 14, as well as α,α-disubstituted amino acids like (S)-alpha-Methyl-4-Iodophenylalanine 15. This achievement demonstrates the versatility of the system in accommodating chemically and structurally unique substrates, opening new opportunities for developing aaRS variants for novel monomers.
As a similar approach, the START (Sequence-based Translation-independent Acylation of RNA by aaRSs with barcoding for Translation) methodology also emphasizes translation-independent strategies to engineer aaRSs by directly linking tRNA acylation to aaRS activity through a sequence-based barcoding system (Soni et al., 2024). In this strategy, candidate aaRS mutants acylate barcode-tagged tRNAs, with their activity subsequently analyzed through high-throughput sequencing. A key feature of the START methodology is its reliance on barcode-tagged tRNAs, each uniquely linked to a specific aaRS mutant (Fig. 3). By utilizing the anticodon loop of tRNA as a permissive site for sequence barcoding, which is possible because PylRS does not interact with the anticodon region and tolerates expanded tRNApyl anticodons, the system maintains efficient tRNA identification (Ambrogelly et al., 2007; Nozawa et al., 2009). This design allows for the creation of a highly diverse barcode library, with 10 randomized nucleotides generating 1 × 106 unique sequences. To further facilitate the screening process, polymerase-mediated extension of the 3′-terminus is employed using a DNA template hybridized onto the tRNA sequence. This step allows selective tagging of acylated tRNAs by hybridizing a DNA template to the intact 3′-terminus, enabling their subsequent enrichment and identification through reverse transcription (RT) PCR. By minimizing interference from endogenous translational machinery, this translation-independent approach is effective for screening synthetase mutants with non-standard or structurally complex substrates. Each barcode establishes a direct link between genotype and phenotype, allowing identification of active variants through NGS. Using the START strategy, researchers successfully identified aminoacyl-tRNA synthetase mutants capable of incorporating ncAAs such as m-iodo-L-phenylalanine 16, o-nitro-L-phenylalanine 17 and o-cyano-L-phenylalanine 18. Additionally, the study confirmed that the START system is compatible with Boc-LysOH 4, demonstrating its scalability for incorporating diverse npMs.
A key distinction between these approaches is that tREX, bio-mREX, and START differ in their strategies for identifying and optimizing orthogonal aaRS/tRNA pairs for genetic code expansion. tREX utilizes tRNA oxidation and fluorescence labeling to distinguish aminoacylated tRNAs from non-aminoacylated ones, enabling the rapid screening of orthogonal pairs without direct dependence on translation. Building upon tREX, bio-mREX enhances this approach by integrating mRNA-tRNA pairing through split tRNA systems, allowing for high-throughput selection using biotin-based capture and NGS analysis. This advancement enables the efficient screening of synthetase variants with improved substrate specificity. Meanwhile, START introduces a unique barcode-based system, where sequence tags in the anticodon loop of tRNAs are used to track acylation events via NGS. This method ensures a direct genotype-phenotype link, allowing for large-scale mutational analysis and precise optimization of aaRS variants. While all three approaches facilitate the identification of synthetase mutants capable of incorporating npMs, bio-mREX and START offer higher throughput and scalability compared to tREX, making them suitable for the rapid evolution of translation systems, particularly aaRS, to accommodate chemically diverse substrates.
These emerging methodologies, including tREX, bio-mREX, and START, collectively illustrate the transformative potential of translation-independent and high-throughput strategies for engineering orthogonal translation systems. By addressing the limitations of traditional, translation-dependent approaches, these methods are expected to enable the incorporation of structurally diverse and chemically unique non-proteinogenic monomers, significantly expanding the chemical diversity accessible through genetic code expansion.
This review highlights the remarkable advancements in genetic code expansion, particularly focusing on the in vivo incorporation of non-proteinogenic monomers (npMs) into proteins. By leveraging orthogonal translation systems (OTS), including engineered aminoacyl-tRNA synthetase (aaRS)/tRNA pairs, codon reassignment, and genomically recoded organisms, researchers have significantly advanced the field of genetic code expansion. In particular, the development of engineered translational components, such as aaRS/tRNA pairs and ribosomes, has facilitated the incorporation of chemically and functionally diverse npMs, including α-hydroxy acids, β-amino acids, and other monomers, in vivo through ribosomal translation. Furthermore, recent methodologies such as tREX, bio-mREX, and START have addressed key challenges in translational dependency of aaRS/tRNA engineering on ribosome-mediated translation by introducing translation-independent approaches.
Despite these significant advancements, the practical applications of npMs, excluding non-canonical L-α-amino acids, have been largely studied in vitro, employing techniques such as solid-phase peptide synthesis (SPPS) or flexible in vitro translation (FIT) systems (Goto et al., 2011; Palomo, 2014), and, to the best of our knowledge, have remained limited in vivo. Although in vitro integration has demonstrated promising potential (Table 1), realizing this potential in vivo remains challenging, particularly with regard to achieving efficient incorporation into proteins in living systems, necessitating further expansion of the substrate scope. To address this, translation-independent screening methods—tREX, bio-mREX, and START—are anticipated to accelerate the discovery of novel aaRS/tRNA pairs and broaden the substrate scope. In parallel, computational approaches, such as predictive modeling of substrate binding affinities (Ren et al., 2015) and rational design of aaRS (Baumann et al., 2019), are expected to facilitate the development of translation components, enabling more precise and scalable in vivo incorporation of npMs.
One potential future direction involves developing microbial platforms specifically optimized for non-natural polymer production. By engineering microbial cell factories to metabolically produce desired npMs alongside optimized genetic codes and translational machinery, the biosynthesis of non-natural polymers could be significantly improved. Additionally, novel enzymes capable of catalyzing unprecedented chemical reactions might be developed. Integrating npMs with atoms such as sulfur, chlorine, or phosphorus—which possess unique polarities and sizes—could enable the design of artificial active sites and create innovative catalytic reactions. Lastly, predictive models capable of accommodating complex polymers with novel side chains and backbones are required to fully design npM-incorporated biopolymers, as current protein structure prediction models primarily rely on databases of known protein structures (Niazi et al., 2024) and are limited when applied to non-canonical proteins.
In conclusion, as the genetic code expansion continues to evolve, it will likely serve as an innovative technology for overcoming the limitations of traditional protein engineering, synthesizing novel biopolymers, and opening new frontiers in biotechnology.
Fig. 1.
Classification of non-proteinogenic monomers (npMs). Non-proteinogenic monomers (npMs) are categorized into non-proteinogenic amino acids (npAAs) and exotic monomers (exMs). npAAs include non-canonical L-amino acids, D-amino acids, and β-amino acids, while exMs include α- and β-hydroxy acids. The numbers refer to the corresponding chemical structures as indicated in the main text.
jm-2501005f1.jpg
Fig. 2.
Schematic representation of the orthogonal translation system for the in vivo incorporation of non-proteinogenic monomers (npMs). Orthogonal aminoacyl-tRNA synthetases (aaRS) and their corresponding tRNAs are engineered to aminoacylate non-proteinogenic monomers (npMs) while avoiding interference from endogenous aaRS/tRNA pairs and canonical amino acids. The aminoacylated tRNAs carrying npMs are delivered to the ribosome, where they occupy the A-site through interactions with elongation factor Tu (EF-Tu) and GTP. The translation process incorporates the npMs into the growing peptide chain, resulting in novel biopolymers with expanded chemical diversity.
jm-2501005f2.jpg
Fig. 3.
Overview of translation-independent tRNA acylation and screening methodologies. To engineer aminoacyl-tRNA synthetases (aaRSs) or screen orthogonal aaRS/tRNA pairs, translation-independent methodologies have been utilized. These methods exploit the chemical distinctions between aminoacylated and non-aminoacylated tRNAs to enable precise selection and analysis. Non-aminoacylated tRNAs are selectively oxidized at their 3′ end, rendering them incapable of extension, while aminoacylated tRNAs remain unmodified, allowing subsequent extension. tREX facilitates screening by differentiating aminoacylated tRNAs through either by size separation via gel electrophoresis or by using biotinylated oligonucleotides. Building upon tREX, the bio-mREX approach improves screening efficiency by linking mRNAs encoding aaRS variants to their corresponding tRNAs. This technique employs split tRNAs with an intervening loop to assign specific mRNAs to each tRNA, and utilizes biotinylated extensions on aminoacylated tRNAs, which can then be captured using streptavidin beads. The START methodology employs sequence-barcoded tRNAs by assigning unique barcodes to the anticodon region, as PylRS tolerates additional nucleotides. This approach utilizes next-generation sequencing (NGS) to directly connect the genotype of aaRS mutants to their acylation activity, enabling high-throughput analysis and facilitating directed evolution.
jm-2501005f3.jpg
Table 1.
Engineered peptides for practical applications through in vitro integration of non-proteinogenic monomers
Application Engineered peptide Introduced npMs Additional features Method Ref
Biomaterial D-peptide hydrogelator D-Phe, D-Lys, D-Tyr phosphate Phosphatase substrate compatibility, resistant to proteolytic degradation SPPS Li et al. (2013)
Collagen mimetic peptide Aza-glycine Triple-helix thermal stability SPPS Zhang et al. (2015)
Peptide linker for hydrogels D-Val, D-Pro, D-Met, D-Ser, D-Arg Resistance to enzymatic degradation, increased cytotoxicity SPPS Bomb et al. (2023)
Therapeutic Agonist of GLP-1 receptor cyclic β-amino acids Native-like potency, proteolysis resistance SPPS Johnson et al. (2014)
Aurein 1.2 peptide analog β-Leu, β-Lys, β-Ile, β-Ala, β-Ser, β-Phe, cyclic β-amino acids Increased helical stability, improved selectivity, reduced hemolysis SPPS Lee et al. (2017)
Ubiquitin chain binding peptide D-Ala, D-Phe, N-methyl-Gly, N-methyl-Ala High affinity, protease resistance, membrane permeability FIT Rogers et al. (2021)
Ganglioside GM1-binding peptide tag β-Trp, β-Tyr, β-Lys Binding affinity, protease resistance SPPS Hetényi et al. (2022)
Inhibitor of SARS-CoV-2 main protease cyclic γ-amino acids, cyclic β-amino acids High affinity, proteolytic stability, prolonged serum stability FIT Miura et al. (2023, 2024b)
Inhibitor of interferon gamma receptor 1 cyclic β-amino acids, cyclic γ-amino acids Enhanced binding affinity, inhibitory activity, prolonged serum stability FIT Miura et al. (2024a)
  • Ambrogelly A, Gundllapalli S, Herring S, Polycarpo C, Frauer C, et al. 2007. Pyrrolysine is not hardwired for cotranslational insertion at UAG codons. Proc Natl Acad Sci USA. 104(9): 3141–3146. ArticlePubMedPMC
  • Amiram M, Haimovich AD, Fan C, Wang YS, Aerni HR, et al. 2015. Evolution of translation machinery in recoded bacteria enables multi-site incorporation of nonstandard amino acids. Nat Biotechnol. 33(12): 1272–1279. ArticlePubMedPMCPDF
  • Anderson JC, Wu N, Santoro SW, Lakshman V, King DS, et al. 2004. An expanded genetic code with a functional quadruplet codon. Proc Natl Acad Sci USA. 101(20): 7566–7571. ArticlePubMedPMC
  • Arranz-Gibert P, Patel JR, Isaacs FJ. 2019. The role of orthogonality in genetic code expansion. Life. 9(3): 58.ArticlePubMedPMC
  • Baumann T, Hauf M, Richter F, Albers S, Möglich A, et al. 2019. Computational aminoacyl-tRNA synthetase library design for photocaged tyrosine. Int J Mol Sci. 20(9): 2343.ArticlePubMedPMC
  • Beránek V, Willis JCW, Chin JW. 2019. An evolved Methanomethylophilus alvus pyrrolysyl-tRNA synthetase/tRNA pair is highly active and orthogonal in mammalian cells. Biochemistry. 58(5): 387–390. ArticlePubMedPDF
  • Biddle W, Schwark DG, Schmitt MA, Fisk JD. 2022. Directed evolution pipeline for the improvement of orthogonal translation machinery for genetic code expansion at sense codons. Front Chem. 10: 815788.ArticlePubMedPMC
  • Bindman NA, Bobeica SC, Liu WR, van der Donk WA. 2015. Facile removal of leader peptides from lanthipeptides by incorporation of a hydroxy acid. J Am Chem Soc. 137(22): 6975–6978. ArticlePubMedPMC
  • Birch-Price Z, Hardy FJ, Lister TM, Kohn AR, Green AP. 2024. Noncanonical amino acids in biocatalysis. Chem Rev. 124(14): 8740–8786. ArticlePubMedPMCPDF
  • Blight SK, Larue RC, Mahapatra A, Longstaff DG, Chang E, et al. 2004. Direct charging of tRNACUA with pyrrolysine in vitro and in vivo. Nature. 431: 333–335. ArticlePubMedPDF
  • Bomb K, Zhang Q, Ford EM, Fromen CA, Kloxin AM. 2023. Systematic D-amino acid substitutions to control peptide and hydrogel degradation in cellular microenvironments. ACS Macro Lett. 12(6): 725–732. ArticlePubMedPMCPDF
  • Cervettini D, Tang S, Fried SD, Willis JCW, Funke LFH, et al. 2020. Rapid discovery and evolution of orthogonal aminoacyl-tRNA synthetase-tRNA pairs. Nat Biotechnol. 38: 989–999. ArticlePubMedPMCPDF
  • Chatterjee A, Xiao H, Yang PY, Soundararajan G, Schultz PG. 2013. A tryptophanyl-tRNA synthetase/tRNA pair for unnatural amino acid mutagenesis in E. coli. Angew Chem Int Ed Engl. 52(19): 5106–5109. ArticlePubMedPDF
  • Chemla Y, Kaufman F, Amiram M, Alfonta L. 2024. Expanding the genetic code of bioelectrocatalysis and biomaterials. Chem Rev. 124(20): 11187–11241. ArticlePubMedPDF
  • Chemla Y, Ozer E, Algov I, Alfonta L. 2018. Context effects of genetic code expansion by stop codon suppression. Curr Opin Chem Biol. 46: 146–155. ArticlePubMed
  • Chen J, Tsai YH. 2022. Applications of genetic code expansion in studying protein post-translational modification. J Mol Biol. 434(8): 167424.ArticlePubMed
  • Costello A, Peterson AA, Chen PH, Bagirzadeh R, Lanster DL, et al. 2024. Genetic code expansion history and modern innovations. Chem Rev. 124(21): 11962–12005. ArticlePubMedPDF
  • de la Torre D, Chin JW. 2021. Reprogramming the genetic code. Nat Rev Genet. 22: 169–184. ArticlePubMedPDF
  • Dedkova LM, Fahmi NE, Paul R, del Rosario M, Zhang L, et al. 2012. β-Puromycin selection of modified ribosomes for in vitro incorporation of β-amino acids. Biochemistry. 51(1): 401–415. ArticlePubMed
  • Ding Y, Ting JP, Liu J, Al-Azzam S, Pandya P, et al. 2020. Impact of non-proteinogenic amino acids in the discovery and development of peptide therapeutics. Amino Acids. 52(9): 1207–1226. ArticlePubMedPMCPDF
  • Dunkelmann DL, Oehm SB, Beattie AT, Chin JW. 2021. A 68-codon genetic code to incorporate four distinct non-canonical amino acids enabled by automated orthogonal mRNA design. Nat Chem. 13(11): 1110–1117. ArticlePubMedPMCPDF
  • Dunkelmann DL, Piedrafita C, Dickson A, Liu KC, Elliott TS, et al. 2024. Adding α,α-disubstituted and β-linked monomers to the genetic code of an organism. Nature. 625: 603–610. ArticlePubMedPMCPDF
  • Faraldos JA, Antonczak AK, González V, Fullerton R, Tippmann EM, et al. 2011. Probing eudesmane cation-π interactions in catalysis by aristolochene synthase with non-canonical amino acids. J Am Chem Soc. 133(35): 13906–13909. ArticlePubMed
  • Fredens J, Wang K, de la Torre D, Funke LFH, Robertson WE, et al. 2019. Total synthesis of Escherichia coli with a recoded genome. Nature. 569: 514–518. ArticlePubMedPMCPDF
  • Fricke R, Swenson CV, Roe LT, Hamlish NX, Shah B, et al. 2023. Expanding the substrate scope of pyrrolysyl-transfer RNA synthetase enzymes to include non-α-amino acids in vitro and in vivo. Nat Chem. 15(7): 960–971. ArticlePubMedPMCPDF
  • Fu X, Huang Y, Shen Y. 2022. Improving the efficiency and orthogonality of genetic code expansion. Biodes Res. 2022: 9896125.ArticlePubMedPMCPDF
  • Furter R. 1998. Expansion of the genetic code: Site-directed p-fluoro-phenylalanine incorporation in Escherichia coli. Protein Sci. 7(2): 419–426. ArticlePubMedPMCPDF
  • Gan Q, Fan C. 2024. Orthogonal translation for site-specific installation of post-translational modifications. Chem Rev. 124(5): 2805–2838. ArticlePubMedPDF
  • Goto Y, Katoh T, Suga H. 2011. Flexizymes for genetic code reprogramming. Nat Protoc. 6: 779–790. ArticlePubMedPDF
  • Guo J, Niu W. 2022. Genetic code expansion through quadruplet codon decoding. J Mol Biol. 434(8): 167346.ArticlePubMed
  • Guo J, Wang J, Anderson JC, Schultz PG. 2008. Addition of an alpha-hydroxy acid to the genetic code of bacteria. Angew Chem Int Ed Engl. 47(4): 722–725. ArticlePubMed
  • Hamlish NX, Abramyan AM, Shah B, Zhang Z, Schepartz A. 2024. Incorporation of multiple β(2)-hydroxy acids into a protein in vivo using an orthogonal aminoacyl-tRNA synthetase. ACS Cent Sci. 10(5): 1044–1053. ArticlePubMedPMCPDF
  • Hammerling MJ, Krüger A, Jewett MC. 2020. Strategies for in vitro engineering of the translation machinery. Nucleic Acids Res. 48(3): 1068–1083. ArticlePubMedPDF
  • Hecht SM. 2022. Expansion of the genetic code through the use of modified bacterial ribosomes. J Mol Biol. 434(8): 167211.ArticlePubMed
  • Hetényi A, Szabó E, Imre N, Bhaumik KN, Tököli A, et al. 2022. α/β-Peptides as nanomolar triggers of lipid raft-mediated endocytosis through GM1 ganglioside recognition. Pharmaceutics. 14(3): 580.ArticlePubMedPMC
  • Huang Y, Liu T. 2018. Therapeutic applications of genetic code expansion. Synth Syst Biotechnol. 3(3): 150–158. ArticlePubMedPMC
  • Humpola MV, Spinelli R, Erben M, Perdomo V, Tonarelli GG, et al. 2023. D- and N-methyl amino acids for modulating the therapeutic properties of antimicrobial peptides and lipopeptides. Antibiotics. 12(5): 821.ArticlePubMedPMC
  • Isaacs FJ, Carr PA, Wang HH, Lajoie MJ, Sterling B, et al. 2011. Precise manipulation of chromosomes in vivo enables genome-wide codon replacement. Science. 333(6040): 348–353. ArticlePubMedPMC
  • Ishida S, Ngo PHT, Gundlach A, Ellington A. 2024. Engineering ribosomal machinery for noncanonical amino acid incorporation. Chem Rev. 124(12): 7712–7730. ArticlePubMedPDF
  • Jin X, Park OJ, Hong SH. 2019. Incorporation of non-standard amino acids into proteins: Challenges, recent achievements, and emerging applications. Appl Microbiol Biotechnol. 103(7): 2947–2958. ArticlePubMedPMC
  • Johnson LM, Barrick S, Hager MV, McFedries A, Homan EA, et al. 2014. A potent α/β-peptide analogue of GLP-1 with prolonged action in vivo. J Am Chem Soc. 136(37): 12848–12851. ArticlePubMedPMC
  • Kato Y. 2019. Translational control using an expanded genetic code. Int J Mol Sci. 20(4): 887.ArticlePubMedPMC
  • Kim S, Yi H, Kim YT, Lee HS. 2022. Engineering translation components for genetic code expansion. J Mol Biol. 434(8): 167302.ArticlePubMed
  • Kobayashi T, Yanagisawa T, Sakamoto K, Yokoyama S. 2009. Recognition of non-alpha-amino substrates by pyrrolysyl-tRNA synthetase. J Mol Biol. 385(5): 1352–1360. ArticlePubMed
  • Krahn N, Tharp JM, Crnković A, Söll D. 2020. Engineering aminoacyl-tRNA synthetases for use in synthetic biology. Enzymes. 48: 351–395.
  • Kuznetsov G, Goodman DB, Filsinger GT, Landon M, Rohland N, et al. 2017. Optimizing complex phenotypes through model-guided multiplex genome engineering. Genome Biol. 18: 100.ArticlePubMedPMCPDF
  • Kwon I, Wang P, Tirrell DA. 2006. Design of a bacterial host for site-specific incorporation of p-bromophenylalanine into recombinant proteins. J Am Chem Soc. 128(36): 11778–11783. ArticlePubMed
  • Lajoie MJ, Rovner AJ, Goodman DB, Aerni HR, Haimovich AD, et al. 2013. Genomically recoded organisms expand biological functions. Science. 342(6156): 357–360. ArticlePubMedPMC
  • Lee MR, Raman N, Gellman SH, Lynn DM, Palecek SP. 2017. Incorporation of β-amino acids enhances the antifungal activity and selectivity of the helical antimicrobial peptide aurein 1.2. ACS Chem Biol. 12(12): 2975–2980. ArticlePubMedPMC
  • Li J, Gao Y, Kuang Y, Shi J, Du X, et al. 2013. Dephosphorylation of D-peptide derivatives to form biofunctional, supramolecular nanofibers/hydrogels and their potential applications for intracellular imaging and intratumoral chemotherapy. J Am Chem Soc. 135(26): 9907–9914. ArticlePubMedPMC
  • Li JC, Liu T, Wang Y, Mehta AP, Schultz PG. 2018. Enhancing protein stability with genetically encoded noncanonical amino acids. J Am Chem Soc. 140(47): 15997–16000. ArticlePubMedPMC
  • Liu Z, Yang X, Yi D, Wang S, Chen Y. 2012. Genetic incorporation of D-lysine into diketoreductase in Escherichia coli cells. Amino Acids. 43(6): 2553–2559. ArticlePubMedPDF
  • Lugtenburg T, Gran-Scheuch A, Drienovská I. 2023. Non-canonical amino acids as a tool for the thermal stabilization of enzymes. Protein Eng Des Sel. 36: gzad003.ArticlePubMedPMC
  • Melnikov SV, Söll D. 2019. Aminoacyl-tRNA synthetases and tRNAs for an expanded genetic code: What makes them orthogonal? Int J Mol Sci. 20(8): 1929.ArticlePubMedPMC
  • Melo Czekster C, Robertson WE, Walker AS, Söll D, Schepartz A. 2016. In vivo biosynthesis of a β-amino acid-containing protein. J Am Chem Soc. 138(16): 5194–5197. ArticlePubMedPMC
  • Miura T, Lee KJ, Katoh T, Suga H. 2024a. In vitro selection of macrocyclic L-α/D-α/β/γ-hybrid peptides targeting IFN-γ/IFNGR1 protein-protein interaction. J Am Chem Soc. 146(26): 17691–17699. ArticlePubMedPDF
  • Miura T, Malla TR, Brewitz L, Tumber A, Salah E, et al. 2024b. Cyclic β2,3-amino acids improve the serum stability of macrocyclic peptide inhibitors targeting the SARS-CoV-2 main protease. Bull Chem Soc Jpn. 97(5): uoae018.ArticlePubMedPMC
  • Miura T, Malla TR, Owen CD, Tumber A, Brewitz L, et al. 2023. In vitro selection of macrocyclic peptide inhibitors containing cyclic γ2,4-amino acids targeting the SARS-CoV-2 main protease. Nat Chem. 15(7): 998–1005. ArticlePubMedPMCPDF
  • Moore B, Persson BC, Nelson CC, Gesteland RF, Atkins JF. 2000. Quadruplet codons: Implications for code expansion and the specification of translation step size. J Mol Biol. 298(2): 195–209. ArticlePubMed
  • Nakamura Y, Gojobori T, Ikemura T. 2000. Codon usage tabulated from international DNA sequence databases: Status for the year 2000. Nucleic Acids Res. 28(1): 292.ArticlePubMedPMC
  • Neumann H, Peak-Chew SY, Chin JW. 2008. Genetically encoding Nε-acetyllysine in recombinant proteins. Nat Chem Biol. 4: 232–234. ArticlePubMed
  • Niazi SK, Mariam Z, Paracha RZ. 2024. Limitations of protein structure prediction algorithms in therapeutic protein development. BioMedInformatics. 4(1): 98–112. Article
  • Niu W, Guo J. 2024. Cellular site-specific incorporation of noncanonical amino acids in synthetic biology. Chem Rev. 124(18): 10577–10617. ArticlePubMed
  • Nozawa K, O'Donoghue P, Gundllapalli S, Araiso Y, Ishitani R, et al. 2009. Pyrrolysyl-tRNA synthetase-tRNAPyl structure reveals the molecular basis of orthogonality. Nature. 457: 1163–1167. ArticlePubMedPDF
  • Palomo JM. 2014. Solid-phase peptide synthesis: An overview focused on the preparation of biologically relevant peptides. RSC Adv. 4(62): 32658–32672. Article
  • Ren W, Truong TM, Ai H-W. 2015. Study of the binding energies between unnatural amino acids and engineered orthogonal tyrosyl-tRNA synthetases. Sci Rep. 5: 12632.ArticlePubMedPMCPDF
  • Rezhdo A, Islam M, Huang M, Van Deventer JA. 2019. Future prospects for noncanonical amino acids in biological therapeutics. Curr Opin Biotechnol. 60: 168–178. ArticlePubMedPMC
  • Robertson WE, Funke LFH, de la Torre D, Fredens J, Elliott TS, et al. 2021. Sense codon reassignment enables viral resistance and encoded polymer synthesis. Science. 372(6546): 1057–1062. ArticlePubMedPMC
  • Rodnina MV. 2018. Translation in prokaryotes. Cold Spring Harb Perspect Biol. 10(9): a032664.ArticlePubMedPMC
  • Rogers JM, Nawatha M, Lemma B, Vamisetti GB, Livneh I, et al. 2021. In vivo modulation of ubiquitin chains by N-methylated non-proteinogenic cyclic peptides. RSC Chem Biol. 2(2): 513–522. ArticlePubMedPDF
  • Saleh AM, Wilding KM, Calve S, Bundy BC, Kinzer-Ursem TL. 2019. Non-canonical amino acid labeling in proteomics and biotechnology. J Biol Eng. 13: 43.ArticlePubMedPMC
  • Shandell MA, Tan Z, Cornish VW. 2021. Genetic code expansion: A brief history and perspective. Biochemistry. 60(46): 3455–3469. ArticlePubMedPDF
  • Sigal M, Matsumoto S, Beattie A, Katoh T, Suga H. 2024. Engineering tRNAs for the ribosomal translation of non-proteinogenic monomers. Chem Rev. 124(10): 6444–6500. ArticlePubMedPDF
  • Sisila V, Indhu M, Radhakrishnan J, Ayyadurai N. 2023. Building biomaterials through genetic code expansion. Trends Biotechnol. 41(2): 165–183. ArticlePubMed
  • Soni C, Prywes N, Hall M, Nair MA, Savage DF, et al. 2024. A translation-independent directed evolution strategy to engineer aminoacyl-tRNA synthetases. ACS Cent Sci. 10(6): 1211–1220. ArticlePubMedPMCPDF
  • Spinck M, Piedrafita C, Robertson WE, Elliott TS, Cervettini D, et al. 2023. Genetically programmed cell-based synthesis of non-natural peptide and depsipeptide macrocycles. Nat Chem. 15: 61–69. ArticlePubMedPDF
  • Srinivasan G, James CM, Krzycki JA. 2002. Pyrrolysine encoded by UAG in Archaea: Charging of a UAG-decoding specialized tRNA. Science. 296(5572): 1459–1462. ArticlePubMed
  • Switzer HJ, Howard CA, Halonski JF, Peairs EM, Smith N, et al. 2023. Employing non-canonical amino acids towards the immobilization of a hyperthermophilic enzyme to increase protein stability. RSC Adv. 13: 8496–8501. ArticlePubMedPMC
  • Tang H, Zhang P, Luo X. 2022. Recent technologies for genetic code expansion and their implications on synthetic biology applications. J Mol Biol. 434(8): 167382.ArticlePubMed
  • Wang L, Brock A, Herberich B, Schultz PG. 2001. Expanding the genetic code of Escherichia coli. Science. 292(5516): 498–500. ArticlePubMed
  • Wang YS, Fang X, Wallace AL, Wu B, Liu WR. 2012. A rationally designed pyrrolysyl-tRNA synthetase mutant with a broad substrate spectrum. J Am Chem Soc. 134(6): 2950–2953. ArticlePubMedPMC
  • Wang HH, Isaacs FJ, Carr PA, Sun ZZ, Xu G, et al. 2009. Programming cells by multiplex genome engineering and accelerated evolution. Nature. 460(7257): 894–898. ArticlePubMedPMCPDF
  • Wang YS, Russell WK, Wang Z, Wan W, Dodd LE, et al. 2011. The de novo engineering of pyrrolysyl-tRNA synthetase for genetic incorporation of L-phenylalanine and its derivatives. Mol Biosyst. 7(3): 714–717. ArticlePubMed
  • Wang L, Schultz PG. 2001. A general approach for the generation of orthogonal tRNAs. Chem Biol. 8(9): 883–890. ArticlePubMed
  • Wang L, Xie J, Schultz PG. 2006. Expanding the genetic code. Annu Rev Biophys Biomol Struct. 35: 225–249. ArticlePubMed
  • Ward FR, Watson ZL, Ad O, Schepartz A, Cate JHD. 2019. Defects in the assembly of ribosomes selected for β-amino acid incorporation. Biochemistry. 58(45): 4494–4504. ArticlePubMed
  • Willis JCW, Chin JW. 2018. Mutually orthogonal pyrrolysyl-tRNA synthetase/tRNA pairs. Nat Chem. 10: 831–837. ArticlePubMedPMCPDF
  • Xu B, Liu L, Song G. 2021. Functions and regulation of translation elongation factors. Front Mol Biosci. 8: 816398.ArticlePubMed
  • Yanagisawa T, Ishii R, Fukunaga R, Kobayashi T, Sakamoto K, et al. 2008a. Crystallographic studies on multiple conformational states of active-site loops in pyrrolysyl-tRNA synthetase. J Mol Biol. 378(3): 634–652. ArticlePubMed
  • Yanagisawa T, Ishii R, Fukunaga R, Kobayashi T, Sakamoto K, et al. 2008b. Multistep engineering of pyrrolysyl-tRNA synthetase to genetically encode Nɛ-(O-azidobenzyloxycarbonyl) lysine for site-specific protein modification. Chem Biol. 15(11): 1187–1197. ArticlePubMed
  • Young DD, Young TS, Jahnz M, Ahmad I, Spraggon G, et al. 2011. An evolved aminoacyl-tRNA synthetase with atypical polysubstrate specificity. Biochemistry. 50(11): 1894–1900. ArticlePubMed
  • Zhang Y, Malamakal RM, Chenoweth DM. 2015. Aza-glycine induces collagen hyperstability. J Am Chem Soc. 137(39): 12422–12425. ArticlePubMed

Figure & Data

References

    Citations

    Citations to this article as recorded by  
    • Advancing microbial engineering through synthetic biology
      Ki Jun Jeong
      Journal of Microbiology.2025; 63(3): e2503100.     CrossRef

    • ePub LinkePub Link
    • Cite this Article
      Cite this Article
      export Copy Download
      Close
      Download Citation
      Download a citation file in RIS format that can be imported by all major citation management software, including EndNote, ProCite, RefWorks, and Reference Manager.

      Format:
      • RIS — For EndNote, ProCite, RefWorks, and most other reference management software
      • BibTeX — For JabRef, BibDesk, and other BibTeX-specific software
      Include:
      • Citation for the content below
      Expanding the genetic code: In vivo approaches for incorporating non-proteinogenic monomers
      J. Microbiol. 2025;63(3):e2501005  Published online March 28, 2025
      Close
    • XML DownloadXML Download
    Figure
    Expanding the genetic code: In vivo approaches for incorporating non-proteinogenic monomers
    Image Image Image
    Fig. 1. Classification of non-proteinogenic monomers (npMs). Non-proteinogenic monomers (npMs) are categorized into non-proteinogenic amino acids (npAAs) and exotic monomers (exMs). npAAs include non-canonical L-amino acids, D-amino acids, and β-amino acids, while exMs include α- and β-hydroxy acids. The numbers refer to the corresponding chemical structures as indicated in the main text.
    Fig. 2. Schematic representation of the orthogonal translation system for the in vivo incorporation of non-proteinogenic monomers (npMs). Orthogonal aminoacyl-tRNA synthetases (aaRS) and their corresponding tRNAs are engineered to aminoacylate non-proteinogenic monomers (npMs) while avoiding interference from endogenous aaRS/tRNA pairs and canonical amino acids. The aminoacylated tRNAs carrying npMs are delivered to the ribosome, where they occupy the A-site through interactions with elongation factor Tu (EF-Tu) and GTP. The translation process incorporates the npMs into the growing peptide chain, resulting in novel biopolymers with expanded chemical diversity.
    Fig. 3. Overview of translation-independent tRNA acylation and screening methodologies. To engineer aminoacyl-tRNA synthetases (aaRSs) or screen orthogonal aaRS/tRNA pairs, translation-independent methodologies have been utilized. These methods exploit the chemical distinctions between aminoacylated and non-aminoacylated tRNAs to enable precise selection and analysis. Non-aminoacylated tRNAs are selectively oxidized at their 3′ end, rendering them incapable of extension, while aminoacylated tRNAs remain unmodified, allowing subsequent extension. tREX facilitates screening by differentiating aminoacylated tRNAs through either by size separation via gel electrophoresis or by using biotinylated oligonucleotides. Building upon tREX, the bio-mREX approach improves screening efficiency by linking mRNAs encoding aaRS variants to their corresponding tRNAs. This technique employs split tRNAs with an intervening loop to assign specific mRNAs to each tRNA, and utilizes biotinylated extensions on aminoacylated tRNAs, which can then be captured using streptavidin beads. The START methodology employs sequence-barcoded tRNAs by assigning unique barcodes to the anticodon region, as PylRS tolerates additional nucleotides. This approach utilizes next-generation sequencing (NGS) to directly connect the genotype of aaRS mutants to their acylation activity, enabling high-throughput analysis and facilitating directed evolution.
    Expanding the genetic code: In vivo approaches for incorporating non-proteinogenic monomers
    Application Engineered peptide Introduced npMs Additional features Method Ref
    Biomaterial D-peptide hydrogelator D-Phe, D-Lys, D-Tyr phosphate Phosphatase substrate compatibility, resistant to proteolytic degradation SPPS Li et al. (2013)
    Collagen mimetic peptide Aza-glycine Triple-helix thermal stability SPPS Zhang et al. (2015)
    Peptide linker for hydrogels D-Val, D-Pro, D-Met, D-Ser, D-Arg Resistance to enzymatic degradation, increased cytotoxicity SPPS Bomb et al. (2023)
    Therapeutic Agonist of GLP-1 receptor cyclic β-amino acids Native-like potency, proteolysis resistance SPPS Johnson et al. (2014)
    Aurein 1.2 peptide analog β-Leu, β-Lys, β-Ile, β-Ala, β-Ser, β-Phe, cyclic β-amino acids Increased helical stability, improved selectivity, reduced hemolysis SPPS Lee et al. (2017)
    Ubiquitin chain binding peptide D-Ala, D-Phe, N-methyl-Gly, N-methyl-Ala High affinity, protease resistance, membrane permeability FIT Rogers et al. (2021)
    Ganglioside GM1-binding peptide tag β-Trp, β-Tyr, β-Lys Binding affinity, protease resistance SPPS Hetényi et al. (2022)
    Inhibitor of SARS-CoV-2 main protease cyclic γ-amino acids, cyclic β-amino acids High affinity, proteolytic stability, prolonged serum stability FIT Miura et al. (2023, 2024b)
    Inhibitor of interferon gamma receptor 1 cyclic β-amino acids, cyclic γ-amino acids Enhanced binding affinity, inhibitory activity, prolonged serum stability FIT Miura et al. (2024a)
    Table 1. Engineered peptides for practical applications through in vitro integration of non-proteinogenic monomers


    Journal of Microbiology : Journal of Microbiology
    TOP