Review Article

Making Circles: Recent Advance in Chemical and Enzymatic Approaches in Peptide Macrocyclization

Giang KT Nguyen1 and Clarence TT Wong2*

1Wilmar International, 1, Research Link, Temasek Life Sciences Laboratory, Singapore

2Department of Chemistry, The Chinese University of Hong Kong, Hong Kong

*Corresponding author: Clarence TT Wong, Department of Chemistry, The Chinese University of Hong Kong, Hong Kong, Tel: + 852 39431326; E-mail: ClarenceTTW@cuhk.edu.hk

Citation: Nguyen GKT, Wong CTT (2017) Making Circles: Recent Advance in Chemical and Enzymatic Approaches in Peptide Macrocyclization. J Chem Allied Res 2017: 1-13. doi:https://doi.org/10.29199/JCAR-101011

Received: 27 July, 2017; Accepted: 29 August, 2017; Published: 13 September, 2017

Abstract

Macrocyclic peptides have emerged as an important class of molecules for drug discovery due to their enhanced stability and bioavailability. Since the 1990s, ligation chemistries have been extensively studied for preparing peptide macrocycles. The ligation chemistries usually go through an energetically favored capture coupled with an acyl transfer reaction to overcome the disfavored entropy ring formation and facilitate the formation of the cyclic amide bond. Concurrently with chemical methods, several enzymatic approaches utilizing peptide ligases have also been explored. These peptide ligases are enzymes that involve in the production of naturally-occurring cyclic peptides in diverse organisms. Many enzymes have been isolated and proved to be highly efficient in the production of cyclic peptides. Here, we review the recent advance in chemical ligation and enzymatic approaches in making cycles.

Introduction

In the last few decades, peptides have been the center of drug discovery, and they have been applied in a wide range of areas including medicine and biotechnology. Peptides are now one of the most popular candidates that are dominating the clinical trials, and increasing amounts of FDA approved peptide-based drugs are being released to the market. Surprisingly, before early 80’s, peptides were not being defined as druggable molecules because of their instability and poor bioavailability. They are easily damaged by chemical and physical denaturation, hydrolysis by enzymatic degradation, low membrane penetration, and poor gut absorption. The turning point leading to such dramatic change in the perspective of peptides was initiated by the introduction of the paradigm shifting technology named the Solid Phase Peptide Synthesis (SPPS) [1]. It allows routine and automation chemical synthesis of a long peptide chain. The ease of peptides perpetration makes researchers able to design or identify novel peptides through rational design and combinatorial peptide library. Most importantly, the SPPS allows researchers to perform extensive structural and side-chain modifications as well as the use of unnatural amino acids on the peptides, which marked the beginning of turning peptide and protein from “food” to drug. Among all the modification strategy, backbone cyclization is one the most common and efficient approaches to turn peptides into druggable compounds.

Macrocyclic peptides have emerged as an important class of molecules in drug discovery due to their enhanced conformational rigidity and resistance against proteolytic degradation. In many cases, macrocyclization increases the receptor binding affinity, confers druglike properties, and improves oral bioavailability [2,3]. A large number of natural macrocyclic peptides ranging from 5-78 residues have been found in diverse organisms [4]. A large variety of bioactivities including anticancer, antimicrobial, immunosuppressive, oxytocin receptor and neurotensin receptor binding have been identified in the cyclic peptide families. Furthermore, large macrocycles are also well suited for inhibiting protein-protein interactions, which are often defined as “undruggable” targets by conventional drug-discovery approaches [5]. To conduct more detailed study on these cyclic peptides, chemical synthesis was the only way to cyclize peptides, which was a formidable challenge in the early days. The classical method involves the use of side-chain protected peptides to perform cyclization through direct lactamization between the N- and C-termini via strong enthalpic activation. This method was widely used in the 80’s to synthesize cyclic peptide sizes ranging from 5-12 amino acids [6]. However, the efficiency of this approach is highly sequence-dependence, which requires extensive time and effort to optimize the coupling conditions to minimize the formation of ambiguous polymers, epimerization, and to enhance solubility. Moreover, this method is only capable of cyclizing peptides < 15 amino acids due to the disfavored enthalpy of large ring size. To overcome these problems, advanced chemical and enzymatic ligations were employed to perform intramolecular ligation between N- and C- termini [7]. In this review article, state-of-the-art methodologies including chemical ligation reactions and ligases-mediated ligation, which have been used to produce end-to-end backbone cyclic peptides will be discussed.

Chemical Ligations

Distinguish themselves from the classical approach, the chemical ligation is capable of joining two unprotected peptide fragments unambiguously without the use of any coupling reagent. In general, chemical ligation is a two-step reaction that involves the chemoselective capture reaction followed by the proximity-driven acyl transfer reaction to generate a native amide bond at the ligation site [8]. The chemoselective capture forms a new covalent bond between an orthogonal pair that brings the two peptides together but distinguishes themselves from the amino acids side-chains that bear different functionalities. After the capture reaction, a spontaneous, energetically-favored proximity-driven O-to-N or S-to-N acyl transfer occurs and results in an amide bond. Such acyl transfer reactions are commonly found in enzymatic reactions of bond breaking and forming, therefore they are also called biomimetic ligations. In the 1980’s, the acyl transfer was first demonstrated by Kemp et al., by using an organic template to join peptides together via O-to-N acyl transfer [9]. Based on Kemp’s reaction, a series of biomimetic ligation chemistries were developed and are being “evolved”. In this article, the chemical ligations are grouped into two categories including O-to-N and S-to-N acyl transfer-mediated chemical ligation.

O-to-N Acyl Transfer-Mediated Ligation

Pseudoproline ligation: In 1993, Tam developed the pseudoproline ligation that can be performed in water, without the use of protecting groups, coupling reagent, and organic templates. The reaction occurs between an ester aldehyde and the bifunctional N-terminal amino acid bearing a free amine and a beta-hydroxyl or a beta-thiol group (Cys, Thr, or Ser) [10,11]. The amino group at N-terminus forms a reversible imine with the C-terminal ester aldehyde. Then the nucleophilic group at the beta position attacks the imine to form an oxazolidine or thiazolidine ring. Then a spontaneous five-member ring-driven O-to-N acyl transfer occurs and leads to the formation of a stable amide bond with a thiazolidine/oxazolidine (Figure 1A). Botti et al., demonstrated the use of pseudoproline ligation to perform intramolecular ligation using the amino acid sequence derived from the V3 loop on gp120 of HIV-1 to form a series of cyclic peptides in 1996. The result showed that the ligation method could cyclize peptides from the smallest ring of 5 to the largest ring of 26 amino acids [12].

Serine/Threonine ligation: In 2010, Li reported a modified version of pseudoproline ligation that is capable of generating a natural Ser/Thr amino acid at the ligation site named Ser/Thr Ligation (STL) [13]. Instead of an ester aldehyde, they put a Salicylaldehyde (SAL) at the C-terminus of a peptide. Similar to pseudoproline ligation, the aldehyde on SAL forms an oxazolidine ring with the N-terminal Ser/Thr in 1:1 pyridine: acetic acid as solvent. Then 6-member ring O-to-N acyl transfer occurs that results in an amide-linked stable benzylidene acetal intermediate. The unnatural benzylidene acetal linkage can be removed by acid treatment, which then generates a natural amide bond with Ser/Thr at the ligation site (Figure 1B) [14]. STL has been utilized in synthesizing different proteins [15-17]. It has been used in cyclizing natural occurring anticancer peptides yunnanin C, cyclomontanin B and cyclic peptide analogues [18-20]. This work demonstrated the ease of generating an analogous library of yunnanin C by using a single cyclization reaction condition. Lam et al., reported the first chemical synthesis of FDA-approved cyclic lipopeptide daptomycin via STL cyclization. The success of synthesizing daptomycin allows the study of structure-activity relationship and analogous design [21]. In 2013, our group showed that with the unique STL ligation mechanism, highly constrained cyclic tetrapeptides could be cyclized without the use of any flexible residues (Gly), turning residues (Pro), tertiary amide, or other d-amino acids [22]. This plausible cyclization can overcome the disfavored energy process by going through the ring-contraction strategy by the prior formation of the 16-membered ring during the oxazolidine formation. Then the 16-membered ring undergoes an O-to-N acyl transfer that results in a 12-membered ring cyclic tetrapeptide after acid treatment. 

The ligation mechanisms of different ligation methodologies

Figure 1: The ligation mechanisms of different ligation methodologies.

 

α-ketoacid-hydroxylamine (KAHA) ligation: Bode’s group reported the new and robust KAHA ligation to ligate peptides [23]. Three different types of KAHA ligation was introduced with different N-terminal hydroxylamines including free hydroxylamine, O-substituted hydroxylamine, and the cyclic alkoxyamine. These reactions proceed different reaction pathways to form an amide bond on unprotected peptides. In general, the reaction takes place between the N-terminal hydroxylamine and the C-terminal α-ketoacid and results in a native amide bond with homoserine at the ligation site (Figure 1C). KAHA ligation has been used in the synthesis of gramicidin S, tyrocidine A, Hymenamide B, semi gramicidin S, stylostatin A as well as a library of cyclic peptide analogues [24,25].

S-To-N Acyl Transfer-Mediated Ligation

Native chemical ligation: After the introduction of pseudoproline ligation in 1994, Kent and coworkers introduced the Native Chemical Ligation (NCL) in the same year, and Tam et al., reported it shortly in 1995 [26,27]. It was the first ligation that generates natural amino acid at the ligation site. NCL is playing a significant role in peptide chemistry, and it is the most used ligation method in protein synthesis [28-30]. The NCL uses a highly chemoselective thiol-thioester exchange reaction which occurs between the N-terminal Cys and the C-terminal thioester and results in a thioester-linked intermediate. Then the thioester undergoes a spontaneous, proximity-driven S-to-N acyl transfer (Figure 1D). The superior nucleophilicity of thiol in basic buffer permits a robust and efficient thiol-thioester exchange reaction as well as an energetically-favored S-to-N acyl transfer. In 1999, Tam and co-workers performed macrocyclization of natural-occurring cyclic peptides, called cyclotide, with over 30 amino acids with three intramolecular disulfide bonds using NCL [31]. During the synthesis of the four cyclotides, kalata B1, cyclopsychotride, circulin A, and circulin B, they discovered that the cyclization rate and yield were enhanced in the presence of intramolecular Cys residues. In the same year, the “thia-zip” mechanism was proposed, which hypothesized that trans-thioesterification reaction occurs intramolecularly and becomes thiolactone macrocycles [32]. The reversible trans-thioesterification chain-reaction begins from the adjacent Cys and ends at the N-terminal Cys. Lastly, an S-to-N-acyl transfer occurs at the N-terminal Cys and leads to the formation of an amide bond (Figure 2). Such ring-expansion process brings the two ends in proximity which allows the cyclization to be performed at a great distance of peptide chain. Cyclotides and other biological imperative cyclic peptides were also synthesized and engineered by NCL-mediated cyclization which includes conotoxin, sunflower trypsin inhibitor, defensins, MCOTI-I, II, and a series of engineered cyclic peptides [33-37]. The collection of works showed that the cyclized peptides have higher serum stability and some of the analogs showed superior oral-bioavailable and membrane penetration. Besides the traditional solution phase NCL, Barany and coworker reported the solid phase on-resin NCL [38]. This method requires the attachment of resin using the side-chain of Fmoc-Asp-allyl, and after peptide elongation, the C-terminal allyl protecting group can be removed, and thiophenol can be coupled to the C-terminus to form a peptide thioester on-resin followed by NCL cyclization.

The mechanism of thia-zip macrocyclization

Figure 2: The mechanism of thia-zip macrocyclization. When peptide containing a N-terminal Cys and C-terminal thioester (A), thiol-thioester exchange reaction occurs between the nearby thiolate and the thioester in aqueous buffer at pH ~7.4-8.5 and forms a thiolactone ring (B). The reversible reaction shuffles through an energetically-favored thiolactone ring-expansion process from the nearby Cys to the N-terminal Cys (C-D). Then spontaneous S-to-N acyl transfer occurs (E) and leads to the formation of a native amide bond (F). The orange color indicates the ring size and the ring expansion process.

 

Despite the high efficiency of NCL, the synthesis of base-labile peptide thioester requires the use of boc-chemistry, which requires the use of the strong acid such as hydrogen fluoride or TfOH for the side-chain deprotection. Therefore, new methods of peptide thioester preparation based on fmoc-chemistry were introduced in the last decade (Figure 3). Tate’s group proposed the use of “Safety-catch linker” to perform NCL and cyclization of cysteine knot MCoTI-II [39,40]. The C-terminal sulfonamide can be activated by iodoacetonitrile and cleaved by 3-mercaptopropionate to form a peptide thioester. Another well-studied approach is to use the N-to-S acyl transfer reaction to generate peptide thioester. N-4,5-Dimthoxy-2-Mercaptobenzyl (Dmmb) [41], N-methyl Cys [42], and Weinreb amide derivatives [43] have been used as the surrogate to catalyze the acyl transfer and demonstrated in the synthesis of model peptides. In 2010, Melnyk and Liu reported the use of bis(2-Sulfanylethyl)Amino (SEA) or N, N-Bis(2-Mercaptoethyl)-Amides (BMEA) to generate peptide thioester independently [44-47]. The C-terminus surrogate group at pH 7 and in the presence of reducing agents, it undergoes N-to-S acyl transfer to generate peptide thioester. Because of the controllable N-to-S acyl transfer, the SEA ligation is capable of performing tandem ligations for large protein synthesis and controlled peptide cyclization [48]. Taichi et al., designed the Thioethylalkylamido (TEA) linker that can be used with a milder acidic condition of pH 3.0 [49]. They have successfully used this method to produce cyclic peptides including kalata B1, cyclic omega-conotoxin MVIIA, and engineered sunflower trypsin inhibitor [34,50,51]. Recently, Liu and co-workers introduced the use of peptide hydrazide to generate peptide thioester via a one-pot, two-step mechanism. Under acidic pH, the peptide hydrazide reacts with NaNO2 to generate acyl azide at the C-terminus [52-55]. In the presence of thiols, it attacks acyl azide to give peptide thioester. Then by going through NCL mechanism, proteins or cyclic peptides can be synthesized. In 2011, Zheng et al., uses the hydrazide methods to synthesize a series of cyclic peptides with sizes ranging from 5 up to 42 amino acids with ~20 % yield [56]. By using this methods, Möbius, and bracelet types of cyclotides, as well as the MCoTI-II trypsin inhibitor were synthesized.

Cysteine-free ligation: Despite the popularity of NCL, it is limited by the low abundance of Cys in natural peptides and proteins. Thus, tremendous effort was made to develop methods that can be performed NCL without Cys at the ligation site. In 1998, Tam and Yu reported the methionine ligation. They used a homocysteine to perform NCL followed by S-methylation to generate the methionine at the ligation site (Figure 1E) [57]. Another approach is to prepare thiol-containing unnatural amino acids to perform NCL followed by desulfurization to create a natural amino acid (Figure 1F). Dawson et al., demonstrated the use of Raney nickel or palladium aluminium to convert Cys to Ala. They have successfully synthesized the 21-amino acid cyclic antibiotic microcin J25 [58]. Quaderer and Hilvert used selenocysteine instead of cysteine to perform NCL in much quicker reaction rate to form a cyclic RGD motif peptide. Then deselenization was conducted using Raney nickel to generate an alanine from selenocysteine [59]. Danishefsky and co-worker reported the use of Tris(2-Carboxyethyl) Phosphine (TCEP) and a radical initiator VA-044 to perform desulfurization [60]. They demonstrated this powerful desulfurization by converting Cys to Ala on intermolecular ligation as well as peptide cyclization of natural-occurring cortogossamide.

The scheme of five different approaches to generate peptide thioester for NCL.

Figure 3: The scheme of five different approaches to generate peptide thioester for NCL.

 

Traceless Staudinger ligation: Traceless Staudinger ligation was developed by Bertozzi based on the Staudinger reaction. The ligation performs between C-terminal thioester-phosphine and N-terminal azide to form an iminophosphorane intermediate followed by aza-ylide-thioester transfer to form a native amide bond (Figure 1G) [61]. Hackenberger employed the traceless Staudinger ligation to perform peptide cyclization on protected or unprotected peptides [62].

2-Formylthiophenol ligation: Recently, Tung et al., reported the 2-formylthiophenol ligation that peptide formylthiophenol thioester can react with an amino group to form an amide bond regardless of N-terminal amino acid, including Pro [63]. Authors demonstrated the synthesis of five different sizes (5-11 amino acids) of natural-occurring anticancer peptides with over 40 % isolated yield. The formation of hemiaminal between aldehyde and amine was suspected to be the intermediate followed by S-to-N acyl transfer to generate an amide bond in a basic organic solvent. However, this reaction mechanism cannot distinguish the amino group between N-terminal amine and side-chain of Lys. Therefore, the cyclic peptides generated cannot contain internal Lys within the sequence.

Enzymatic Ligation

Chemoenzymatic approach using a peptide ligase would provide an attractive alternative and would be useful for cyclization of non-cysteine containing peptides. Furthermore, the use of enzymes as biocatalysts for cyclization has received increasing attention due to their high chemoselectivity, non-toxic, and cost-effectiveness. In this review, we focus on the recent progress on chemoenzymatic methods for the preparation of macrocyclic peptides and proteins.

Intein

Inteins are autocatalytic protein splicing elements. They induce self-excision and subsequent ligation of two flanking sequences, named exteins, by the formation of a native peptide bond. Protein splicing was first observed in 1988 in yeast trifluoperazine-resistant TFP1-408 gene [64-66], and was confirmed in the yeast vacuolar membrane ATPase VMA-1 gene [67] and yeast TFP-1 gene in 1990 [68]. Up to date, there are more than three hundred inteins identified with wide occurrence in all three domains of living organisms [69]. The general mechanism of intein-splicing has been elaborated as a four-step acyl shift reaction: (1) N-O/S acyl shift between N-extein and intein to afford an active ester/thioester bond at the splicing site where the amide bond is in cisoid conformation mediated by intein [70], (2) Transesterification between the first residue of C-extein (Cys, Ser) and newly formed ester/thioester bond at N-splicing site, which resulted in the formation of branched intermediate, (3) N-N acyl shift at C-terminal Asn of intein to form a succinimide and spontaneously releases intein from the branched intermediate, (4) S/O-N acyl shift in proximity to afford an amide bond between N- and C-extein. In application, inteins are usually engineered into shorter mini-inteins by removing the homing endonuclease element [71] or split into two for trans-splicing [72,73].

Two recombinant approaches are developed for intein-mediated protein cyclization, including Expressed Protein Ligation (EPL) [74] and intein-mediated Protein Trans-Splicing (PTS) [75]. The target protein is coexpressed with selected intein at C-terminus or both termini as fusion proteins, followed by in vitro or in vivo cyclization reaction. Intein mediates formation of ester/thioester at C-terminus of the target protein, which requires an incoming nucleophile at the N-terminus. Thus, the target protein needs to start with Cys or Ser/Thr [76-79]. By constructing a series of precursor peptides from cyclotide hedyotide B1, it has been shown that the C-terminal residues adjacent to intein also influence cyclization efficiency. Steric effect generated by β-branch and backbone hindrance remarkably reduce the cyclization yield [80]. The intein-mediated splicing process has been developed into various technologies including affinity tags [81-87], ligation and cyclization [74,75,78,88-90], protein semi-synthesis and biosensors [91,92]. Nevertheless, the requirement of recombinant expression instead of independent usage of intein limits its application to be a more commonly used ligation tool. Its substrate requirement for N-terminal Cys/Ser/Thr also narrows its versatility.

Subtiligase

Subtiligases are engineered enzymes derived from the protease subtilisin BPN’ isolated from Bacillus subtilis. The first ligase was designed and named thiolsubtilisin by converting an active hydroxyl functional moiety in the active triad to thiol in organic condition [93,94], and later modified through S221C single mutation that allows ligation to take place in aqueous condition [95]. The double mutation S221C and P225A was introduced by Wells’ group in the 1990s that further increase the catalytic efficiency [96]. Structural study by X-ray crystallography showed the ligation mechanism involving a thiol-mediated nucleophilic attack of amide/ester bond in substrate resulted in the formation of thioester intermediate followed by aminolysis by the N-terminal amine of the incoming peptide. C-terminal modification to glycolate-Phe-amide ester facilitates subtiligase-mediated ligation. This optimized subtiligase displays fast catalytic activities (kcat of up to 20 s-1) making it a powerful tool in protein semi-synthesis and engineering [97]. In general, the subtiligase-mediated ligation takes place in aqueous buffer with pH 8.0 at room temperature for 1 to 3 h.

Peptide

Sequence

Length

Yield (%)

1

ADVQAVCSQKNV-CO-Glc-F-NH2

12

36

2

ACKNGQTNCYQSY-CO-Glc-F-NH2

13

68

3

KETAAAKFERQHMDSSTSAA-CO-Glc-F-NH2

20

56

4

STMSITDCRETGSSKYPNCAY-CO-Glc-F-NH2

21

75

5

SSSNYCNQMMKSRNLTKDRCKPVNTFVHESL-CO-Glc-F-NH2

31

85

Table 1: Subtiligase-mediated cyclization of peptides with C-terminal esters.

 

The reaction was performed in the presence of 5 µM subtiligase, 2-5 mM peptide substrate, 100 mM tricine buffer, pH 8.5. Reaction was conducted at 25°C for 1-2 h and monitored by HPLC.

*Glc = glycolate ester.

Subtiligase has been shown to cyclize peptides between 12 and 31 amino acids with yields of 30%-85% (Table 1). The catalytic efficiency of subtilisin was found to be influenced by residues at P1-P4 and P1'-P3' positions. These sites have been extensively investigated by peptide library and phage display [96,98]. Mutagenesis study further expands the substrate selectivity of subtiligase and enhance its stability (Table 2) [96,99]. It also requires an extended structure at the N-terminus of the substrate. Ligation site within a helix should be avoided during experimental design. Subtiligase mediates fast reactions of protein terminal labelling [100], cyclization [101], and preparation of peptide thioester and thioacid [102,103]. Due to the large binding surface involving seven sites, the minimal length of substrate for macrocyclization requires 12 residues [101]. The requirement of C-terminal activation by esterification can be easily achieve using chemical synthesis but remains a limitation to the engineering of recombinant-prepared substrates.

Position

Favorable residue

Unfavorable residues

P4

Hydrophobic residues

-

P3

Flexible

P2

Flexible

P1

M, Y, L

G, P, T, V, L

P1'

M, F, K, L, R, A, S, W, Y

E, D, P, N, T, I, V

P2'

V, H, Y, S, A, F, N

P, G

P3'

Flexible

Table 2: Summary of the substrate specificity of subtiligase BPN’ (S221C/P225A).

 

Sortase A

Sortase A (SrtA) is a membrane-bound Cys transpeptidase found in Gram+ bacteria Staphylococcus aureus. It anchors proteins with a sorting sequence LPXTG (X can be E, D, K, A, N or Q) to the bacterial cell wall [104]. Sortase A mediates the transpeptidation by forming a thioester linkage between Cys184 of its active site and Thr in the LPXTG motif [105]. The protein-srtA thioester intermediate is prone for nucleophilic attack by a penta-Gly branch of lipid II that leads to attachment of protein to the glycopeptide region of lipid II followed by transferring of protein to cell wall peptidoglycan [106]. Although, its natural function is not a cyclase, sortase A has been used for cyclization of various peptides and proteins.

Sortase A-Mediated Ligation (SML) has been firstly demonstrated by Mao et al., using short peptides RE(Edans)LPKTGnK(Dabcyl)R (n=0 to 5) with Gn(n=0 to 5)-peptide [107]. They used fluorescence to measure sortase-mediated cleavage rate and HPLC to monitor the ligation rate. The results showed that the minimal requirement for N-terminal nucleophile is a single aminoglycine and the length of N-terminal Gly affects ligation rate. An earlier Kinetic study revealed that S1' position of sortase A is specific to Gly and S2' position also favors Gly [108], suggesting that sortase A prefers GG in ligation reaction.

Peptide

Sequence

Substrate: Enzyme Ratio

Time (h)

Yield (%)

1

GGG-eGFP-LPETGLE-His6

0.375:1

3

40

2

G-Cre-LPETG-His6

1:1

21.5

90

3

GGG-eGFP-LPETG-His6

1:1

24

90

4

G-His6-UCHL3-LPETG-XX

1:3

4

10

5

 

GGDSHEKRHHGYRRKFHEKHHS-HREFPFYGDYGSNYLYDNLPETGG

3:1

 

21

 

90

 

6

GGVTSAPDTLPKTGGS

2.5:1

21

67

7

GG-SFTI1-LPTEGG

3:1

9

70

8

GGG-KB1-TGG

3:1

24

80

9

G-Vc1.1-GLPETGGS

50:1

24

50

Table 3: Examples of sortase-mediated cyclization of peptides and proteins.

 

SML has been developed for protein labelling [109-111], macrocyclization [112,113] and ligation to peptides or non-peptidyl compounds (lipid, amino-poly (ethylene glycol), G3-polystyrene beads) [112,114]. It is compatible to both recombinant and chemically prepared peptides and proteins. However, sortase A has a long recognition signal and low the catalytic efficiency, which often requires equal molar ratio of enzyme to the substrate for efficient cyclization (Table 3). Recently, there has been extensive effort to improve the catalytic properties of sortase A using direct evolution by several independent research groups.

GmPOPB

GmPOPB is a member of the prolyl oligopeptidase isolated from the poisonous mushroom Galerina marginata [115,116]. It shares 36% and 37% sequence identity to PCY1 and porcine POP, respectively. GmPOPB is involved in the biosynthesis of α-amanitin toxin, an 8-residue cyclic peptide and a potent inhibitor of RNA polymerase II and III [117-119]. The amanitin precursor contain a 10-residue leader propeptide, a mature core peptide of 8 residues, and a 17-residue C-terminal propeptide [120-122]. Recombinant GmPOPB cleaves the amanitin precursor by a two-step process after two internal Pro residues in an ordered manner to produce the mature cyclic peptide. In the first processing event, GmPOPB act as a protease cutting after the first Pro residue to release a 25-residue intermediate consisting of the core peptide and the C-terminal prosequence. In the second event, GmPOPB act as a ligase, which cleaves after the second Pro and consecutively ligates with the N-terminal Ile to complete the macrocyclization.

GmPOPB is C-terminal specific for Pro at the P1 position and able to accommodate precursor peptides of up to 35 amino acids. Little is known about the substrate requirement of the core peptide beyond the C-terminal Pro. It has been tested to cyclize only one example, an 8-residue cyclic peptide IWGIGCNP. On the other hand, there is better understanding of the C-terminal recognition sequence. Cyclization screening of a set of peptide substrates with varying precursor sequences showed that GmPOPB displays a very stringent substrate requirement for the 17-residue C-terminal propeptide, which adopts an α-helical structure (Table 4). Truncation of the propeptide by five amino acids or changing of selected residues to Gly cause dramatic reduction in the cyclization efficiency. It has also been shown that the C-terminal Cys of the propeptide is highly conserved and essential for the substrate recognition. Changing this amino acid to Ala causes loss of activity. Interestingly, GmPOPB able to cyclize the α-amanitin precursor (AbAMA1) from a distantly related species Amanita bisporigera, indicating the secondary structure rather than the exact primary sequence being important for cyclization. The major advantages of GmPOPB is its ability to cyclize small peptides with relative high efficiency. Kinetic analysis showed that the Km and kcat values for GmPOPB for the native substrate are about 5.6 s-1 and 25 µM, respectively. However, poor understanding of the substrate scope limits its application for synthesis of macrocyclic peptides. GmPOPB fails to cyclize phallacidin, a related mushroom toxin and thus it remains to be determined of GmPOPB can cyclized peptide unrelated to α-amanitin.

Peptide

Sequence

Major Product

Yield/Remark

1

MFDTNATRLPIWGIGCNPWTAEHVDQTLASGNDIC

cyclo(IWGIGCNP)

>90%

2

IWGIGCNPWTAEHVDQTLASGNDIC

cyclo(IWGIGCNP)

>90%

3

NATRLPIWGIGCNPWTAEHVDQTLASGNDIC

None

No cyclization

4

MFDTAAAAAPIWGIGCNPWTAEHVDQTLASGNDIC

cyclo(IWGIGCNP)

<2%

5

MFDTNATAAPIWGIGCNPWTAEHVDQTLASGNDIC

cyclo(IWGIGCNP)

<2%

6

MFDTNATRLPIWGIGCNPWTAEHVD

IWGIGCNPWTAEHVD

No cyclization

7

MFDTNATRLPIWGIGCNPWTAEHVDQTLAS

IWGIGCNPWTAEHVDQTLAS

No cyclization

8

IWGIGCNPWTAEGVGQGLASGNDIC

cyclo(IWGIGCNP)

<1%

9

IWGIGCNPWTAEHGDQGLASGNDIC

cyclo(IWGIGCNP)

~2%

10

IWGIGCNPWTAEHVDQTGASGNDIC

cyclo(IWGIGCNP)

~5%

11

MFDTNATRLPIWGIGCNPWTAEHVDQTLASGNDIA

cyclo(IWGIGCNP) (trace) IWGIGCNPWTAEHVD-QTLASGNDIA

<5%

12

MSDINATRLPIWGIGCNPCVGDDVTTLLTRGEALC

cyclo(IWGIGCNP)

>90%

13

MSDINATRLPAWLVDCPCVGDDVTTLLTRGEALC

AWLVDCPCVGD-DVTTLLTRGEALC

No cyclization

14

MSDINATRLPAWLATCPCAGDD

AWLVDCP (linear)

No cyclization

Table 4: Peptide Substrates for cyclization by GmPOPB.

The mature core peptide is underlined.

 

Transglutaminase

Transglutaminases are a group of enzymes that catalyze the acyl transfer reaction between the γ-carboxyamide of glutamines in peptides/proteins and ε-amino group of Lys (or a wide variety of primary amines) to form a protease-resistant isopeptide bond [123,124]. They are ubiquitous and have a widespread occurrence in microorganisms, plants, invertebrates and mammals [125-127]. Animal and plant transglutaminases are calcium-dependent enzymes, whereas microbial transglutaminases do not require calcium for their activities . They have been used extensively in food industries as “meat glue” to bond proteins together and thus improve the texture of meat [128,129]. Examples include imitation crabmeat, and fish balls. It allows a manufacturer to imitate the texture and taste of a more expensive product, such as lobster tail, using a relatively low-cost material. Recently, transglutaminases have been explored for peptide macrocyclization [130]. Furthermore, it has also been reported that about 1% of statherin, a 43-residue phospho-peptide found in human saliva, exists in the cyclic form by the action of transglutaminase [131].

While it has been well characterized that transglutaminases catalyze the specific isopeptide bond formation between the side chain of Gln and the side chain of Lys residue, there is no clear specific peptide sequence preferred by the enzymes. Probing of substrate specificity using phage selection of random peptide libraries showed that certain peptide sequences have a higher selectivity as Gln-donor substrates. For example, the peptide Ac-WALQRPH-NH2 has shown a 19-fold decrease in Km value (3 mM) than the commonly used Cbz-Gln-Gly (58 mM) [132,133]. However, the precise mechanism and requirements for the selectivity remained to be determined. Using WALQRPH as a Gln-donor sequence, a 3 residue spacer (GGG), and an acceptor KS sequence, a set of 10 peptides has been designed to evaluate the effect of length and amino acid composition for cyclization by transglutaminases (Table 5) [130]. The result show that peptide of variable ring sizes from 1.2 to 2.6 kDa can be cyclized efficiently. Truncation of two residues on the N-terminus cause loss of cyclization which suggested that certain length is required for efficient activity. In the absence of the acceptor Lys residue, Gln is deamidated to form Glu residue. Microbial transglutaminases are commercially available at relatively low cost. The cyclization is efficient but lack specificity if the peptides or proteins contain multiple Gln or Lys residues. Most Gln and Lys residues would act as substrates to a varying degree of efficiency depending on their accessibility to the enzymes.

Peptide

Sequence

Time (h)

Yield/Remark

1

H-WALQRPHGGGKS-NH2

5

>90%

2

H-GQRPHGGGKS-NH2

5

No cyclization

3

H-WALQSGSGGGKS-NH2

5

>90%

4

H-ALQSGSGGGKS-NH2

5

>90%

5

H-LQSGSGGGKS-NH2

5

No cyclization

6

H-ALQSGSRGGGKS-NH2

5

>90%

7

H-WALQSGSGGGGS-NH2

5

No cyclization

8

H-ALQAYDGWLPWEIHVKS-NH2

6

50%

9

H-ALQACSDRFRNCPADEALCAKS-NH2

6

>90%

10

H-WALQACSDRFRNCPADEALCAKS-NH2

6

>90%

Table 5: Peptide Substrates for cyclization by microbial transglutaminase.

 

Butelase

Butelase 1 is the fastest known ligase with catalytic efficiencies of up to 1,340,000 M-1 s-1 [134]. It is a highly promiscuous enzyme and has been shown to cyclize and ligate a broad range of non-native peptides and proteins of various origins with high efficiency [135-139]. It is discovered recently from the medicinal plant Clitoria ternatea (butterfly pea), which is also a common ornamental plant, fodder and cover crop in Southeast Asia [140]. Butelase 1 belongs to the legumain family of the cysteine proteases (family C13, clan CD) and shares 37% and 70% sequence homology with human legumain and VmPE-1 from mungo bean, respectively [140]. Butelase 1 presents in high yield (~ 5 mg/kg) and can be purified from pods of C. ternatea [141]. It is a cyclase responsible for the backbone cyclization of cyclotides, the largest family of plant cyclic peptides. The first cyclotide kalata B1 was discovered in the early 1970s, and the finding that Butelase 1 could cyclize kalata B1 provided the first in vitro evidence for its maturation, some 40 years after its discovery [140,142].

Butelase 1 is C-terminal specific for Asn/Asp (Asx) at the P1 position and requires a C-terminal dipeptide His-Val at the P1' and P2' positions for substrate recognition [141]. It acts preferentially as a cyclase but not a protease. It does not hydrolyze the legumain substrate Z-AAN-AMC even with an extended incubation of 30 h. Although the precise mechanism of cyclization remained to be determined, it has been speculated that butelase 1 first recognizes and cleaves the peptide bond between Asx and His-Val to form an acyl-enzyme intermediate, which is subsequently resolved by the amino group of the N-terminal residue to form a head-to-tail macrocycle. Butelase 1 displays a broad tolerance for the N-terminal residue at the P1'' position, accepting almost 20 natural amino acids except for Pro. Interestingly, it has a more stringent specificity for residue at the P2'' position, and highly favors Ile/Leu/Val and to some extent Cys residues. Butelase 1 has been shown to cyclize various peptides and proteins from 1 to 30 kDa with high efficiency (Table 6). It is the only known naturally occurring cyclase with the ability to cyclize proteins. Interestingly, butelase has also been reported to mediate intermolecular peptide ligation and site-specific modification of proteins. This property allows incorporation of a functional probes or tags into a target protein under mild reaction condition. The high catalytic efficiency together with a simple recognition motif make butelase 1 highly useful in peptide macrocyclization. It has a broad substrate scope capable of cyclizing peptides and proteins from 10 to >200 residue with high efficiency. A typical butelase-mediated reaction requires 100 to 1000 folds less amount of enzymes than other peptide ligases. It has been shown that butelase can cyclize peptides at up to 200,000 folds faster than sortase A [134]. Butelase 1 also enables the cyclization of non-cysteine containing peptides which are difficult to be synthesized by chemical method. Furthermore, butelase 1 has a simple recognition motif which requires no C-terminal hydrazide, thioester or unusual amino acids. Importantly, no recognition sequence except Asx was left in the final cyclized product making butelase-mediated reaction a traceless ligation. Butelase 1, however, cannot cyclize denature proteins and required a properly folded conformation.

Substrate

Sequence

Time (min)

Yield (%)

SA-kB1-NHVIA

GLPVCGETCVGGTCNTPGCTCSWPVCTRNHV

12

>95

SFTI-NHV

GRCTKSIPPICFPNHV

45

>95

Conotoxin MrIA

GVCCGYKLCHPCAGNHV

12

>95

Histatin-3

GLDSHAKRHHGYKRKFHEKHHSHRGYRSNYLYDNHV

12

>90

Thanatin

GSKKPVPIIYCNRRTGKCQRMNHV

240

59

Galanin

GLTSGWTLNSAGYLLGPHAVGNHRSFSDKNHV

5

>95

Apelin

GLVQPRGSRNGPGPWQGGRRKFRRQRPRLSHKGPMPFNHV

5

>95

Salusin

Neuromedin U 

GISGALPPAPAAPRPALRAQRAGPAGPGAKNHV

RVDEEFQSPFASQSRGYFLFRPRNHV

5

5

>95

>95

Rat neuromedin U

GIKYKVNEYQGPVAPSGGFFLFRPRNHV

5

>95

Green Fluorescent Protein (GFP)

GISMSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYG

KLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDF

FKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELK

GIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRH

NIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNE

KRDHMVLLEFVTAAGITLGMDELYKNHV

15

>90

Interleukin-1 Receptor Antagonist (IL-1Ra)

GISYDYMEGGDIRVRRLFCRTQWYLRIDKRGKVKGTQEMKNN

YNIMEIRTVAVGIVAIKGVESEFYLAMNKEGKLYAKKECNEDC

NFKELILENHYNTYASAKWTHNGGEMFVALNQKGIPVRGKKT

KKEQKTAHFLPMAITNHV

15

>90

Human growth hormone (Somatropin)

GITPLGPASSLPQSFLLKCLEQVRKIQGDGAALQEKLCATYKLC

HPEELVLLGHSLGIPWAPLSSCPSQALQLAGCLSQLHSGLFLYQ

GLLQALEGISPELGPTLDTLQLDVADFATTIWQQMEELGMAPA

LQPTQGAMPAFASAFQRRAGGVLVASHLQSFLEVSYRVLRHLA

QPKNHV

15

>90

Table 6: Cyclization yields of peptide substrates mediated by butelase 1.

 

Acknowledgement

This work is supported by Direct Grants from The Chinese University of Hong Kong (Grant nos: 4053200).

References

  1. Merrifield RB (1963) Solid Phase Peptide Synthesis I The Synthesis of a Tetrapeptide. J Am Chem Soc 85: 2149-2154.
  2. Fosgerau K, Hoffmann T (2015) Peptide therapeutics: current status and future directions. Drug Discovery Today 20: 122-128.
  3. Craik DJ, Fairlie DP, Liras S, Price D (2013) The Future of Peptide-based Drugs. Chem Biol Drug Des 81: 136-147.
  4. Thorstholm L, Craik DJ (2012) Discovery and applications of naturally occurring cyclic peptides. Drug Discovery Today Technologies 9: 13-21.
  5. Russo A, Aiello C, Grieco P, Marasco D (2016) Targeting “Undruggable” Proteins: Design of Synthetic Cyclopeptides. Curr Med Chem 23: 748-762.
  6. White CJ, Yudin AK (2011) Contemporary strategies for peptide macrocyclization. Nat Chem 3: 509-524.
  7. Tam JP, Wong CTT (2012) Chemical synthesis of circular proteins. J Biol Chem 287: 27020-27025.
  8. Bondalapati S, Jbara M, Brik A (2016) Expanding the chemical toolbox for the synthesis of large and uniquely modified proteins. Nat Chem 8: 407-418.
  9. Kemp DS (1981) The amine capture strategy for peptide bond formation-an outline of progress. Biopolymers 20: 1793-1804.
  10. Liu CF, Tam JP (1994) Chemical Ligation Approach to Form a Peptide Bond between Unprotected Peptide Segments Concept and Model Study. J Am Chem Soc 116: 4149-4153.
  11. Tam JP, Miao Z (1999) Stereospecific Pseudoproline Ligation of N-Terminal Serine, Threonine, or Cysteine-Containing Unprotected Peptides. J Am Chem Soc 121: 9013-9022.
  12. Botti P, Pallin TD, Tam JP (1996) Cyclic Peptides from Linear Unprotected Peptide Precursors through Thiazolidine Formation. J Am Chem Soc 118: 10018-10024.
  13. Li X, Lam HY, Zhang Y, Chan CK (2010) Salicylaldehyde ester-induced chemoselective peptide ligations: enabling generation of natural peptidic linkages at the serine/threonine sites. Org Lett 12: 1724-1727.
  14. Wong CTT, Li T, Lam HY, Zhang Y, Li X (2014) Realizing serine/threonine ligation: scope and limitations and mechanistic implication thereof. Front Chem 2: 28.
  15. Lee CL, Liu H, Wong CTT, Chow HY, Li X (2016) Enabling N-to-C Ser/Thr Ligation for Convergent Protein Synthesis via Combining Chemical Ligation Approaches. J Am Chem Soc 138: 10477-10484.
  16. Zhang Y, Xu C, Lam HY, Lee CH, Li X (2013) Protein chemical synthesis by serine and threonine ligation. Proc Natl Acad Sci USA 110: 6657-6662.
  17. Li T, Liu H, Li X (2016) Chemical Synthesis of HMGA1a Proteins with Post-translational Modifications via Ser/Thr Ligation. Org Lett 18: 5944-5947.
  18. Wong CTT, Lam HY, Li X (2014) Effective synthesis of cyclic peptide yunnanin C and analogues via Ser/Thr Ligation (STL)-mediated peptide cyclization. Tetrahedron 70: 7770-7773.
  19. Wong CTT, Lam HY, Li X (2013) Effective synthesis of kynurenine-containing peptides via on-resin ozonolysis of tryptophan residues: synthesis of cyclomontanin B. Org Biomol Chem 11: 7616-7620.
  20. Zhao JF, Zhang XH, Ding Y, Yang YS, Bi XB, et al. (2013) Facile Synthesis of Peptidyl Salicylaldehyde Esters and Its Use in Cyclic Peptide Synthesis. Org Lett 15: 5182-5185.
  21. Lam HY, Zhang Y, Liu H, Xu J, Wong CTT, et al. (2013) Total Synthesis of Daptomycin by Cyclization via a Chemoselective Serine Ligation. J Am Chem Soc 135: 6272-6279.
  22. Wong CTT, Lam HY, Song T, Chen G, Li X (2013) Synthesis of constrained head-to-tail cyclic tetrapeptides by an imine-induced ring-closing/contraction strategy. Angew Chem Int Ed 52: 10212-10215.
  23. Bode JW, Fox RM, Baucom KD (2006) Chemoselective Amide Ligations by Decarboxylative Condensations of N-Alkylhydroxylamines and α-Ketoacids. Angew Chem Int Ed 45: 1248-1252.
  24. Rohrbacher F, Deniau G, Luther A, Bode JW (2015) Spontaneous head-to-tail cyclization of unprotected linear peptides with the KAHA ligation. Chem Sci 6: 4889-4896.
  25. Fukuzumi T, Ju L, Bode JW (2012) Chemoselective cyclization of unprotected linear peptides by α-ketoacid-hydroxylamine amide-ligation. Org Biomol Chem 10: 5837-5844.
  26. Dawson P, Muir T, Clark-Lewis I, Kent S (1994) Synthesis of proteins by native chemical ligation. Science 266: 776-779.
  27. Tam JP, Lu YA, Liu CF, Shao J (1995) Peptide synthesis using unprotected peptides through orthogonal coupling methods. Proc Natl Acad Sci USA 92: 12485-12489.
  28. Lu W, Qasim MA, Kent SBH (1996) Comparative Total Syntheses of Turkey Ovomucoid Third Domain by Both Stepwise Solid Phase Peptide Synthesis and Native Chemical Ligation. J Am Chem Soc 118: 8518-8523.
  29. Xu R, Ayers B, Cowburn D, Muir TW (1999) Chemical ligation of folded recombinant proteins: Segmental isotopic labeling of domains for NMR studies. Proc Natl Acad Sci USA 96: 388-393.
  30. Kent SBH (2009) Total chemical synthesis of proteins. Chem Soc Rev 38: 338-351.
  31. Tam JP, Lu YA, Yang JL, Chiu KW (1999) An unusual structural motif of antimicrobial peptides containing end-to-end macrocycle and cystine-knot disulfides. Proc Natl Acad Sci USA 96: 8913-8918.
  32. Tam JP, Lu Y-A, Yu Q (1999) Thia Zip Reaction for Synthesis of Large Cyclic Peptides: Mechanisms and Applications. J Am Chem Soc 121: 4316-4324.
  33. Chan LY, Craik DJ, Daly NL (2016) Dual-targeting anti-angiogenic cyclic peptides as potential drug leads for cancer therapy. Sci Rep 6: 1-13.
  34. Hemu X, Tam JP (2017) Macrocyclic Antimicrobial Peptides Engineered from ω-Conotoxin. Curr Pham Des 23: 2131-2138.
  35. Aboye TL, Clark RJ, Burman R, Roig MB, Craik DJ, et al. (2011) Interlocking disulfides in circular proteins: toward efficient oxidative folding of cyclotides. Antioxid Redox Signal 14: 77-86.
  36. Wong CTT, Rowlands DK, Wong CH, Lo TWC, Nguyen GKT, et al. (2012) Orally active peptidic bradykinin B1 receptor antagonists engineered from a cyclotide scaffold for inflammatory pain treatment. Angew Chem Int Ed Engl 51: 5620-5624.
  37. Clark RJ, Craik DJ (2010) Native chemical ligation applied to the synthesis and bioengineering of circular peptides and proteins. Biopolymers 94: 414-422.
  38. Tulla-Puche J, Barany G (2004) On-Resin Native Chemical Ligation for Cyclic Peptide Synthesis. J Org Chem 69: 4101-4107.
  39. Thongyoo P, Roqué-Rosell N, Leatherbarrow RJ, Tate EW (2008) Chemical and biomimetic total syntheses of natural and engineered MCoTI cyclotides. Org Biomol Chem 6: 1462-1470.
  40. Thongyoo P, Tate EW, Leatherbarrow RJ (2006) Total synthesis of the macrocyclic cysteine knot microprotein MCoTI-II. Chem Commun 2006: 2848-2850.
  41. Kawakami T, Sumida M, Nakamura K, Vorherr T, Aimoto S (2005) Peptide Thioester Preparation Based on an N-S Acyl Shift Reaction Mediated by a Thiol Ligation Auxiliary. Tetrahedron Lett 46: 8805-8807.
  42. Hojo H, Onuma Y, Akimoto Y, Nakahara Y, Nakahara Y (2007) N-Alkyl cysteine-assisted thioesterification of peptides. Tetrahedron Lett 48: 25-28.
  43. Rao C, Liu CF (2017) Peptide Weinreb amide derivatives as thioester precursors for native chemical ligation. Org Biomol Chem 15: 2491-2496.
  44. Ollivier N, Dheur J, Mhidia R, Blanpain A, Melnyk O (2010) Bis(2-sulfanylethyl) amino Native Peptide Ligation. Org Lett 12: 5238-5241.
  45. Boll E, Dheur J, Drobecq H, Melnyk O (2012) Access to Cyclic or Branched Peptides Using Bis(2-sulfanylethyl)amido Side-Chain Derivatives of Asp and Glu. Org Lett 14: 2222-2225.
  46. Yang R, Hou W, Zhang X, Liu CF (2012) N-to-C Sequential Ligation Using Peptidyl N,N-Bis(2-mercaptoethyl)amide Building Blocks. Org Lett 14: 374-377.
  47. Hou W, Zhang X, Li F, Liu CF (2011) Peptidyl N, N-Bis(2-mercaptoethyl)-amides as Thioester Precursors for Native Chemical Ligation. Org Lett 13: 386-389.
  48. Boll E, Ebran JP, Drobecq H, El-Mahdi O, Raibaut L, et al. (2015) Access to Large Cyclic Peptides by a One-Pot Two-Peptide Segment Ligation/Cyclization Process. Org Lett 17: 130-133.
  49. Taichi M, Hemu X, Qiu Y, Tam JP (2013) A Thioethylalkylamido (TEA) Thioester Surrogate in the Synthesis of a Cyclic Peptide via a Tandem Acyl Shift. Org Lett 15: 2620-2623.
  50. Qiu Y, Taichi M, Wei N, Yang H, Luo KQ, et al. (2017) An Orally Active Bradykinin B1 Receptor Antagonist Engineered as a Bifunctional Chimera of Sunflower Trypsin Inhibitor. J Med Chem 60: 504-510.
  51. Hemu X, Taichi M, Qiu Y, Liu DX, Tam JP (2013) Biomimetic synthesis of cyclic peptides using novel thioester surrogates. Biopolymers 100: 492-501.
  52. Zheng JS, Tang S, Qi YK, Wang ZP, Liu L (2013) Chemical synthesis of proteins using peptide hydrazides as thioester surrogates. Nat Protoc 8: 2483-2495.
  53. Li J, Li Y, He Q, Li Y, Li H, et al. (2014) One-pot native chemical ligation of peptide hydrazides enables total synthesis of modified histones. Org Biomol Chem 12: 5435-5441.
  54. Huang YC, Fang GM, Liu L (2015) Chemical synthesis of proteins using hydrazide intermediates. Natl Sci Rev 3: 107-116.
  55. Fang GM, Wang JX, Liu L (2012) Convergent chemical synthesis of proteins by ligation of peptide hydrazides. Angew Chem Int Ed Engl 51: 10347-10350.
  56. Zheng JS, Tang S, Guo Y, Chang HN, Liu L (2012) Synthesis of Cyclic Peptides and Cyclic Proteins via Ligation of Peptide Hydrazides. ChemBioChem 13: 542-546.
  57. Tam JP, Yu Q (1998) Methionine ligation strategy in the biomimetic synthesis of parathyroid hormones. Biopolymers 46: 319-327.
  58. Yan LZ, Dawson PE (2001) Synthesis of peptides and proteins without cysteine residues by native chemical ligation combined with desulfurization. J Am Chem Soc 123: 526-533.
  59. Quaderer R, Hilvert D (2002) Selenocysteine-mediated backbone cyclization of unprotected peptides followed by alkylation, oxidative elimination or reduction of the selenol. Chem Commun (Camb) 2002: 2620-262.
  60. Wan Q, Danishefsky SJ (2007) Free-Radical-Based, Specific Desulfurization of Cysteine: A Powerful Advance in the Synthesis of Polypeptides and Glycopolypeptides. Angew Chem Int Ed Engl 46: 9248-9252.
  61. Saxon E, Armstrong JI, Bertozzi CR (2000) A “traceless” Staudinger ligation for the chemoselective synthesis of amide bonds. Org Lett 2: 2141-2143.
  62. Kleineweischede R, Hackenberger CP (2008) Chemoselective Peptide Cyclization by Traceless Staudinger Ligation. Angew Chem Int Ed Engl 47: 5984-5988.
  63. Tung CL, Wong CTT, Li X (2015) Peptide 2-formylthiophenol esters do not proceed through a Ser/Thr ligation pathway, but participate in a peptide aminolysis to enable peptide condensation and cyclization. Org Biomol Chem 13: 6922-6926.
  64. Shih CK, Wagner R, Feinstein S, Kanikennulat C, Neff N (1988) A Dominant Trifluoperazine Resistance Gene from Saccharomyces-Cerevisiae has Homology with F0F1 ATP Synthase and Confers Calcium-Sensitive Growth. Mol Cell Biol 8: 3094-3103.
  65. Bowman EJ, Tenney K, Bowman BJ (1988) Isolation of genes encoding the Neurospora vacuolar ATPase Analysis of vma-1 encoding the 67-kDa subunit reveals homology to other ATPases. J Biol Chem 263: 13994-14001.
  66. Zimniak L, Dittrich P, Gogarten JP, Kibak H, Taiz L (1988) The cDNA sequence of the 69-kDa subunit of the carrot vacuolar H+-ATPase Homology to the beta-chain of F0F1-ATPases. J Biol Chem 263: 9102-9112.
  67. Hirata R, Ohsumk Y, Nakano A, Kawasaki H, Suzuki K, et al. (1990) Molecular structure of a gene, VMA1, encoding the catalytic subunit of H (+)-translocating adenosine triphosphatase from vacuolar membranes of Saccharomyces Cerevisiae. J Biol Chem 265: 6726-6733.
  68. Kane PM, Yamashiro CT, Wolczyk DF, Neff N, Goebl M, et al. (1990) Protein splicing converts the yeast TFP1 gene product to the 69-kD subunit of the vacuolar H (+)-adenosine triphosphatase. Science 250: 651-657.
  69. Perler FB (1999) InBase, the New England Biolabs Intein Database. Nucleic Acids Res 27: 346-347.
  70. Klabunde T, Sharma S, Telenti A, Jacobs WR, Sacchettini JC (1998) Crystal structure of GyrA intein from Mycobacterium xenopi reveals structural basis of protein splicing. Nat Struct Biol 5: 31-36.
  71. Chong S, Xu MQ (1997) Protein Splicing of the Saccharomyces cerevisiae VMA Intein without the Endonuclease Motifs. J Biol Chem 272: 15587-15590.
  72. Wu H, Xu MQ, Liu XQ (1998) Protein Trans-Splicing and Functional Mini-Inteins of a Cyanobacterial dnaB Intein. Biochimica et Biophysica Acta (BBA) - Protein Structure and Molecular Enzymology 1387: 422-432.
  73. Southworth MW, Adam E, Panne D, Byer R, Kautz R, et al. (1998) Control of Protein Splicing by Intein Fragment Reassembly. EMBO J 17: 918-926.
  74. Muir TW, Sondhi D, Cole PA (1998) Expressed Protein Ligation: A General Method for Protein Engineering. Proc Natl Acad Sci USA 95: 6705-6710.
  75. Scott CP, Abel-Santos E, Wall M, Wahnon DC, Benkovic SJ (1999) Production of Cyclic Peptides and Proteins In Vivo. Proc Natl Acad Sci USA 96: 13638-13643.
  76. Austin J, Kimura RH, Woo YH, Camarero JA (2010) In Vivo Biosynthesis of An Ala-Scan Library Based on the Cyclic Peptide SFTI-1. Amino Acids 38: 1313-1322.
  77. Iwai H, Pluckthun A (1999) Circular Beta-Lactamase: Stability Enhancement by Cyclizing the Backbone. FEBS Lett 459: 166-172.
  78. Camarero JA, Muir TW (1999) Biosynthesis of A Head-To-Tail Cyclized Protein with Improved Biological Activity. J Am Chem Soc 121: 5597-5598.
  79. Kimura RH, Tran AT, Camarero JA (2006) Biosynthesis of The Cyclotide Kalata B1 by Using Protein Splicing. Angew Chem Int Ed 45: 973-976.
  80. Hemu X, Qiu Y, Tam JP (2014) Peptide Macrocyclization Through Amide-To-Amide Transpeptidation. Tetrahedron 70: 7707-7713.
  81. Chong S, Mersha FB, Comb DG, Scott ME, Landry D, et al. (1997) Single-Column Purification of Free Recombinant Proteins Using a Self-Cleavable Affinity Tag Derived from A Protein Splicing Element. Gene 192: 271-281.
  82. Wood DW, Wu W, Belfort G, Derbyshire V, Belfort M (1999) A Genetic System Yields Self-Cleaving Inteins For Bioseparations. Nat Biotech 17: 889-892.
  83. Wood DW, Derbyshire V, Wu W, Chartrain M, Belfort M, et al. (2000) Optimized Single-Step Affinity Purification with A Self-Cleaving Intein Applied to Human Acidic Fibroblast Growth Factor. Biotech Prog 16: 1055-1063.
  84. Zettler J, Schütz V, Mootz HD (2009) The Naturally Split Npu DnaE Intein Exhibits an Extraordinarily High Rate in The Protein Trans-Splicing Reaction. FEBS Lett 583: 909-914.
  85. Wang Z, Li N, Wang Y, Wu Y, Mu T, et al. (2012) Ubiquitin-Intein And SUMO2-Intein Fusion Systems for Enhanced Protein Production and Purification. Protein Expr Purif 82: 174-178.
  86. Ramirez M, Valdes N, Guan D, Chen Z (2013) Engineering split intein DnaE from Nostoc punctiforme for rapid protein purification. Protein Eng Des Sel 26: 215-223.
  87. Guan D, Ramirez M, Chen Z (2013) Split Intein Mediated Ultra-Rapid Purification of tagless Protein (SIRP). Biotech Bioeng 110: 2471-2481.
  88. Evans TC, Martin D, Kolly R, Panne D, Sun L, et al. (2000) Protein Trans-Splicing and Cyclization by A Naturally Split Intein From the DnaE Gene of Synechocystis Species PCC6803. J Biol Chem 275: 9091-9094.
  89. Kimura R, Camarero JA (2005) Expressed Protein Ligation: A New Tool for The Biosynthesis of Cyclic Polypeptides. Protein Pept Lett 12: 789-794.
  90. Tavassoli A, Benkovic SJ (2007) Split-Intein Mediated Circular Ligation Used in The Synthesis of Cyclic Peptide Libraries in E Coli. Nat Protoc 2: 1126-1133.
  91. Mootz HD (2009) Split Inteins as Versatile Tools for Protein Semisynthesis. Chembiochem 10: 2579-2589.
  92. Cheriyan M, Perler FB (2009) Protein Splicing: A Versatile Tool for Drug Discovery. Adv Drug Del Rev 61: 899-907.
  93. Polgar L, Bender ML (1966) A new enzyme containing a synthetically formed active site Thiol-Subtilisin1. J Am Chem Soc 88: 3153-3154.
  94. Neet KE, Koshland DE (1966) The Conversion of Serine at the Active Site of Subtilisin to Cysteine: A “Chemical Mutation”. Proc Natl Acad Sci USA 56: 1606-1611.
  95. Nakatsuka T, Sasaki T, Kaiser ET (1987) Peptide Segment Synthesis Catalyzed by The Semisynthetic Enzyme Thiolsubtilisin. J Am Chem Soc 109: 3808-3810.
  96. Abrahmsen L, Tom J, Burnier J, Butcher KA, Kossiakoff A, et al. (1991) Engineering Subtilisin and Its Substrates for Efficient Ligation of Peptide Bonds in Aqueous Solution. Biochemistry 30: 4151-4159.
  97. Jackson DY, Burnier J, Quan C, Stanley M, Tom J, et al. (1994) A Designed Peptide Ligase for Total Synthesis of Ribonuclease A with Unnatural Catalytic Residues. Science 266: 243-247.
  98. Chang TK, Jackson DY, Burnier JP, Wells JA (1994) Subtiligase: A Tool for Semisynthesis of Proteins. Proc Natl Acad Sci USA 91: 12544-12548.
  99. Atwell S, Wells JA (1999) Selection for Improved Subtiligases by Phage Display. Proc Natl Acad Sci USA 96: 9497-9502.
  100. Wildes D, Wells JA (2010) Sampling the N-Terminal Proteome of Human Blood. Proc Natl Acad Sci USA 107: 4561-4566.
  101. Jackson DY, Burnier JP, Wells JA (1995) Enzymic Cyclization of Linear Peptide Esters using Subtiligase. J Am Chem Soc 117: 819-820.
  102. Tan XH, Yang R, Wirjo A, Liu CF (2008) Subtiligase As a Hydrothiolase for the Synthesis of Peptide Thioacids. Tetrahedron Lett 49: 2891-2894.
  103. Tan XH, Wirjo A, Liu CF (2007) An Enzymatic Approach to the Synthesis of Peptide Thioesters: Mechanism and Scope. Chembiochem 8: 1512-1515.
  104. Mazmanian SK, Liu G, Ton-That H, Schneewind O (1999) Staphylococcus Aureus Sortase, An Enzyme that Anchors Surface Proteins to the Cell Wall. Science 285: 760-763.
  105. Perry AM, Ton-That H, Mazmanian SK, Schneewind O (2002) Anchoring of Surface Proteins to the Cell Wall of Staphylococcus Aureus III Lipid II is an in vivo Peptidoglycan Substrate for Sortase-Catalyzed Surface Protein Anchoring. J Biol Chem 277: 16241-16248.
  106. Gaspar AH, Marraffini LA, Glass EM, Debord KL, Ton-That H, et al. (2005) Bacillus Anthracis Sortase A (Srta) Anchors LPXTG Motif-Containing Surface Proteins to the Cell Wall Envelope. J Bacteriol 187: 4646-4655.
  107. Mao H, Hart SA, Schink A, Pollok BA (2004) Sortase-Mediated Protein Ligation: A New Method for Protein Engineering. J Am Chem Soc 126: 2670-2671.
  108. Huang X, Aulabaugh A, Ding W, Kapoor B, Alksne L, et al. (2003) Kinetic Mechanism of Staphylococcus Aureus Sortase SrtA. Biochemistry 42: 11307-11315.
  109. Popp MW, Antos JM, Grotenbreg GM, Spooner E, Ploegh HL (2007) Sortagging: A Versatile Method for Protein Labeling. Nat Chem Biol 3: 707-708.
  110. Antos JM, Chew GL, Guimaraes CP, Yoder NC, Grotenbreg GM, et al. (2009) Site-Specific N- and C-Terminal Labeling of a Single Polypeptide Using Sortases of Different Specificity. J Am Chem Soc 131: 10800-10801.
  111. Yamamoto T, Nagamune T (2009) Expansion of The Sortase-Mediated Labeling Method for Site-Specific N-Terminal Labeling of Cell Surface Proteins on Living Cells. Chem Commun (Camb) 9: 1022-1024.
  112. Parthasarathy R, Subramanian S, Boder ET (2007) Sortase A as A Novel Molecular “Stapler” For Sequence-Specific Protein Conjugation. Bioconjugate Chem 18: 469-476.
  113. Bolscher JG, Oudhoff MJ, Nazmi K, Antos JM, Guimaraes CP, et al. (2011) Sortase A as a tool for high-yield histatin cyclization. FASEB J 25: 2650-2658.
  114. Antos JM, Miller GM, Grotenbreg GM, Ploegh HL (2008) Lipid Modification of Proteins Through Sortase-Catalyzed Transpeptidation. J Am Chem Soc 130: 16338-16343.
  115. Luo H, Hong SY, Sgambelluri RM, Angelos E, Li X, et al. (2014) Peptide Macrocyclization Catalyzed by A Prolyl Oligopeptidase Involved in Alpha-Amanitin Biosynthesis. Chem Biol 21: 1610-1617.
  116. Luo H, Hallen-Adams HE, Walton JD (2009) Processing of the Phalloidin Proprotein by Prolyl Oligopeptidase from the Mushroom Conocybe Albipes. J Biol Chem 284: 18070-18077.
  117. Kaneko H, Tomomasa T, Inoue Y, Kunimoto F, Fukusato T, et al. (2001) Amatoxin Poisoning from Ingestion of Japanese Galerina Mushrooms. J Toxicol Clin Toxicol 39: 413-416.
  118. Bushnell DA, Cramer P, Kornberg RD (2002) Structural Basis of Transcription: Alpha-Amanitin-RNA Polymerase II Cocrystal at 2.8 A Resolution. Proc Natl Acad Sci USA 99: 1218-1222.
  119. Brueckner F, Cramer P (2008) Structural Basis of Transcription Inhibition by Alpha-Amanitin And Implications for RNA Polymerase II Translocation. Nat Struct Mol Biol 15: 811-818.
  120. Hallen HE, Luo H, Scott-Craig JS, Walton JD (2007) Gene Family Encoding the Major Toxins of Lethal Amanita Mushrooms. Proc Natl Acad Sci USA 104: 19097-19101.
  121. Luo H, Hallen-Adams HE, Scott-Craig JS, Walton JD (2012) Ribosomal Biosynthesis of Alpha-Amanitin In Galerina Marginata. Fungal Genet Biol 49: 123-129.
  122. Li P, Deng W, Li T (2014) The Molecular Diversity of Toxin Gene Families in Lethal Amanita Mushrooms. Toxicon 83: 59-68.
  123. Griffin M, Casadio R, Bergamini CM (2002) Transglutaminases: Nature’s Biological Glues. Biochem J 368: 377-396.
  124. Ohtsuka T, Sawa A, Kawabata R, Nio N, Motoki M (2000) Substrate Specificities of Microbial Transglutaminase for Primary Amines. J Agric Food Chem 48: 6230-6233.
  125. Ando H, Adachi M, Umeda K, Matsuura A, Nonaka M, et al. (1989) Purification and Characteristics of a Novel Transglutaminase Derived from Microorganisms. Agric Bioi Chem 53: 2613-2617.
  126. Della Mea M, Caparros-Ruiz D, Claparols I, Serafini-Fracassini D, Rigau J (2004) Atpng1p. The First Plant Transglutaminase. Plant Physiol 135: 2046-2054.
  127. Ikura K, Nasu T, Yokota H, Tsuchiya Y, Sasaki R, et al. (1988) Amino Acid Sequence of Guinea Pig Liver Transglutaminase from Its cDNA Sequence. Biochemistry 27: 2898-2905.
  128. Motoki M, Segurob K (1998) Transglutaminase and Its Use for Food Processing. Trends Food Sci Tech 9: 204-210.
  129. Zhu Y, Tramper J (2008) Novel Applications for Microbial Transglutaminase Beyond Food Processing. Trends Biotechnol 26: 559-565.
  130. Touati J, Angelini A, Hinner MJ, Heinis C (2011) Enzymatic Cyclisation of Peptides with A Transglutaminase. Chembiochem 12: 38-42.
  131. Cabras T, Inzitari R, Fanali C, Scarano E, Patamia M, et al. (2006) HPLC-MS Characterization of Cyclo-Statherin Q-37, A Specific Cyclization Product of Human Salivary Statherin Generated by Transglutaminase 2. J Sep Sci 29: 2600-2608.
  132. Oteng-Pabi SK, Keillor JW (2013) Continuous Enzyme-Coupled Assay for Microbial Transglutaminase Activity. Anal Biochem 441: 169-173.
  133. Sugimura Y, Yokoyama K, Nio N, Maki M, Hitomi K (2008) Identification of Preferred Substrate Sequences of Microbial Transglutaminase from Streptomyces Mobaraensis Using a Phage-Displayed Peptide Library. Arch Biochem Biophys 477: 379-383.
  134. Nguyen GK, Kam A, Loo S, Jansson AE, Pan LX, et al. (2015) Butelase 1: A Versatile Ligase for Peptide and Protein Macrocyclization. J Am Chem Soc 137: 15398-15401.
  135. Nguyen GK, Hemu X, Quek JP, Tam JP (2016) Butelase-Mediated Macrocyclization of D-Amino-Acid-Containing Peptides. Angew Chem Int Ed 55: 12802-12806.
  136. Cao Y, Nguyen GK, Tam JP, Liu CF (2015) Butelase-Mediated Synthesis of Protein Thioesters and Its Application For Tandem Chemoenzymatic Ligation. Chem Commun (Camb) 51: 17289-17292.
  137. Nguyen GK, Cao Y, Wang W, Liu CF, Tam JP (2015) Site-Specific N-Terminal Labeling of Peptides and Proteins Using Butelase 1 And Thiodepsipeptide. Angew Chem Int Ed Engl. 54: 15694-15698.
  138. Hemu X, Qiu Y, Nguyen GK, Tam JP (2016) Total Synthesis of Circular Bacteriocins By Butelase 1. J Am Chem Soc 138: 6968-6971.
  139. Cao Y, Nguyen GK, Chuah S, Tam JP, Liu CF (2016) Butelase-Mediated Ligation as an Efficient Bioconjugation Method for the Synthesis of Peptide Dendrimers. Bioconjug Chem 27: 2592-2596.
  140. Nguyen GK, Wang S, Qiu Y, Hemu X, Lian Y, et al. (2014) Butelase 1 is an Asx-Specific Ligase Enabling Peptide Macrocyclization and Synthesis. Nat Chem Biol 10: 732-738.
  141. Nguyen GK, Qiu Y, Cao Y, Hemu X, Liu CF, et al. (2016) Butelase-Mediated Cyclization and Ligation of Peptides and Proteins. Nat Protoc 11: 1977-1988.
  142. Bi X, Yin J, Nguyen GKT, Rao C, Halim NBA, et al. (2017) Enzymatic Engineering of Live Bacterial Cell Surfaces Using Butelase 1. Angew Chem Int Ed Engl 56: 7822-7825.