Circular Proteins in Plants

SOLUTION STRUCTURE OF A NOVEL MACROCYCLIC TRYPSIN INHIBITOR FROM MOMORDICA COCHINCHINENSIS*

Maria Elena Felizmenio-Quimio, Norelle L. Daly, and David J. CraikDagger

From the Institute for Molecular Bioscience, University of Queensland, Brisbane, 4072 Queensland, Australia

Received for publication, February 22, 2001, and in revised form, April 2, 2001

    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Much interest has been generated by recent reports on the discovery of circular (i.e. head-to-tail cyclized) proteins in plants. Here we report the three-dimensional structure of one of the newest such circular proteins, MCoTI-II, a novel trypsin inhibitor from Momordica cochinchinensis, a member of the Cucurbitaceae plant family. The structure consists of a small beta -sheet, several turns, and a cystine knot arrangement of the three disulfide bonds. Interestingly, the molecular topology is similar to that of the plant cyclotides (Craik, D. J., Daly, N. L., Bond, T., and Waine, C. (1999) J. Mol. Biol. 294, 1327-1336), which derive from the Rubiaceae and Violaceae plant families, have antimicrobial activities, and exemplify the cyclic cystine knot structural motif as part of their circular backbone. The sequence, biological activity, and plant family of MCoTI-II are all different from known cyclotides. However, given the structural similarity, cyclic backbone, and plant origin of MCoTI-II, we propose that MCoTI-II can be classified as a new member of the cyclotide class of proteins. The expansion of the cyclotides to include trypsin inhibitory activity and a new plant family highlights the importance and functional variability of circular proteins and the fact that they are more common than has previously been believed. Insights into the possible roles of backbone cyclization have been gained by a comparison of the structure of MCoTI-II with the homologous acyclic trypsin inhibitors CMTI-I and EETI-II from the Cucurbitaceae plant family.

    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Over the last few years a number of small disulfide-rich proteins that have a head-to-tail cyclized peptide backbone have been discovered in plants. These circular proteins include kalata B1 (1), the circulins (2, 3), and cyclopsychotride A (4) from the Rubiaceae family, and various peptides from plants in the Violaceae family (3, 5-8). They contain ~30 amino acids, including six highly conserved cysteine residues that form three structurally important disulfide bonds. The latter contribute to the molecules having well defined and stable three-dimensional structures (8). Because of their well defined structures and potent biological activities, the molecules may be regarded as miniproteins. We recently proposed that the known circular proteins most likely form part of a large family of proteins that we refer to as the plant cyclotides (8). They contain the conserved cysteine spacing CX3-CX4-CX4-7-CX1-CX4-5-CX5-7 within their circular backbone.

The plant cyclotides display a diverse range of biological activities ranging from uterotonic action (kalata B1 (1)), to anti-HIV activity (circulins (2)), to hemolytic activity (5, 9, 10), and inhibition of neurotensin binding (cyclopsychotride A (4)). Although their precise role in plants has not yet been reported, it appears that they are most likely present as defense molecules. All of their biological activities in mammalian systems seem to be related in one way or another to membrane interactions, a common feature of plant defense molecules. Recently the cyclotides have also been reported to display a wide range of anti-microbial activities against Gram-positive and Gram-negative bacteria, and against certain fungi (9), further enhancing the likelihood that they are indeed plant defense molecules.

The three-dimensional structures of three members of the plant cyclotide family have now been determined, including the prototypic member kalata B1, the first macrocyclic peptide for which a full three-dimensional structure was determined (1), circulin A (11), and cycloviolacin O1 (8). The structures are all highly conserved and contain a distorted triple stranded beta -sheet, perhaps better described as a well defined beta -hairpin with a third strand forming a beta -bulge (8). The three disulfide bonds are arranged in a so called cystine knot, in which an embedded ring in the structure formed by two disulfide bonds and their connecting backbone segments forms a ring that is penetrated by the third disulfide bond (12-15). Although a seemingly unlikely structural motif, the cystine knot has now been seen in a variety of small disulfide-rich proteins from both the plant and animal kingdoms (15), and seems to be associated with exceptionally high stability and resistance to proteolytic cleavage of molecules containing it. This is particularly so for the cyclotides, which contain a cyclic cystine knot motif (8).

Recently Hernandez et al. (16) reported two new macrocyclic peptides derived from Momordica cochinchinensis, a vine plant from the Cucurbitaceae plant family. The two molecules, named MCoTI-I and MCoTI-II are trypsin inhibitors and comprise 34 amino acids, a cyclized peptide backbone, and three disulfide bonds. They are homologous to a large range of open-chain trypsin inhibitors from plants (16), but are the first reported macrocyclic trypsin inhibitors (although note that a smaller, single disulfide-containing cyclic trypsin inhibitor was recently reported from sunflower seeds (17)). This exciting discovery supports our view that macrocyclic peptides are much more common in plants than has previously been realized, and that many more are likely to be discovered over coming years.

The sequences of the two new trypsin inhibitors, MCoTI-I and MCoTI-II, show little homology with the known plant cyclotides apart from the six-conserved cysteine residues and the macrocyclic peptide backbone, as seen by a comparison of MCoTI-II and kalata B1 in Fig. 1. Note that because the molecules are cyclic the choice of numbering of residue 1 is arbitrary but throughout the text we have adopted the numbering scheme used by Hernandez et al. (16). In addition, we adopted our earlier nomenclature for defining the six cysteine residues (labeled I-VI) and the six backbone loops that make up the sequences between the cysteine residues (loops 1-6). This numbering scheme is defined in Fig. 1.


View larger version (14K):
[in this window]
[in a new window]
 
Fig. 1.   Sequence alignment of MCoTI-II with kalata B1. The disulfide connectivities are represented at the top of the diagram and the beta -sheet regions of kalata B1 are represented as arrows. Cysteine residues are numbered by Roman numerals. The loops (regions between successive Cys residues) are numbered at the bottom of the diagram based on the numbering scheme previously used for kalata B1 (8). The numbering scheme used for MCoTI-II throughout the text is based on the paper describing the isolation of this peptide (16). Residue 1 for this numbering scheme is shown on the sequence of MCoTI-II.

The three-dimensional structure of MCoTI-II has been modeled (16) based on its sequence similarity to open chain trypsin inhibitors, however, no experimental determination of the three-dimensional structure of the new macrocyclic peptides is available. We were interested to see whether this class was similar to the plant cyclotides and contained the cystine knot motif. We therefore isolated and purified a sample of MCoTI-II from M. cochinchinensis and herewith report the three-dimensional structure determined using NMR spectroscopy and simulated annealing calculations.

One of the motivating factors for examining structures of this class of molecules is that they are known to be highly stable and, indeed, in the case of kalata B1, are apparently orally bioavailable, a most unusual feature for peptide-based molecules. This bioavailability is demonstrated in the native medicine applications of Oldenlandia affinis, the plant from which kalata B1 is derived. Women of certain tribes in the Congo region ingest a tea made from the plant to accelerate labor (18). The possibility of using the cyclic cystine knot macrocyclic disulfide knotted framework as a template in pharmaceutical design applications is thus very attractive (19). The three-dimensional structure reported here provides a basis for these applications, as well as demonstrating that the new trypsin inhibitors form part of the cyclotide family.

    EXPERIMENTAL PROCEDURES
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Isolation and Purification of MCoTIs-- Ripe frozen fruits of M. cochinchinensis were purchased from a Vietnamese store in Melbourne, Australia. The dormant seeds were homogenized using a blender and extracted with 20 mM sodium acetate. The fine debris was removed using a cotton plug. 40% Acetone, H2O (v/v) solution was added to the crude extract at a ratio of ~1:1 (v/v). Acetone was reduced using a rotary evaporator after which the mixture was cooled down to 4 °C and centrifuged for 30 min. The supernatant was then passed through a filter before purification using preparative RP-HPLC1 on a Vydac C18 column. Gradients of 0.1% aqueous trifluoroacetic acid and 90% acetonitrile, 0.09% trifluoroacetic acid were employed with a flow rate of 8 ml/min, and the eluant monitored at 230 nm. Further purification was performed using semipreparative RP-HPLC on a Vydac C18 column. The final purity was examined with analytical RP-HPLC and electrospray mass spectrometry.

NMR Spectroscopy-- Samples for 1H NMR measurements contained 1 mM peptide in either 99.99% D2O or 90% H2O, 10% D2O (v/v) at pH 3.5. Spectra were recorded at 288, 293, and 298 K on a Bruker AVANCE 750 MHz spectrometer. The two-dimensional NMR experiments employed were similar to those used for the related cyclic peptide circulin A (11) and included DQF-COSY, E-COSY, and TOCSY using a MLEV-17 spin lock sequence with a mixing time of 80 ms, and NOESY with mixing times of 150, 200, 250, and 300 ms. Slowly exchanging NH protons were detected by acquiring a series of one-dimensional and TOCSY spectra of the fully protonated peptide immediately following dissolution in D2O. 3Jalpha H-beta H coupling constants were measured from E-COSY spectra and 3JNH-alpha H coupling constants were measured from DQF-COSY spectra, transformed over the fingerprint region to 8 × 1 K. Spectra were processed according to methods given in Daly et al. (11). Chemical shifts were referenced to sodium 2,2-dimethyl-2-silapentane-5-sulfonate at 0.00 ppm and the spectra were analyzed and assigned using XEASY (20).

Structural Restraints-- Peak volumes were integrated using XEASY and distance restraints obtained from CALIBA with appropriate pseudoatom corrections. Backbone dihedral restraints were inferred from 3JNH-alpha H coupling constants, with phi  restrained to 120 ± 30° for a 3JNH-alpha H greater than 9 Hz, and 65 ± 30° for a 3JNH-alpha H less than 6 Hz. Stereospecific assignments and chi 1 angle restraints were obtained for 7 residues using coupling constants measured in an ECOSY spectrum in conjunction with NOE intensities. Both proline residues were in the trans-conformation based on characteristic NOEs. The beta -methylene protons of the proline residues were stereospecifically assigned based on the peak intensities in the NOESY spectra. Six hydrogen bonds were determined based on slow exchange data and preliminary structure calculations. Hydrogen bond restraints of 1.7-2.2 and 2.7-3.2 Å were used for the HNi-Oj and Ni-Oj distances, respectively.

Structure Calculations-- Three-dimensional structures were initially calculated using DYANA (21) to check NOE restraints for violations and resolve ambiguous cross-peaks. Dihedral angle and hydrogen bond restraints were subsequently included in these structure calculations. The final 50 structures were generated using a torsion angle dynamics protocol (22, 23) within the program X-PLOR 3.851 (24). Refinement of these structures was achieved using the conjugate gradient Powell algorithm with 1000 cycles of energy minimization (25) in a refined forcefield based on the program CHARMm (26). Structures were analyzed using PROMOTIF (27) and PROCHECK-NMR (28).

    RESULTS
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

The seeds of M. cochinchinensis were extracted with sodium acetate as described previously (16) and the extract was purified using RP-HPLC. Various peptides were isolated and their molecular weights were analyzed by mass spectrometry. Molecular weights consistent with the previously isolated macrocyclic peptides, MCoTI-I (3480) and MCoTI-II (3453), were present in addition to isomers corresponding to the beta -aspartyl (3453) and succinimide (3435.5) forms of MCoTI-II (16). In the previous study (16) the peptide bond between Asp-4 and Gly-5 was shown to be susceptible to succinimide and beta -aspartyl formation and this was confirmed in the current study.

Sufficient quantities of MCoTI-II and its beta -aspartyl isomer were isolated to allow characterization with NMR spectroscopy. Amino acid sequencing was performed essentially using NMR spectroscopy, as the spin systems observed in TOCSY spectra and the sequential connectivities observed in NOESY spectra were completely consistent with those of the published sequence (16). Assignments were made using established techniques (29) and the 1H chemical shifts are supplied as supplementary material. The fingerprint regions of a TOCSY and NOESY spectra of MCoTI-II are given in Fig. 2. The amide region is very well dispersed and this facilitated the assignment process. The excellent dispersion and large number of cross-peaks in the NOESY spectrum also provided the first indication of a well defined three-dimensional structure.


View larger version (33K):
[in this window]
[in a new window]
 
Fig. 2.   NMR spectra of MCoTI-II. Regions of the 80-ms TOCSY spectrum (top) and 200 ms NOESY spectrum of MCoTI-II in 90% H2O, 10% D2O at 293 K. Spin systems are shown in the TOCSY spectrum and the sequential connectivities in the alpha H-NH region of the NOESY spectrum. The one-letter code for amino acids combined with the residue number is used to denote assignment. The excellent signal dispersion is indicative of a well defined three-dimensional structure.

As the beta -aspartyl homologue of MCoTI-II has an identical molecular weight to the normal (alpha -aspartyl) isomer, NMR spectroscopy was used to discriminate between these molecules. Only one form (i.e. that eluting later on HPLC) displayed the expected sequential NOE connectivities (Fig. 3a) between residues 4 and 5, which strongly suggests this form contains the "normal" peptide backbone. Comparison of the alpha H chemical shifts of both isomers (Fig. 3b) suggests that the molecules have very similar structures, with the exception of the region surrounding residues 4 and 5. Having established the similarity of the two forms, in the current study we have determined the three-dimensional structure of the isomer of MCoTI-II with the alpha -Asp-4 configuration.


View larger version (18K):
[in this window]
[in a new window]
 
Fig. 3.   Comparison of the alpha  and beta  aspartyl isomers of MCoTI-II. a, structure of alpha  and beta  aspartic acid residues and sequential NOEs expected to be observed in the alpha -Asp form. b, comparison of the alpha -proton chemical shifts of the two isomers of MCoTI-II. The fact that the differences occur only near residue Asp-5 suggests that isomerization of this residue (alpha /beta aspartic acid) is responsible for the differences between the two isolated peptides.

Analysis of the NOE data, coupling constants and slow exchange amide protons (Fig. 4a) allowed the secondary structure to be elucidated as shown in Fig. 4b. A beta -hairpin is present within residues 26-34 and there are beta -sheet interactions between this hairpin and residues 13-15. An analysis of secondary shifts (i.e. the differences between observed chemical shifts and random coil values (30)) supported this interpretation of the secondary structure and indicated the presence of beta -strands but no helical regions.


View larger version (20K):
[in this window]
[in a new window]
 
Fig. 4.   NMR information on the secondary structure of MCoTI-II. a, summary of the sequential and medium range NOE information, 3JNH-alpha H coupling constants and slow exchange NH data for MCoTI-II. The top three lines give the sequential NOE data, with the height of the bars indicating the relative NOE strength (strong, medium, or weak). Open and filled squares represent 3JNH-alpha H coupling constants of >= 9 and <= 6.0 Hz, respectively. Filled circles indicate slow exchange amide protons. b, schematic representation of the beta -sheet region of MCoTI-II showing the interstrand NOEs (arrows); alpha alpha NOEs across the sheet are shown as solid lines and potential hydrogen bonds are shown as broken lines.

The three-dimensional structure of MCoTI-II was determined using torsion angle dynamics coded within the program XPLOR (24). The disulfide bonding pattern had not been determined chemically and so it was important to define this using NMR methods. For small disulfide-rich molecules in which the disulfide bonds form a compact core such an approach is often the only one available due to the difficulty of obtaining suitable cleavage products. Preliminary structure calculations without any disulfide bonds included as restraints were consistent with the disulfide connectivities being identical to those in linear squash trypsin inhibitors (31) previously characterized and those in the cyclotide peptides (8) (i.e. Cys I-Cys IV, Cys II-Cys V, and Cys III-Cys VI). This conclusion was based on an analysis of distances between all possible pairs of S atoms in the preliminary structures (Supplementary Material). This clearly shows that the optimal S-S pairing corresponds to the Cys I-Cys IV, Cys II-Cys V, and Cys III-Cys VI connectivity. Subsequent structure calculations were performed with the disulfide bonds formally connected in this way and these yielded structures of low energy and excellent covalent geometry. Combined with the consistency with the homologous squash inhibitors and the plant cyclotides this was taken as strong evidence for the proposed disulfide connectivity. Hydrogen bonds were also inferred from the preliminary structure calculations in conjunction with the slow exchange data, and all were consistent with the beta -sheet shown in Fig. 4b.

The family of structures for MCoTI-II is well defined over most of the molecule (residues 8-34), however, there are few medium or long range NOEs for the first seven residues and this results in a lack of definition in this region (Fig. 5a). This is reflected in the angular order parameters (Supplementary Material) that are generally >0.9 for the phi  and psi  angles for residues 8-34, but vary between 0.2 and 0.9 for residues 1-7. Geometric and energetic statistics that define the 20 lowest energy structures are given as supplementary data. The RMSD's for residues 8-34 are 0.70 ± 0.17 and 1.86 ± 0.38 Å for the backbone atoms and all heavy atoms, respectively. The corresponding values are 0.45 ± 0.11 and 1.56 ± 0.22 Å for superimposition over residues 12-34.


View larger version (39K):
[in this window]
[in a new window]
 
Fig. 5.   Three-dimensional structure of MCoTI-II. a, stereoview of the family of 20 lowest energy structures for MCoTI-II. The structures are superimposed over the backbone atoms of residues 12-33 and the disulfide bonds are shown in pink. b, ribbon representation of the secondary structure elements of MCoTI-II with the disulfide connectivities shown in stick form. Diagrams were generated using MOLMOL (39).

The major features of the three-dimensional structure are a beta -sheet region and several turns. Analysis of the secondary structure elements in the final family of calculated structures with PROMOTIF (27) reveals a gamma  turn between residues 18 and 20 and beta -turns between residues 15-18 and 22-25. A beta -hairpin is formally recognized by PROMOTIF between residues 26 and 34, as expected from the preliminary secondary structure analysis, and a type I beta -turn between residues 28 and 31 connects the strands. The beta -sheet interactions observed between the hairpin and residues 13-15 (Fig. 4b) are not formally recognized by PROMOTIF as part of a triple-stranded beta -sheet. However, this is also found for other related small disulfide-rich peptides and is hence not surprising. In the plant cyclotides (19) the third beta -strand is only loosely connected to the well defined beta -hairpin. Thus, despite the presence of NOEs and slowly exchanging amide protons characteristic of beta -sheets, the orientations of the strands in the structure calculations are such that they are not formally recognized as part of a triple-stranded beta -sheet in all of the structures.

The disulfide bonds of MCoTI-II form a cystine knotted structure similar to that in the plant cyclotides. In this arrangement the embedded ring formed by Cys I-Cys IV and Cys II-Cys V and their connecting backbone segments is threaded by the third disulfide bond, Cys III-Cys VI. A schematic representation of the secondary structure and the cystine knot of MCoTI-II is given in Fig. 5b.

Since the peptide backbone of MCoTI-II is cyclic, a convenient way to describe the structural features is in terms of the six segments, or loops, that are flanked by pairs of successive Cys residues. Fig. 6 shows the backbone considered in this way and highlights several features of the derived structures, including the location of the three beta -strands and the various turns. The disulfide bond connectivity is also shown. Loop 1, between Cys I and Cys II contains a highly positively charged (+3) hexapeptide segment which, based on homology with corresponding acyclic trypsin inhibitors, comprises the putative active site residues Lys/Ile. Loop 2 contains two positive and two negative charges and includes a beta -turn between residues 15-18 and a gamma  turn between residues 18 and 20. Loop 3 contains no charged residues but incorporates a beta -turn within its PGA sequence. Loop 4 contains a single amino acid (I) and is incorporated within one of the beta -strands that make up the beta -hairpin. It is followed by loop 5, which contains a type I beta -turn and leads into loop 6, which incorporates the second strand of the beta -hairpin. In the final folded structures the disulfide bonds "pinch" the circular backbone and loops 2 and 5 are drawn very close to one another.


View larger version (19K):
[in this window]
[in a new window]
 
Fig. 6.   Schematic representation of MCoTI-II structural features. The six cysteine residues are shown as filled circles and labeled I-VI, starting at the sequence CPKI. The disulfide connectivities are shown as lines connecting the cysteine residues along the circular backbone. The position of the turns, beta -strands, and backbone loops are indicated, along with the location of charges.

Loop 6 protrudes from the structure more than any of the other loops and is also the most disordered. Fig. 5 shows that individual structures vary widely in their placement of this loop relative to the core of the molecule. This may reflect intrinsic flexibility or simply a lack of NOEs defining this region. Loop 6 contains a high Gly content and hence the number of possible protons, and hence NOEs, is reduced relative to larger amino acids. To assess whether it was lack of NOEs, or flexibility that results in the disorder for loop 6, the chemical shift separation of the beta -protons of various amino acid side chains was examined. In general, rigid conformations are associated with widely separated beta -proton shifts while, in contrast, degenerate beta -proton shifts can be indicative of conformational averaging. For example, in an analysis of acyclic permutants of kalata B1, those permutants with decreased stability, based on amide exchange rates, displayed a decreased shift dispersion for cysteine beta -protons (32).

Analysis of the beta -proton chemical shift separations for the six cysteine residues of MCoTI-II supports the proposal of flexibility in loop 6. The disulfide bond connecting residues 8 to 25 is the closest disulfide bond to the disordered region and the beta -protons are essentially degenerate for both of these residues, suggesting conformational averaging in this region of the molecule. By contrast, the beta -shifts for the remaining four cysteine residues are separated by 0.17 to 0.34 ppm and correspond to well defined regions of the molecule. The flexibility in this putative linker region was predicted in the model determined by Hernandez et al. (16). The model also suggested contacts between Gly-6 and Ala-24, and indeed NOEs are observed between these residues in our study. However, the possible hydrogen bonding predicted between Ser-1 and Arg-28 was observed in very few of the NMR structures.

A comparison of MCoTI-II with the trypsin inhibitor CMTI-I (31) and the plant cyclotide, kalata B1, is given in Fig. 7. It is apparent the proteins possess similar topologies. For example, the backbone atoms of the cysteine residues of MCoTI-II and CMTI-I superimpose with an RMSD of 0.98 Å. The equivalent RMSD for kalata B1 and MCoTI-II is much higher (6.02 Å) because of the differences in spacings between the cysteine residues. However, when superimposed over the backbone atoms of the four cysteine residues that make up the ring of the cystine knot the RMSD value is 1.83 Å. Another trypsin inhibitor, EETI-II (33), also has a similar structure to MCoTI-II and superimposes with an RMSD of 1.37 Å over the backbone atoms of the cysteine residues.


View larger version (14K):
[in this window]
[in a new window]
 
Fig. 7.   A comparison of the three-dimensional structures of CMTI-I (a), MCoTI-II (b), and kalata B1 (c). The disulfide bonds are shown in yellow. The coordinates for EETI-II and kalata B1 were obtained from the Protein Data Bank. The overall structures are very similar for both the trypsin inhibitors, CMTI-I and MCoTI-II, and for the plant cyclotide kalata B1. The diagram was generated using MOLMOL (39).


    DISCUSSION
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

In the current study we have determined the three-dimensional structure of MCoTI-II, a 34-amino acid trypsin inhibitor from M. cochinchinensis. It has the unusual feature of a head-to-tail cyclized peptide backbone, and thus joins the ranks of an increasing number of small circular proteins discovered over the last five years. The main element of secondary structure is a beta -hairpin and associated cystine knot framework. A third strand of beta -sheet is loosely connected with this hairpin and the molecule also contains several turns. The overall fold is very similar to that of the squash trypsin inhibitors EETI-II and CMTI-I. Both have high sequence homology with MCoTI-II but are "conventional" proteins in that they lack the head-to-tail cyclization of MCoTI-II.

The similarity of the overall fold of the cyclic inhibitor to those of linear homologues raises the question of what is the advantage/biological role of cyclization? Several possibilities could be contemplated, including factors relating to conformational or dynamic stabilization, changes in physicochemical properties, addition of new functionality, or protection from proteolytic cleavage. We examine these possibilities in turn and then compare this newly discovered cyclic plant protein with the previously reported cyclotide family (8) of macrocyclic knotted peptides.

Possible conformational effects of cyclization could include stabilization of the active site residues into a preferred conformation for binding, or entropic stabilization associated with tethering the ends of linear homologues. The termini of small peptides are often disordered and hence contribute to unfavorable entropic losses on binding to target proteins. Cyclization can potentially reduce such losses by reducing mobility near the termini. However, this does not seem to be the primary role of cyclization in the case of MCoTI-II, since the putative linker involved in cyclization is in fact quite flexible. While the gene coding for MCoTI-II has yet to be reported, the mature peptide is likely to be derived from a larger precursor protein, processing of which results in the ligation of N- and C-terminal regions to form loop 6. This is based on analogy with the gene for the related open chain peptide TGTI-II from Towel gourd (34), whose sequence is consistent with loops 1-5 forming a contiguous region of the peptide chain. From an entropic perspective it is therefore surprising that the introduced loop 6 linker region is highly disordered, and apparently flexible, in the cyclic protein. The assumption that the disorder in this region reflects flexibility, rather than simply a lack of NOEs is supported by an analysis of chemical shift dispersion of beta -methylene protons.

Given that entropic factors do not explain the role of cyclization it was of interest to see if there might be specific conformational changes associated with cyclization that are important in a static rather than dynamic sense, e.g. in stabilizing a particular binding conformation. It has been suggested that the mechanism of inhibition of the MCo peptides is identical to that for other squash trypsin inhibitors, based on the sequence conservation in the putative inhibitory loop (16). The local conformations of the residues presumed to be at the active site are summarized in Table I for MCoTI-II and a range of linear homologues, both in solution and complexed to trypsin. Comparison of a linear squash inhibitor, CMTI-I (31), with the cyclic MCoTI-II reveals similar backbone angles in solution, with the most significant difference being in the psi  angle of the P1 residue (Lys-10) and to a lesser extent in the phi  angle of the P1' residue. The solution structures of MCoTI-II and CMTI-I are also very similar to the bound structures listed in Table I with the exception of the psi  angle of the P1 residue. The lack of a major conformational change between MCoTI-II and linear homologues, either in solution or bound states, suggests that cyclization does not play a significant role in determining the active site conformation.

                              
View this table:
[in this window]
[in a new window]
 
Table I
Backbone angles (°) of MCoTI-II and selected trypsin inhibitors

The final inhibitor listed in Table I, sunflower trypsin inhibitor I differs significantly from the others in that it is much smaller (14 residues), contains a single disulfide bond and unlike all except MCoTI-II, is cyclic (17). It is a particularly potent trypsin inhibitor despite this small size and hence provides an opportunity to assess the essential residues for activity. Its three-dimensional structure in complex with bovine beta -trypsin has been determined using x-ray crystallography (17) and the active site residues are in similar positions to those observed for the other squash inhibitors (Table I). The fact that potent trypsin inhibition can be obtained with such a small inhibitor is consistent with the fact that peripheral regions of the MCoTI-II inhibitor are not essential for binding activity and function, and hence play other roles. Conformational or entropic roles for the loop 6 linker region have been discounted and hence factors relating to additional functionality or modulation of biophysical properties can be considered.

Another possible role for cyclization is that of protection from proteolytic cleavage. Indeed, based on the discussion above, which suggests that conformational or entropic roles are unlikely, this seems the most likely function of cyclization in MCoTI-II. Cyclization of small peptides is a well established method used in the pharmaceutical industry to protect small bioactive peptides against cleavage by in vivo exopeptidases. Being a seed component, the MCoTI-II peptide most likely has to survive intact for long periods prior to germination and its function would be compromised if broken down by exoproteases. MCoTI-II has also been shown to be resistant to the endoprotease thermolysin (16). This is analogous with the plant cyclotide, kalata B1, which is resistant to a range of enzymes including thermolysin, trypsin, and pepsin (1). In general the naturally occurring cyclization of macrocyclic peptides appears to be an effective means of stabilization, much as is the case for small synthetic peptides.

Kalata B1 is just one member of what is now a large family of cyclotides (8). Although MCoTI-II is of similar size (34 versus 29 amino acids) and cystine spacings, it displays no sequence homology to the previously identified plant cyclotides. Nevertheless, the three-dimensional structure is similar, and contains identical elements of secondary structure and the cystine knot motif. The combined presence of a cyclic peptide backbone and a cystine knot motif means that MCoTI-II fits into the cyclic cystine knot structural framework (8, 15). The structural similarity suggests the possibility that the cyclotides and the Momordica peptides may be evolutionarily related, despite the fact that they have different biological activities. Furthermore, the observation of different biological activities for the various members of the cyclotide family provides proof of the concept that the cyclic cystine knot framework is particularly stable and amenable to grafting of a range of different bioactivities (15).

Fig. 7 shows that the size of the cystine knot motif in the Momordica peptides is slightly different from those of the known cyclotides and that the embedded ring in the structure contains 11 amino acids rather than eight. This makes the cystine knot slightly looser, however, the derived family of structures are still very well defined over the core of the cystine knot. By contrast, the linker region associated with backbone cyclization appears much more flexible in the macrocyclic trypsin inhibitors than in the previously reported plant cyclotides. In the former case, the linker region consists of small hydrophilic amino acids, whereas in the plant cyclotides the residues are generally larger.

The previously reported cyclotides are characterized by their long retention times on RP-HPLC. This is presumed to result from a patch of surface-exposed hydrophobic residues (1). In the case of kalata B1 we have shown that this patch arises after formation of the native disulfide connectivities (10). The macrocyclic peptides from MCo elute significantly earlier than the cyclotides, suggesting decreased hydrophobicity. The MCo peptides also contain many more charged residues, particularly positively charged residues, than the cyclotides. The distribution of hydrophobic and charged residues for MCoTI-II is shown in Fig. 8. Several of the hydrophobic residues are surface exposed, presumably as a consequence of the core of the molecule containing the disulfide bonds and leaving very little room for other residues. These surface-exposed hydrophobic residues appear mainly on one face of the molecule. The other face of the molecule contains most of the positively and negatively charged residues, with the positive residues clustered together. Clustering of hydrophobic and cationic residues has previously been found to be crucial for antimicrobial activity. The distribution of these residues on the surface of MCoTI-II indicates that such an activity may be possible for this molecule. This is supported by the report that a preparation from M. cochinchinensis seeds, which contained mainly MCoTI-II, exhibited antimicrobial properties (16).


View larger version (58K):
[in this window]
[in a new window]
 
Fig. 8.   The three-dimensional structure of MCoTI-II shown in CPK format. The negatively charged residues are in red, positively charged in dark blue, hydrophobic residues in green, polar residues in light blue, and cysteine residues in yellow. Surface exposed hydrophobic residues appear mainly on one face and the other face contains most of the positively and negatively charged residues. The views are rotated 180° about the vertical axis with respect to each other. The diagram was generated using MOLMOL (39).

An interesting difference between the previously reported cyclotides and the circular squash peptides is the absence of linear counterparts for the cyclotides. Linear squash inhibitors have been known for many years, however, so far there has been no evidence for linear forms of the cyclotides. This may be a consequence of the isolation techniques that have been employed to date, or may reflect highly efficient processing of the cyclotides from putative linear precursors. Despite these differences between MCoTI-II and the plant cyclotides the similar three-dimensional structure and cyclic backbone suggests they can be regarded as part of the same family of proteins. Thus the cyclotide family can be expanded to include the new cyclic trypsin inhibitors MCoTI-I and MCoTI-II.

This expansion of the cyclotide family to include trypsin inhibitors from the Cucurbitaceae family highlights the importance and functional variability of these peptides. The increasing incidence of discoveries of naturally occurring circular proteins coincides with the excitement currently associated with the use of intein-based methods for the artificial production of circular proteins (35-38).

    ACKNOWLEDGEMENTS

We thank the Institute for Molecular Bioscience, a Special Research Center of the Australian Research Council.

    FOOTNOTES

* This work was supported in part by a grant from the Australian Research Council (to D. J. C.).The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The atomic coordinates and the structure factors (code 1IB9) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).

The on-line version of this article (available at http://www.job.org) contains Tables 1-3 and Fig. 1.

Dagger Australian Research Council Senior Fellow. To whom correspondence should be addressed: Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD 4072, Australia. Tel.: 61-7-3365-4945; Fax: 61-7-3365-2487; E-mail: d.craik@imb.uq.edu.au.

Published, JBC Papers in Press, April 5, 2001, DOI 10.1074/jbc.M101666200

    ABBREVIATIONS

The abbreviations used are: RP-HPLC, reversed-phase high performance liquid chromatography; MCo, Momordica cochinchinensis; TOCSY, two-dimensional total correlation spectroscopy; NOE, nuclear Overhauser effect; NOESY, two-dimensional NOE spectroscopy; DQF-COSY, double quantum filtered correlation spectroscopy; RMSD, root mean square deviation.

    REFERENCES
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

1. Saether, O., Craik, D. J., Campbell, I. D., Sletten, K., Juul, J., and Norman, D. G. (1995) Biochemistry 34, 4147-4158[Medline] [Order article via Infotrieve]
2. Gustafson, K. R., Sowder, R. C., II, Henderson, L. E., Parsons, I. C., Kashman, Y., Cardellina II, J. H., McMahon, J. B., Buckheit, R. W., Jr., Pannell, L. K., and Boyd, M. R. (1994) J. Am. Chem. Soc. 116, 9337-9338
3. Gustafson, K. R., Walton, L. K., Sowder, R. C. I., Johnson, D. G., Pannell, L. K., Cardellina, J. H. I., and Boyd, M. R. (2000) J. Nat. Prod. 63, 176-178[CrossRef][Medline] [Order article via Infotrieve]
4. Witherup, K. M., Bogusky, M. J., Anderson, P. S., Ramjit, H., Ransom, R. W., Wood, T., and Sardana, M. (1994) J. Nat. Prod. 57, 1619-1625[Medline] [Order article via Infotrieve]
5. Schöpke, T., Hasan Agha, M. I., Kraft, R., Otto, A., and Hiller, K. (1993) Sci. Pharm. 61, 145-153
6. Hallock, Y. F., Sowder, R. C. I., Pannell, L. K., Hughes, C. B., Johnson, D. G., Gulakowski, R., Cardellina, J. H. I., and Boyd, M. R. (2000) J. Org. Chem. 65, 124-128[CrossRef][Medline] [Order article via Infotrieve]
7. Göransson, U., Luijendijk, T., Johansson, S., Bohlin, L., and Claeson, P. (1999) J. Nat. Prod. 62, 283-286[CrossRef][Medline] [Order article via Infotrieve]
8. Craik, D. J., Daly, N. L., Bond, T., and Waine, C. (1999) J. Mol. Biol. 294, 1327-1336[CrossRef][Medline] [Order article via Infotrieve]
9. Tam, J. P., Lu, Y. A., Yang, J. L., and Chiu, K. W. (1999) Proc. Natl. Acad. Sci. U. S. A. 96, 8913-8918[Abstract/Free Full Text]
10. Daly, N. L., Love, S., Alewood, P. F., and Craik, D. J. (1999) Biochemistry 38, 10606-10614[CrossRef][Medline] [Order article via Infotrieve]
11. Daly, N. L., Koltay, A., Gustafson, K., R., Boyd, M. R., Casas-Finet, J. R., and Craik, D. J. (1999) J. Mol. Biol. 285, 333-345[CrossRef][Medline] [Order article via Infotrieve]
12. McDonald, N. Q., and Hendrickson, W. A. (1993) Cell 73, 421-424[Medline] [Order article via Infotrieve]
13. Isaacs, N. W. (1995) Curr. Opin. Struct. Biol. 5, 391-395[CrossRef][Medline] [Order article via Infotrieve]
14. Pallaghy, P. K., Nielsen, K. J., Craik, D. J., and Norton, R. S. (1994) Protein Sci. 3, 1833-1839[Abstract/Free Full Text]
15. Craik, D. J., Daly, N. L., and Waine, C. (2001) Toxicon 39, 43-60[CrossRef][Medline] [Order article via Infotrieve]
16. Hernandez, J. F., Gagnon, J., Chiche, L., Nguyen, T. M., Andrieu, J. P., Heitz, A., Trinh Hong, T., Pham, T. T., and Le Nguyen, D. (2000) Biochemistry 39, 5722-30[CrossRef][Medline] [Order article via Infotrieve]
17. Luckett, S., Garcia, R. S., Barker, J. J., Konarev, A. V., Shewry, P. R., Clarke, A. R., and Brady, R. L. (1999) J. Mol. Biol. 290, 525-33[CrossRef][Medline] [Order article via Infotrieve]
18. Gran, L., Sandberg, F., and Sletten, K. (2000) J. Ethnopharmacol. 70, 197-203[CrossRef][Medline] [Order article via Infotrieve]
19. Craik, D. J., Daly, N. L., and Nielsen, K. J. (2000) PCT Int. Patent Appl. 43 pp WO 0015654
20. Eccles, C., Guntert, P., Billeter, M., and Wüthrich, K. (1991) J. Biomol. NMR 1, 111-130[Medline] [Order article via Infotrieve]
21. Guntert, P., Mumenthaler, C., and Wüthrich, K. (1997) J. Mol. Biol. 273, 283-298[CrossRef][Medline] [Order article via Infotrieve]
22. Rice, L. M., and Brünger, A. T. (1994) Proteins 19, 277-290[Medline] [Order article via Infotrieve]
23. Stein, E. G., Rice, L. M., and Brünger, A. T. (1997) J. Magn. Reson. 124, 154-164[CrossRef][Medline] [Order article via Infotrieve]
24. Brünger, A. T. (1996) X-PLOR Manual Version 3.851 , Yale University, New Haven, CT
25. Clore, G. M., Brünger, A. T., Karplus, M., and Gronenborn, A. M. (1986) J. Mol. Biol. 191, 523-551[Medline] [Order article via Infotrieve]
26. Brooks, B. R., Bruccoleri, R. E., Olafson, B. D., States, D. J., Swaminathan, S., and Karplus, M. (1983) J. Comput. Chem. 4, 187-217
27. Hutchinson, E. G., and Thornton, J. M. (1996) Protein Sci. 5, 212-220[Abstract/Free Full Text]
28. Laskowski, R. A., Rullmannn, J. A., MacArthur, M. W., Kaptein, R., and Thornton, J. M. (1996) J. Biomol. NMR 8, 477-486[Medline] [Order article via Infotrieve]
29. Wüthrich, K. (1986) NMR of Proteins and Nucleic Acids , Wiley-Interscience, New York
30. Wishart, D. S., Bigam, C. G., Holm, A., Hodges, R. S., and Sykes, B. D. (1995) J. Biomol. NMR 5, 67-81[Medline] [Order article via Infotrieve]
31. Holak, T. A., Gondol, D., Otlewski, J., and Wilusz, T. (1989) J. Mol. Biol. 210, 635-648[Medline] [Order article via Infotrieve]
32. Daly, N. L., and Craik, D. J. (2000) J. Biol. Chem. 275, 19068-19075[Abstract/Free Full Text]
33. Heitz, A., Chiche, L., Le-Nguyen, D., and Castro, B. (1989) Biochemistry 28, 2392-2398[Medline] [Order article via Infotrieve]
34. Ling, M. H., Qi, H. Y., and Chi, C. W. (1993) J. Biol. Chem. 268, 810-814[Abstract/Free Full Text]
35. Scott, C. P., Abel-Santos, E., Wall, M., Wahnon, D. C., and Benkovic, S. J. (1999) Proc. Natl. Acad. Sci. U. S. A. 96, 13638-13643[Abstract/Free Full Text]
36. Paulus, H. (2000) Annu. Rev. Biochem. 69, 447-496[CrossRef][Medline] [Order article via Infotrieve]
37. Evans, T. C., Jr., Benner, J., and Xu, M. Q. (1999) J. Biol. Chem. 274, 18359-18363[Abstract/Free Full Text]
38. Iwai, H., and Pluckthun, A. (1999) FEBS Lett. 459, 166-172[CrossRef][Medline] [Order article via Infotrieve]
39. Koradi, R., Billeter, M., and Wüthrich, K. (1996) J. Mol. Graph. 14, 51-5[CrossRef][Medline] [Order article via Infotrieve]., 29-32
40. Huang, Q., Liu, S., and Tang, Y. (1993) J. Mol. Biol. 229, 1022-1036[CrossRef][Medline] [Order article via Infotrieve]
41. Bode, W., Greyling, H. J., Huber, R., Otlewski, J., and Wilusz, T. (1989) FEBS Lett. 242, 285-292[CrossRef][Medline] [Order article via Infotrieve]
42. Helland, R., Berglund, G. I., Otlewski, J., Apostoluk, W., Andersen, O. A., Willassen, N. P., and Smalas, A. O. (1999) Acta Crystallogr. D Biol. Crystallogr. 55, 139-148[CrossRef][Medline] [Order article via Infotrieve]
43. Di Marco, S., and Priestle, J. P. (1997) Structure 5, 1465-1474[Medline] [Order article via Infotrieve]


Copyright © 2001 by The American Society for Biochemistry and Molecular Biology, Inc.