©1995 by The American Society for Biochemistry and Molecular Biology, Inc.
Structure-Activity Relationship of the Leucine-based Sorting Motifs in the Cytosolic Tail of the Major Histocompatibility Complex-associated Invariant Chain (*)

(Received for publication, June 12, 1995; and in revised form, August 2, 1995)

Andrea Motta (1)(§) Bjørn Bremnes (2) Maria A. Castiglione Morelli (3) Rainer W. Frank (4) Gabriella Saviano (1) Oddmund Bakke(§) (2)

From the  (1)From Istituto per la Chimica di Molecole di Interesse Biologico del CNR, I-80072 Arco Felice (Napoli), Italy, the (2)Division of Molecular Cell Biology, Department of Biology, University of Oslo, N-0316 Oslo, Norway, the (3)Dipartimento di Chimica, Università della Basilicata, I-85100 Potenza, Italy, and the (4)Zentrum für Molekulare Biologie, Universität Heidelberg, Postfach 106249, D-69052 Heidelberg, Federal Republic of Germany

ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
FOOTNOTES
ACKNOWLEDGEMENTS
REFERENCES

ABSTRACT

The cytosolic tail of the major histocompatibility complex-associated invariant chain protein contains two Leu-based motifs that both mediate efficient sorting to the endocytic pathway. Nuclear magnetic resonance data on a peptide of 27 residues corresponding to the cytosolic tail of human invariant chain indicate that in water at pH 7.4 the membrane distal motif Leu^7-Ile^8 lies within a nascent helix, while the membrane proximal motif Met-Leu is part of a turn. The presence of a small amount of methanol stabilizes an alpha helix from Gln^4 to Leu with a kink on Pro. Point mutations of the cytosolic tail of the protein suggest that amino-terminal residues located in spatial proximity to the Leu motifs contribute to efficient internalization and targeting to endosomes in transfected COS cells. Residues on the spatially opposite side of the Leu motifs were, on the other hand, mutated with no measurable effect on targeting. Structural and biological data thus suggest that the signals are not continuous but consist of ``signal patches'' formed by the three-dimensional structure of the cytosolic tail of invariant chain.


INTRODUCTION

In transmembrane proteins, specific signals for endosomal or lysosomal sorting have been identified within the cytosolic tails. In a number of proteins sorted to the endosomal pathway, a Tyr-containing motif seems to mediate internalization from the plasma membrane (for review, see (1) and (2) )) and form a tight beta turn functionally comparable to the internalization signal found in cell surface receptors(3, 4, 5, 6) . However, recent NMR analysis of a synthetic peptide corresponding to the extreme 21 carboxyl-terminal amino acid residues of the cytosolic domain of trans-Golgi network (TGN) (^1)protein TGN38/41 (which is routed from the plasma membrane and back to TGN) shows that this Tyr-containing internalization signal lies within a nascent helix (7) .

Tyrosine signals are not universal for sorting of membrane proteins to the endosomal/lysosomal pathway. In fact, the cytosolic tail of the lysosomal membrane protein LIMP II contains no Tyr but a Leu-Ile signal, located two residues from the carboxylic end, which mediates efficient sorting(8, 9) . Letourneur and Klausner (10) reported that a Leu-Leu motif could mediate lysosomal targeting of the CD3- and - chains of the T-cell receptor complex. This signal was functional, even if located at the carboxyl-terminal end, and together with a Tyr-based motif they were individually sufficient to induce endocytosis and delivery to lysosomes. Two similar signals were also reported in the cytosolic tail of the mannose 6-phosphate-receptor by Johnson and Kornfeld(11) . The above studies may suggest that the Tyr signal is primarily mediating sorting to the endocytic pathway via internalization from the plasma membrane, whereas the Leu signal alone, or in combination with the Tyr signal, mediates direct sorting from the TGN to the endocytic pathway. However, both signals internalize efficiently plasma membrane proteins, and further information is needed to clarify the requirements for selecting the pathway from the TGN to endosomes (for further discussion see(2) ).

The invariant chain (Ii), which efficiently targets the associated major histocompatibility complex class II molecules to the endosomal/lysosomal pathway, is another example of a protein with cytosolic Leu-sorting motifs(12, 13, 14) . Invariant chains from different species contain in their cytosolic tails pairs of Leu-Ile and Met-Leu (Ile-Leu), and mutational analysis of the cytosolic Ii tail fused to reporter molecules have shown that these signals can independently mediate endosomal targeting(15, 16) . Distribution and internalization studies also show that a significant fraction of the Ii molecules, alone or in complex with major histocompatibility complex class II, were transported to endosomes via the plasma membrane(17) . Furthermore, the Leu-Ile and Met-Leu motifs individually mediate rapid internalization of a chimeric protein (INA) obtained by fusing neuraminidase (NA) to the cytosolic tail of Ii(15) . In another study the Ii tail was fused to the transferrin receptor, but the measured sorting via the plasma membrane was not sufficient to account for the amount of protein synthesized, indicating sorting also directly from the TGN(16) . The accumulated data thus suggest that Leu-based motifs are actively engaged in the sorting of membrane proteins to the endosomal/lysosomal pathway, both directly from the TGN and via the plasma membrane.

Here we report structural and mutational studies on the 27-amino acid synthetic peptide Ii-(1-27) (Fig. SI), corresponding to the amino-terminal region of human Ii. NMR results demonstrate, for the first time, that in water Leu^7-Ile^8 and Met-Leu internalization motifs lie within a nascent helix and a turn, respectively, but can take up a kinked helix in the presence of small amount of methanol. Site-directed mutagenesis has been performed to determine in which context the two internalization signals are functional. By comparing the biological data with the three-dimensional structure we conclude that the spatial arrangement of the leucine motif is essential for a functional signal and that it is not a continuous signal but a ``signal patch'' that depends upon the secondary structure of the protein.

Met^1-Asp-Asp-Gln-Arg^5-Asp-Leu-Ile-Ser-Asn-Asn-Glu-Gln-Leu-Pro-Met-Leu-Gly-Arg-Arg-Pro-Gly-Ala-Pro-Glu-Ser-Lys

Scheme I. Amino acid sequence of the Ii-(1-27) peptide corresponding to the cytosolic tail of human invariant chain. The di-leucine-like signals are in italics.




EXPERIMENTAL PROCEDURES

Peptide Synthesis

The peptide was synthesized on polyoxyethylene-polystyrene graft resin in a continuous flow instrument constructed and operated as described by Frank and Gausepohl(18) . Peptide chain assembly was performed using Fmoc (fluorenylmethoxycarbonyl) chemistry (19) and in situ activation of amino acid building blocks by PyBOP(20) . The peptide was purified by reversed phase high performance liquid chromatography and characterized by laser desorption mass spectrometry.

NMR Data Collection

Samples were prepared by dissolving lyophilized Ii-(1-27) in 95% ^1H(2)O, 5% ^2H(2)O (v/v) and in 20% C^2H(3)O^1H, 80% ^1H(2)O (v/v), in phosphate buffer at pH 7.4 and 283 K. No corrections of pH readings were made for isotope effects. The concentration was 2.6 times 10M in all solvents. C^2H(3)O^1H was obtained from Cambridge Isotope Laboratories (Woburn, MA). Spectra were acquired with a Bruker AMX 500-MHz spectrometer interfaced to an Aspect X32 computer and referenced to 3-(trimethylsilyl)propionic acid sodium salt. Two-dimensional NMR spectra, namely double-quantum-filtered correlated spectroscopy (DQF-COSY)(21) , nuclear Overhauser enhancement (NOE) spectroscopy (NOESY) (22) and total correlation spectroscopy (TOCSY)(23) , were recorded in the phase-sensitive mode with quadrature detection in (1) provided by the time-proportional phase incrementation scheme(24) . Usually, 16-32 transients of 2048 points were collected for each of the 512 t(1) increments, with a spectral width of 6024 Hz. Time-domain data matrices were all zero-filled in the (1) dimension to 2048 and to 4096 in the (2), thus yielding a digital resolution of 5.88 and 2.94 Hz/point, respectively. Lorentz-to-Gauss resolution enhancement in both dimensions were used as weighting functions before transformation. NOESY experiments were obtained with different mixing times with no random variation. TOCSY experiments, modified to be clean(25) , were recorded with a spin-lock period of 0.064 s, achieved with the MLEV-17 pulse sequence(26) . Irradiation of the strong solvent signal was achieved in the coherent mode (27) Zuiderweg et al., 1986) during the relaxation time, and in the case of the NOESY, during the mixing time. Slowly exchanging protons at pH (uncorrected meter reading) 7.0 and 283 K, were identified by recording a NOESY spectrum (0.20 s mixing time) of the peptide immediately after dissolution in C^2H(3)O^2H-^2H(2)O. Measurement of the ^3J coupling constants was obtained from one-dimensional experiments acquired with 131,072 points and application of strong Lorentz-to-Gauss resolution enhancement or estimated, as apparent values, by measuring antiphase peak separations in the DQF-COSY experiment(28) .

Calculation of Structures

The input used for structure calculations of the peptide consisted of 186 NOE upper distance constraints, which were derived from NOESY spectra recorded at 0.060, 0.10, and 0.20 s of mixing times. A semiquantitative correlation between NOESY cross-peaks and proton-proton distances was obtained by comparing the effects observed at different mixing times with the standard distances observed in protein structures(29) . Strong, medium, and weak intraresidual and sequential NOEs observed at 0.060 s were translated into three distance ranges: 0.26-0.28 nm, 0.29-0.31 nm, and 0.33-0.36 nm, respectively. Looser upper limits were set for medium range NOEs observed only in the 0.10-s and the 0.20-s NOESY spectra, with a cut-off distance restraints of 0.40 and 0.46 nm, respectively. The beta-methylene groups were stereospecifically assigned with the program HABAS(30) , which uses intraresidual and sequential NOEs and vicinal coupling constants ^3J and ^3J. Five beta-methylene groups (namely, those of residues 4, 11, 12, 13, and 25) were stereo-specifically assigned. When no stereo-specific assignment was possible for methylene and methyl protons, distance constraints were corrected for the pseudo atom representation(31) . For methyl groups, an additional correction of 0.05 nm was added for the highest apparent intensity of methyl resonances(32) . Distance-geometry calculations were performed with the DIANA program(33) . 20 dihedral angle constraints based on experimental ^3J coupling constants were also used as input data. The variable target function was changed according to the standard strategy from level 1 to 27. A total of 100 starting structures were generated from random choices of dihedral angles. The best nine structures, in terms of distance violations, were chosen for further refinement. Energy minimization calculations were performed with the GROMOS software package(34) ; bond length constraints were applied with the SHAKE method(35, 36) . In order to release the strain caused by bad van der Waal contacts, while retaining the features of the original distance-geometry structures, 800 steps of steepest descent restrained energy minimization were applied. The list of nonbonded neighboring atom pairs was updated every 10 cycles of energy minimization. A cut-off radius of 0.08 nm was used beyond which no nonbonded interactions were evaluated. Distance restraints obtained from NMR measurements were incorporated into calculations as a semiharmonic potential function with a force constant of 1000 kJbulletnmbulletmol. Graphical representation and root mean square deviation analysis between energy-minimized structure pairs were carried out with the program SYBYL (Tripos, Inc).

Expression Vector and Plasmid Constructions

pSV51L is a late replacement vector with a short polylinker behind the SV40 promoter, and this vector has been shown to give high transient expression of several proteins in simian cells(12, 37) . The cDNA for neuraminidase is from human influenza strain A7/Victoria3/75(38) , and the various INA constructs consisting of the cytosolic tail of Ii have been described earlier(15, 39) . The deletion mutants Delta11INA is lacking the amino acid 2-11 of the Ii tail, and Delta12-29INA is lacking residues 12-29. Point mutations in the cytosolic tail of Ii were introduced by site directed mutagenesis in a single-stranded M13mp19 vector by the method of Kunkel(40) . The mutagenized tail regions were confirmed by DNA sequencing.

Cell Culture and Transient Expression in COS Cells

The COS cells are derived from CV1 cells transformed with an origin-defective mutant of SV40 coding for the wild type T-antigen(41) . The cell lines were maintained in Dulbecco's modified Eagle's medium supplemented with 10% fetal calf serum (DMEM-FCS). The cells were seeded into 35-mm wells the day before transfection at 50% confluence. For transfection 0.5 µg of plasmid DNA was dissolved in 0.1 ml of DMEM with 10% NuSerum (Collaborative Research). The solution was mixed with 1 ml of DMEM-NuSerum containing 400 µg/ml DEAE-dextran and 0.1 mM chloroquine. The cells were washed twice in phosphate-buffered saline (pH 7.4) before the DNA was added (1 ml/35-mm well). After 3-4 h at 310 K, 6% CO(2) the cells were given a Me(2)SO shock (10% dimethyl sulfoxide, 2-3 min) and grown for a further 2 days in DMEM-FCS for expression of the proteins.

Antibodies and Immunofluorescence

The mouse monoclonal antibody NC71 which both recognize the luminal domain of neuraminidase was a gift of Dr A. Douglas, Mill Hill, London. Two days after transfection, the cells were labeled for immunofluorescence with fluorescein isothiocyanate and Texas Red-conjugated goat anti-rabbit Ig and goat anti-mouse Ig antibodies (Dianova, Hamburg, Federal Republic of Germany), as described earlier(12) .

Purification and I Labeling of Antibodies

Mouse ascites NC71 (IgG(1)) was precipitated with 0.18 g/ml Na(2)SO(4) and purified on a protein A-Sepharose (Pharmacia) column at pH 8.0 before iodination by IODO-GEN(TM) as described by the manufacturer (Pierce). The amount of acid-soluble and precipitable material was determined by trichloroacetic acid precipitation and counting in a Cobra Auto-Gamma counter. The amount of soluble radioactivity in the samples was usually as low as 1-2%.

Internalization of I-Labeled Antibodies

Transiently transfected COS cells in 35-mm wells were incubated with I-labeled NC71 in DMEM-FCS (approximately 1 µg/ml) on ice for 2 h. The cells were then washed six times in ice-cold phosphate-buffered saline with 2% FCS. The chase was performed in DMEM-FCS at 310 K for different periods of time. Cells were then cooled on ice and treated twice with 0.5 M acetic acid in 0.15 M NaCl (pH 2.5) for 7 min. This step removed 95-98% of the surface bound antibody. The cells were removed from the wells by lysis in 1 M NaOH. The acid wash and the lysed samples were counted in the -counter. Each time point was performed in duplicate. Internalized antibody was calculated as the antibody resistant to the low pH wash relative to the antibody bound before the chase period. Typically, when cells were sham-transfected with the vector without insert and incubated with I-NC71 on ice and washed, the cell-bound activity was less than 1% of the activity bound to cells transfected with INA variants.


RESULTS

NMR Analysis

Ii-(1-27) peptide was studied at pH 7.4 in phosphate buffer at 283 K in water and water/methanol mixture, respectively. Assignment of proton spin systems was obtained with the sequential methodology outlined by Wüthrich(29) . From the amide protons, TOCSY experiments allowed identification of the alpha and the beta protons of almost all of amino acids. Residues with long side chains were identified by a combination of TOCSY and NOESY experiments. Individual spin systems were placed in the primary structure by identification of characteristic short and medium range NOE connectivities.

Fig. 1reports the summary of NOE information for Ii-(1-27) in water at 283 K and pH 7.4. Together with the strong alphaCH(i)-NH connectivities, a number of NOEs were observed between the NH resonances of sequential amino acids in the regions before and after Pro, the strongest being observed in the Asp^6-Asn region and between Met and Leu. Proximity of NHs of adjacent residues requires a kink in the backbone, a conformation associated with a beta turn or with an alpha helix(42) . Accordingly, the NOEs between pairs of NHs in the region Arg^5-Glu suggest that consecutive turns are present in this part of the peptide and that Met-Leu is part of a turn. An ensemble of consecutive turns resembles a helix-like conformation, since distance constraints in tight turns are similar to those in helical segments(29) . This is confirmed by alphaCH(i)-NH(i) connectivities (Arg^5-Leu^7, Asp^6-Ile^8, Leu^7-Ser^9, Ile^8-Asn, and Ser^9-Asn), and by betaCH(i)-NH(i) cross-peaks (Gln^4-Arg^5, Leu^7-Ile^8, Ser^9-Asn, and Glu-Gln). However, the contemporary presence of alphaCH(i)-NH(i) and NH(i)-NH(i) NOEs (Fig. 1) indicates that the local helix-like structure in the region at the amino-terminal side of Pro dynamically transforms into extended conformations.


Figure 1: Diagrammatic representation of sequential and secondary structural interresidues NOEs observed for Ii-(1-27) in water at 283 K and pH 7.4. Atoms involved in the NOEs are listed in the text. Strong, medium, and weak NOEs are indicated by thick, medium, and thin lines, respectively. Empty bars refer to effects to the protons of Pro residues.



The absence of a stable helical structure is confirmed by the lack of characteristic NOEs and by the ^3J coupling constants values. We did not observe loop to loop interproton NOEs (e.g. alpha of residue i to the NHs of residues i+3 and i+4), although several residues show the NH to NH interaction. On the other hand, only the amides of residues Leu^7 and Ser^9 have coupling constants (5.9 Hz) similar to that expected for a stable helical structure(43) , while the ^3J coupling constants for the remaining residues in that region all are > 7 Hz. These results (stretch of sequential and short range NOEs, the absence of helix-defining NOEs within a conformation ensemble giving rise to largely averaged coupling constants) argue for the presence of ``nascent helix'' structure(44) , including the Leu^7-Ile^8 sorting signal of invariant chain. The term nascent helix refers to an ensemble of interconverting extended chain and turn-like structures existing over a peptide sequence at the earliest stages of helix initiation.

From Pro onward, the peptide also assumes turn-like conformations up to Ala, as suggested by NH(i)-NH(i), betaCH(i)-NH(i) and alphaCH(i)-NH(i) NOE connectivities (Fig. 1). In particular, the presence of strong NOEs between the amide protons of Met and Leu and between alphaCH of Pro and NH of Leu suggest that the second signal Met-Leu is part of a turn.

A nascent helix can be stabilized by small amounts of organic co-solvent(44) . Upon addition of methanol at a concentration of 20% (v/v) to the Ii-(1-27) aqueous solution, all resonances needed to be reassigned because one-to-one comparison between the two solvents was not possible. Complete sequential assignment was made as described above. Stabilization of the helical structure was confirmed by the sequential (alphaCH(i)-NH(i) and NH(i)-NH(i)) and medium-range (alphaCH(i)-NH, n geq 2, and alphaCH(i)-betaCH(i)) NOEs(45) , slowly exchanging amide protons(29) , and ^3J coupling constants (43) . Fig. 2summarizes the observed NOEs, the relative exchange rates of amide protons and the apparent ^3J coupling constants for Ii-(1-27) at 283 K and pH 7.4 in the presence of 20% methanol. The fact that in the region Gln^4-Leu^14 the NH(i)-NH(i) NOEs are intense, while the alphaCH(i)-NH(i) NOEs are much weaker, implies a generally helical structure(42) . The observation of several unambiguous alphaCH(i)-NH(i), alphaCH(i)-betaCH(i) and a single alphaCH(i)-NH(i) cross-peaks (Fig. 2) supports the presence of a helix.


Figure 2: ^3Jcoupling constants, NOEs, and hydrogen-deuterium exchange rate observed for Ii-(1-27) in 20% methanol, 80% water, 283 K and pH 7.4. Coupling constants are labeled as follows: bullet, parent ^3J < 5.0 Hz; bullet, measured ^3J> 7.0 Hz; circle, 6.0 Hz leq ^3J leq 7.0 Hz. NOE intensities are indicated by the height of the bars, and empty bars refer to effects to the protons of Pro residues. Filled circles also label residues for which slow exchange of amide protons with deuterons was observed.



Further corroborative data come from slowly exchanging amides; except for Gln, all the amide protons in the Gln^4-Leu^14 region are in slow exchange. The slow exchange most likely indicate hydrogen bonding, since it is unlikely that a slowly exchanging proton is buried in the interior of a biomolecule as small as the cytosolic tail of Ii. ^3J < 6 Hz in the Gln^4 to Leu^14 region also supports the presence of a helix(43) . Furthermore, alphaCH(i)-NH(i) cross-peaks, suggestive of a 3 helix(45) , were not detected in the middle region of the peptide and we conclude that the Gln^4-Leu^14 region of Ii-(1-27) forms an alpha helix.

From Pro onward, we observed strong alphaCH(i)-NH(i) NOE connectivities, meaning that the conformation is essentially extended. However, the presence of NH(i)-NH(i), betaCH(i)-NH(i), two alphaCH(i)-NH(i) (Pro-Leu and Met-Gly^18) and a single small alphaCH(i)-NH(i) (Leu^14-Leu) NOE connectivities (Fig. 2) is indicative of local structures and short range order. betaCH(i)-NH(i) connectivities are commonly observed in type I or type III turns(42) , but while type III turns are very similar to type I, the presence of a Pro in the trans isomer is compatible with both type I and type II classes of turns(46) . In principle, they can be distinguished on the basis of ^3J coupling constants at positions 2 and 3 of the turns, and NOE connections(42, 45) . Except for Met and Lys, for which we measured a ^3J > 7 Hz, and Gly and Ser (both with a ^3J < 6 Hz), all the residues of the carboxyl-terminal region showed coupling constants of approximately 6.5 Hz. Furthermore, the amides of Leu, Ala, and Ser are slowly exchanging, thus suggesting the formation of hydrogen bond (45) and the possible formation of consecutive turns in the regions Leu^14 to Leu, Arg to Ala, and Ala to Ser. The NMR data slightly favor a type I and two type II turns, respectively, for the above segments. In fact, a CH-NH NOE connectivity between Pro and Met suggested that the region Leu^14 to Leu forms a type I beta turn, since the CH-NH distance is between 0.19 and 0.35 nm (47) . The presence of an alphaCH(i)-NH(i) NOE between Leu^14 and Leu suggests that the type I turn can actually prolong the helix up to Leu. For Arg to Ala and Ala to Ser, the observation of alphaCH-CH NOE connectivities between Arg and Pro, and Ala and Pro for the trans isomer of both regions, suggests the presence of two type II beta turns.

Methanol is known to induce structure in peptides (see, for example, (48) ). However, at the used ratio of 80% water/20% methanol (v/v), the mixture has a helix-promoting ability slightly higher than that of pure water(48) , indicating that the structure is only stabilized by the presence of methanol. The possibility that the secondary structure arises through aggregation was ruled out by investigating a 10-fold diluted sample of Ii-(1-27). No differences in chemical shift and line width of the NH resonances were observed in one-dimensional spectra, and NOESY experiments confirmed all the connectivities, and thus the structure, described above.

From NMR data in water/methanol, 100 randomly selected starting conformations were generated by means of distance-geometry calculations. The best nine structures, in terms of smallest target function values, were subjected to restrained energy minimization. Before minimization, they fulfilled quite well the whole set of NMR restraints with no violations of the upper bounds of the distance restraints greater than 0.05 nm, and of dihedral angle restraints greater than 5°. The energy of the refined structures were all in the narrow range from -1877 to -2080 kJbulletmol; the maximal distance constraint violation was 0.050 ± 0.001 nm, and the average sum of distance constraint violations was 0.293 ± 0.001 nm. None of the selected structures presented additional short interproton distances not experimentally observed. Fig. 3A shows a superposition of all nine structures for the region covering residues Met^1-Pro, all compatible with NOEs data, since calculations of structures from NOEs is a means to assess possible and favored conformational states and not single structures. The convergence achieved over the well defined alpha helix from Gln^4-Leu^14 was good, with a root mean square deviation for its backbone atoms of 0.090 nm. The region Pro-Glu (Fig. 3B) did not converge to a consensus structure, reflecting the absence of structurally significant NOEs for this region of the polypeptide and the variability in its relative position with respect to the helix. The average root mean square deviation for backbone atoms of residues 16-25 is 0.293 nm. No unique conformation could be determined for the Met^1-Asp^2-Asp^3 and the Ser-Lys segments.


Figure 3: View of the nine calculated NMR structures of Ii-(1-27). The structures are superimposed for minimum root mean square deviation of backbone atoms (N, alphaC, C`, and O) for the region Met^1-Pro (overlaid over residues 4-14) (A) and region from Pro to Lys (overlaid over residues 16-25) (B). For clarity only side chains of the Pro residues are shown.



Effect of Point Mutations on the Two Internalization Signals

To elucidate the requirements for a structural context of the Leu^7-Ile^8 and Met-Leu signals, we performed a set of point mutations on the INA fusion protein changing several residues to alanine (Fig. 4). To monitor internalization from the plasma membrane, the transfected COS cells were incubated with I-labeled antibodies on ice and chased at 310 K. In order to study one signal at a time, the region of the other signal was either deleted or destroyed by mutating Leu^7 or Leu to Ala and the time response for internalization of the bound antibody was measured as described earlier(15) . The time to reach 25% internalization is noted in Fig. 4. Constructs were considered not actively internalized if they did not reach 15% internalization after 10 min, in analogy with INA constructs with both internalization signals deleted(15) .


Figure 4: Cellular distribution and internalization rates of INA and its constructs. Mutations of amino acids into alanine are indicated by A, while dashes refer to conserved residues. Empty space indicates deletion within each construct. The peptide studied by NMR and corresponding to Ii-(1-27) is indicated by a continuous line under the sequence numbers. Localization in vesicular staining or plasma membrane is indicated by V and PM, respectively. For the internalization time, NI means not internalized, indicating that less than 15% of construct molecules internalized after 10-min chase at 310 K. Each value of internalization rate is the mean of at least three independent experiments.



Regarding the Leu^7-Ile^8 signal, point mutation of Arg^5 and Asp^6 to Ala (construct 6, Fig. 4), no change of the internalization rate was detected, whereas both Gln^4 (construct 5) and Asp^3 (construct 4) prevented internalization. Alteration of amino acids at the carboxyl terminus of the signal can be changed to alanine without affecting the internalization signal; mutation of the potentially phosphorylatable Ser^9 (construct 9), and Asn and Asn (construct 10), did not change the rate of internalization in line with earlier studies(15) .

For the second signal, Met-Leu, mutation of Pro to Ala (construct 21) abolished internalization, whereas neither mutation of Gln nor Leu^14 (constructs 19 and 20, respectively) altered the internalization efficiency. The negative residue Glu (construct 15), however, reduced internalization to background level. For all the INA mutations, immunofluorescence studies showed a strong plasma membrane staining for the constructs that were not internalized and vesicular staining (V, localization column in Fig. 4) corresponding to endosomes for the constructs that were actively internalized. When native Ii harboring the identical cytosolic tail mutations as INA was expressed in COS cells, the corresponding Ii construct accumulated on the plasma membrane or in endosomes like the INA molecule (data not shown), verifying that the tetrameric NA is a reporter molecule that reflects the endosomal sorting properties of the trimeric native Ii molecule(15) .

Deletion of the first 11 residues (construct 18) reached 25% internalization in 4.0 min, whereas elimination of the first signal by point mutations (construct 14) reached 25% internalization in 2.0 min. This may indicate that additional amino-terminal residues than those provided in construct 17 modulate the efficiency of the Met-Leu signal. Our biological data thus confirm that the cytosolic tail of Ii comprises two autonomous endosomal sorting signals that function in internalization (15, 16, 39) and in addition point out that a functional Leu-based sorting signal requires specific geometrically neighboring residues.


DISCUSSION

We have applied NMR spectroscopy to study the solution structure of a synthetic peptide corresponding to the cytosolic region of the protein Ii and containing the two internalization signals.

In aqueous solution at pH 7.4 we detected NOEs characteristic of a nascent helix involving the membrane distal motif Leu^7-Ile^8, while the membrane proximal motif Met-Leu is part of a turn. In the presence of 20% methanol, the nascent helix was stabilized. We observed a regular alpha helix in the region Gln^4-Leu^14 and a segment of consecutive beta turns between Leu^14 and Ser, namely a type I (Leu^14-Leu) and two type II (Arg-Ala and Ala-Ser) beta turns. The presence of a small NOE between the alphaCH of Leu^14 and the NH of Leu suggests that the turn is actually part of the helix. Inspection of the calculated backbone and dihedral angles of the segment Leu^14-Pro-Met-Leu shows that except for minor variation in Leu^14, all residues retain values appropriate for an alpha helix. The calculations thus indicate that the type I turn can be accommodated in the helix with minor conformational changes on Leu^14 without the rest of the helical residues being perturbed. The resulting structure is an alpha helix from Gln^4 to Leu with a bend at Pro. Previous calculations on model helical polypeptides containing proline also confirm that it is sufficient to introduce a conformational change of only one residue in order to accommodate proline in a distorted helix(49, 50) . Kinked proline alpha helices with minor conformational changes and minimal disruption of the helix hydrogen bonding have also been observed in crystal structures of proteins(51) .

Comparison of the cytosolic tails of Ii from human, mouse, and rat shows that Pro is conserved(12) , while in the chicken (^2)a Pro is present at site 17. This suggests that the proline conserved in that area might have a definite structural/functional role. It is noteworthy that mutation of Pro to Ala (construct 21, Fig. 4) abolishes internalization. Considering that Ala is the most helix favoring of the 20 commonly occurring amino acids (52, 53) such a substitution is expected to preserve the helix while avoiding the bend. In fact, energy minimization calculations (not shown) on Ala Ii-(1-27) peptide have found a regular helix from Gln^4 to Leu. Accordingly, it is tempting to speculate that a kinked helix is required for internalization of Ii.

Our biological data show that leucine motifs are influenced by residues located at their amino-terminal side as Ser^9, Asn, and Asn can be changed to alanine without affecting the internalization signal. This is in line with other studies showing that the signals work independently from the carboxyl-terminal residues (8, 9, 10) . In addition, amino acids spatially close to the signals are fundamental for their correct functioning and this can be rationalized by referring to the three-dimensional structure of Ii-(1-27). The presence of Ala instead of Gln^4 hampers efficient internalization (Fig. 4). The helical wheel diagram (Fig. 5) indicates that Gln^4 is positioned on the same side of the helix as Leu^7 and Ile^8, so that its mutation alters the surroundings of the signal. Accordingly, if residues pointing away from the signal are changed, the mutation is expected to be irrelevant. In fact, mutation of Arg^5 and Asp^6, which are found on the opposite side of the helix (Fig. 5), does not alter the sorting capabilities of the molecule. Mutation of the negatively charged Asp^3 into Ala also abolishes internalization. This may be related either to the need of negatively charged residue to be located at the amino-terminal side of the signal (see below) and/or to the specific ability of Asp to stabilize alpha helical structures when flanking the amino terminus(54, 55) . The side chain of Ala at site 3 cannot form a hydrogen bond with the free NH groups at the amino terminus of the helix as does Asp residue, resulting in a strong decrease of helicity, which may prevent internalization. An indirect confirmation of the relevance of the second hypothesis would be the finding that Asp^3 becomes part of the helix when the concentration of Ii-(1-27) is increased. (^3)Altogether our results suggest that for a correct functioning, the first signal requires the specific structural context generated by the helix and the amino-terminal residues with a specific spatial arrangement around Leu^7-Ile^8.


Figure 5: Helical wheel representation of the Met^1-Leu^14 region of the Ii-(1-27) peptide. Leu^7 and Ile^8 are boxed.



The segment Leu^14-Pro-Met-Leu takes up a turn and forms the kinked part of the helix. Point mutations reveal that except for Leu^14 (construct 20 in Fig. 4), it is essential to have all the other residues unaltered for a functional second signal. By mutating Pro, Met, and Leu (constructs 21, 22, and 23, respectively), no internalization was observed. It must be noted that substitution of residues within the helix with Ala does not modify the helix, rather it alters the chemical nature of the environment of the signal. In fact, only the substitution Ala for Pro has a structural explanation, since it destabilizes the turn (56) and/or removes the kink from the helix (see above), while both Ala and the native residues Met and Leu at sites 16 and 17 have low preferences for turns(56) . As a partial confirmation to the relevance of the chemical properties of the side chains, we observed reduced internalization to background level through the second signal for the substitution Glu to Ala. This finding points out the necessity of a negatively charged residue on the amino-terminal side of the second signal, suggesting as a putative consensus signal for Ii two hydrophobic residues and the negatively charged amino-terminal residue: Asp^3(Glu)-Xaa-Xaa-Xaa-Leu^7(Met)-Ile^8(Leu). The two signals might work differently at various stages of the intracellular pathway, requiring specific structural contributions. For Ii, and the fusion protein INA, it is not clear whether it is sorted to the endosomal pathway solely via the plasma membrane and/or directly to endosomes from TGN. The most likely interpretation of the accumulated data is that both pathways are functional(2) . Since endosomal localization is not detected when active internalization is destroyed, this would lead to the conclusion that both pathways are affected by the same point mutations. The sorting machinery at TGN and plasma membrane might thus see elements of the same sorting signal or require common structural elements.

The Ii protein assembles as a trimer(57) , and the luminal domain encoded by exon 6 is essential for this process both in vitro(58) and in vivo. (^4)Recently, Arneson and Miller (59) suggested that a single Ii tail is needed for internalization from the plasma membrane, whereas at least two intact tails in the complex are needed for efficient TGN to endosomal targeting. The NMR structure reported above is based on the Ii monomeric form and appears to be biologically relevant although sorting at some stages of the intracellular pathway may require more than one tail. Preliminary NMR results (^5)on more concentrated samples of Ii-(1-27) indicate that the peptide starts to self-associate at a concentration 30-fold higher than that used in the present study and that the overall structure of the monomeric Ii-(1-27) described above is preserved. Further studies are thus required to elucidate whether the tail forms multimers and whether this influences the structure of the single molecule; however, the study presented here is certainly of interest for future modeling of the oligomer.

Similar to our results, NMR data on a 22-residue peptide corresponding to the amino-terminal cytosolic tail of the native low density lipoprotein receptor(4) , containing a Tyr in the internalization signal, indicate a nascent helix upstream of a turn containing a Pro at position 2. Comparing our results with those of Bansal and Gierasch (4) and Wilde et al.(7) on Tyr-containing signals, we suggest that Tyr and Leu signals might share a common structural model. Although the nature of the specific molecules that are able to recognize leucine signal is not known, the structural similarity of the helix suggests that the site of interaction may be similar for the two signals. The sorting signals can be found in both type I and type II membrane proteins and can satisfactorily be transplanted from one molecule to another (for reviews, see Refs. 1, 6, and 8), suggesting that they are possibly recognized by cytosolic proteins that are not membrane bound or otherwise have a distinct direction with regard to the plasma membrane.


FOOTNOTES

*
This work was supported by grants from the Italian National Research Council (to A. M.), the Norwegian Research Council (to O. B. and B. B.), and the Norwegian Cancer Society (to O. B.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore by hereby marked ``advertisement'' in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

§
To whom correspondence should be addressed (to one of these authors): Dr. A. Motta, Istituto per la Chimica di Molecole di Interesse Biologico del CNR, I-80072 Arco Felice (Napoli), Italy (Tel.: 39-81-8534-218/262; Fax: 39-81-8041-770; amot@nmr.icmib.na.cnr.it) or Dr. O. Bakke, Division of Molecular Cell Biology, Dept. of Biology, University of Oslo, N-0316 Oslo, Norway (Fax: 47-22-854-605; obakke@bio.uio.no).

(^1)
The abbreviations used are: TGN, trans-Golgi network; Ii, invariant chain; INA, chimeric protein obtained by fusing neuraminidase (NA) to the cytosolic tail of Ii; DQF-COSY, double-quantum-filtered correlated spectroscopy; NOE, nuclear Overhauser enhancement; NOESY, two-dimensional NOE spectroscopy; TOCSY, total correlation spectroscopy; DMEM-FCS, Dulbecco's modified Eagle's medium supplemented with 10% fetal calf serum.

(^2)
S. Ness, personal communication.

(^3)
A. Motta, B. Bremnes, M. A. Castiglione Morelli, R. W. Frank, G. Saviano, and O. Bakke, unpublished data.

(^4)
M. Gedde-Dahl, I. Freiswinkel, M. Stascheschewski, K. Schenck, N. Koch, and O. Bakke, submitted for publication.

(^5)
A. Motta, M. A. Castiglione Morelli, B. Bremnes, and O. Bakke, manuscript in preparation.


ACKNOWLEDGEMENTS

We are grateful to Prof. Bernhard Dobberstein (Zentrum für Molekulare Biologie, Universität, Heidelberg) for helpful discussion and Antonio Maiello (Consiglio Nazionale delle Ricerche, Arco Felice) for technical assistance.


REFERENCES

  1. Trowbridge, I. S., Collawn, J. F., and Hopkins, C. R. (1993) Annu. Rev. Cell Biol. 9, 129-161 [CrossRef]
  2. Sandoval, I. V., and Bakke, O. (1994) Trends Cell Biol. 4, 292-297 [CrossRef]
  3. Collawn, J. F., Stangel, M., Kuhn, L. A., Esekogwu, V., Jing, S., Trowbridge, I. S., and Tainer, J. A. (1990) Cell 63, 1061-1072 [Medline] [Order article via Infotrieve]
  4. Bansal, A., and Gierasch, L. M. (1991) Cell 67, 1195-1201 [Medline] [Order article via Infotrieve]
  5. Eberle, W., Sanders, C., Klaus, W., Schmidt, B., von Figura, K., and Peters, C. (1991) Cell 67, 1203-1209 [Medline] [Order article via Infotrieve]
  6. Vaux, D. (1992) Trends Cell Biol. 2, 189-192 [CrossRef][Medline] [Order article via Infotrieve]
  7. Wilde, A., Dempsey, C., and Banting, G. (1994) J. Biol. Chem. 269, 7131-7136 [Abstract/Free Full Text]
  8. Sandoval, I. V., Arredondo, J. J., Alcalde, J., Noriega, A. G., Vandekerckhove, J., Jimenez, M. A., and Rico, M. (1994) J. Biol. Chem. 269, 6622-6631 [Abstract/Free Full Text]
  9. Ogata, S., and Fukuda, M. J. (1994) J. Biol. Chem. 269, 5210-5217 [Abstract/Free Full Text]
  10. Letourneur, F., and Klausner, R. D. (1992) Cell 69, 1143-1157 [Medline] [Order article via Infotrieve]
  11. Johnson, K. F., and Kornfeld, S. (1992) J. Cell Biol. 119, 249-257 [Abstract]
  12. Bakke, O., and Dobberstein, B. (1990) Cell 63, 707-716 [Medline] [Order article via Infotrieve]
  13. Lotteau, V., Teyton, L., Peleraux, A., Nilsson, T., Karlsson, L., Schmid, S. L., Quaranta, V., and Peterson, P. A. (1990) Nature 348, 600-605 [CrossRef][Medline] [Order article via Infotrieve]
  14. Simonsen, A., Momburg, F., Drexler, J., Hämmerling, G., and Bakke, O. (1993) Int. Immunol. 5, 903-917 [Abstract]
  15. Bremnes, B., Madsen, T., Gedde-Dahl, M., and Bakke, O. (1994) J. Cell Sci. 107, 2021-2032 [Abstract/Free Full Text]
  16. Odorizzi, C. G., Trowbridge, I. S., Xue, L., Hopkins, C. R., Davis, C. D., and Collawn, J. F. (1994) J. Cell Biol. 126, 317-330 [Abstract]
  17. Roche, P. A., Teletski, C. L., Stang, E., Bakke, O., and Long, E. O. (1993) Proc. Natl. Acad. Sci. U. S. A. 84, 2688-2692
  18. Frank, R., and Gausepohl, H. (1988) in Modern Methods in Protein Chemistry (Tschesche, H., ed) Vol. 3, pp. 41-60, de Gruyter & Co., Berlin
  19. Carpino, L. A., and Han, G. Y. (1972) J. Org. Chem. 37, 3404-3409
  20. Coste, J., Le-Nguyen, D., and Castro, B. (1990) Tetrahedron Lett. 31, 205-209 [CrossRef]
  21. Piantini, U., Sørensen, O. W., and Ernst, R. R. (1982) J. Am. Chem. Soc. 104, 6800-6801
  22. Jeener, J., Meier, B. H., Bachmann, P., and Ernst, R. R. (1979) J. Chem. Phys. 71, 4546-4553 [CrossRef]
  23. Braunschweiler, L., and Ernst, R. R. (1983) J. Magn. Reson. 53, 521-528
  24. Drobny, G., Pines, A., Sinton, S., Weitkamp, D., and Wemmer, D. (1979) Faraday Div. Chem. Soc. Symp. 13, 49-55
  25. Griesinger, C., Otting, G., Wüthrich, K., and Ernst, R. R. (1988) J. Am. Chem. Soc. 110, 7870-7872
  26. Bax, A., and Davis, D. G. (1985) J. Magn. Reson. 65, 355-366
  27. Zuiderweg, E. R. P., Hallenga, K., and Olejniczak, E. T. (1986) J. Magn. Reson. 70, 336-343
  28. Neuhaus, D., Wagner, G., Vasák, M., Kägi, J. H. R., and Wüthrich, K. (1985) Eur. J. Biochem. 151, 257-273 [Abstract]
  29. W ü thrich, K. (1986) NMR of Proteins and Nucleic Acids , J. Wiley and Sons, New York
  30. Güntert, P., Braun, W., Billeter, M., and Wüthrich, K. (1989) J. Am. Chem. Soc. 111, 3997-4004
  31. Wüthrich, K., Billeter, M., and Braun, W. (1983) J. Mol. Biol. 169, 949-961 [Medline] [Order article via Infotrieve]
  32. Wagner, G., Braun, W., Havel, T. F., Schaumann, T., Go, N., and Wüthrich, K. (1987) J. Mol. Biol. 196, 611-639 [Medline] [Order article via Infotrieve]
  33. Güntert, P., Braun, W., and Wüthrich, K. (1991) J. Mol. Biol. 217, 517-530 [Medline] [Order article via Infotrieve]
  34. van Gunsteren, W. F., and Berendsen, H. J. C. (1987) Groningen Molecular Simulation (GROMOS) Library Manual , pp. 1-229, BIOMOS, Groningen, The Netherlands
  35. van Gunsteren, W. F., and Berendsen, H. J. C. (1977) Mol. Phys. 34, 1311-1327
  36. Ryckaert, J. P., Ciccotti, G., and Berendsen, H. J. C. (1977) J. Comp. Phys. 23, 327-341
  37. Huylebroeck, D., Maertens, G., Verhoeyen, M., Lopez, C., Raeymakers, A., Min Jou, W., and Fiers, W. (1988) Gene (Amst.) 66, 163-181 [CrossRef][Medline] [Order article via Infotrieve]
  38. van Rompuy, L., Min Jou, W., Huylebroeck, D., and Fiers, W. (1982) J. Mol. Biol. 161, 1-11 [Medline] [Order article via Infotrieve]
  39. Pieters, J., Bakke, O., and Dobberstein, B. (1993) J. Cell Sci. 106, 831-846 [Abstract/Free Full Text]
  40. Kunkel, T. A. (1985) Proc. Natl. Acad. Sci. U. S. A. 82, 488-492 [Abstract]
  41. Gluzman, Y. (1981) Cell 23, 175-182 [Medline] [Order article via Infotrieve]
  42. Wüthrich, K., Billeter, M., and Braun, W. (1984) J. Mol. Biol. 180, 715-740 [Medline] [Order article via Infotrieve]
  43. Pardi, A., Billeter, M., and Wüthrich, K. (1984) J. Mol. Biol. 180, 741-751 [Medline] [Order article via Infotrieve]
  44. Dyson, H. J., Rance, M., Houghten, R. A., Wright, P. E., and Lerner, R. A. (1988) J. Mol. Biol. 201, 201-217 [Medline] [Order article via Infotrieve]
  45. Wagner, G., Neuhaus, D., Wörgötter, E., Vasák, M., Kägi, J. R. H., and Wüthrich, K. (1986) J. Mol. Biol. 187, 131-135 [Medline] [Order article via Infotrieve]
  46. Richardson, J. S. (1981) Adv. Protein Chem. 34, 167-339 [Medline] [Order article via Infotrieve]
  47. Dyson, H. J., Rance, M., Houghten, R. A., Lerner, R. A., and Wright, P. E. (1988) J. Mol. Biol. 201, 161-200 [Medline] [Order article via Infotrieve]
  48. Brooks, C. L., III, and Nilsson, L. (1993) J. Am. Chem. Soc. 115, 11034-11035
  49. Piela, L., Némethy, G., and Scheraga, H. A. (1987) Biopolymers 26, 1587-1600 [Medline] [Order article via Infotrieve]
  50. Polinsky, A., Goodman, M., Williams, K. A., and Deber, C. M. (1992) Biopolymers 32, 399-406 [Medline] [Order article via Infotrieve]
  51. Barlow, D. J., and Thornton, J. M. (1988) J. Mol. Biol. 201, 601-619 [Medline] [Order article via Infotrieve]
  52. Padmanabhan, S., Marqusee, S., Ridgeway,, Laue, T. M., and Baldwin, R. L. (1990) Nature 344, 268-270 [CrossRef][Medline] [Order article via Infotrieve]
  53. O'Neil, K. T., and DeGrado, W. F. (1990) Science 250, 646-651 [Medline] [Order article via Infotrieve]
  54. Presta, L. G., and Rose, G. D. (1988) Science 240, 1632-1641 [Medline] [Order article via Infotrieve]
  55. Forood, B., Feliciano, E. J., and Nambiar, K. P. (1993) Proc. Natl. Acad. Sci. U. S. A. 90, 838-842 [Abstract]
  56. Wilmot, C. M., and Thornton, J. M. (1988) J. Mol. Biol. 203, 221-232 [Medline] [Order article via Infotrieve]
  57. Marks, M. S., Blum, J. S., and Cresswell, P. (1990) J. Cell Biol. 111, 839-855 [Abstract]
  58. Bijlmakers, M. E., Benaroch, P., and Ploegh, H. L. (1994) J. Exp. Med. 180, 623-629 [Abstract]
  59. Arneson, L. S., and Miller, J. (1995) J. Cell Biol. 129, 1217-1228 [Abstract]

©1995 by The American Society for Biochemistry and Molecular Biology, Inc.