Substrate Specificity of the Hepatitis C Virus Serine Protease NS3*

(Received for publication, November 11, 1995, and in revised form, January 19, 1997)

Andrea Urbani , Elisabetta Bianchi , Frank Narjes , Anna Tramontano , Raffaele De Francesco , Christian Steinkühler Dagger and Antonello Pessi

From the Istituto di Ricerche di Biologia Molecolare (IRBM) P. Angeletti, Pomezia, Rome, Italy

ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
FOOTNOTES
ACKNOWLEDGEMENTS
REFERENCES


ABSTRACT

The substrate specificity of a purified protein encompassing the hepatitis C virus NS3 serine protease domain was investigated by introducing systematic modifications, including non-natural amino acids, into substrate peptides derived from the NS4A/NS4B cleavage site. Kinetic parameters were determined in the absence and presence of a peptide mimicking the protease co-factor NS4A (Pep4A). Based on this study we draw the following conclusions: (i) the NS3 protease domain has an absolute requirement for a small residue in the P1 position of substrates, thereby confirming previous modelling predictions. (ii) Optimization of the P1 binding site occupancy primarily influences transition state binding, whereas the occupancy of distal binding sites is a determinant for both ground state and transition state binding. (iii) Optimized contacts at distal binding sites may contribute synergistically to cleavage efficiency.


INTRODUCTION

The N-terminal third of the hepatitis C virus (HCV)1 NS3 protein contains a trypsin-like serine protease that accomplishes four out of the five processing events that take place during maturation of the nonstructural portion of the HCV polyprotein, performing cleavages at the NS3-NS4A, NS4A-NS4B, NS4B-NS5A, and NS5A-NS5B junctions (1-6). It has been shown that cleavage between NS3 and NS4A is an intramolecular event, whereas all remaining junctions are processed in trans.

In vivo, NS3 appears to form a heterodimer whereby the protease domain associates with the viral protein NS4A. The latter is a 54-residue protein that has been shown to bind to the N-terminal region of the protease via a central hydrophobic domain spanning residues 21-34 (7-13). NS4A acts as a co-factor of the protease enhancing cleavage at all sites and being an absolute requirement for processing of the NS4B/NS5A junction ex vivo (7). Several studies have shown that a peptide encompassing the central hydrophobic domain of NS4A is sufficient for eliciting activation of the protease (12, 14-17).

Serine proteases contact the P1 residue of their substrates through characteristic specificity pockets. The residues flanking the specificity pocket are important determinants of substrate recognition. Homology modelling of the S1 specificity pocket of the NS3 protease has predicted the presence of a phenylalanine as a prominent feature, thus rendering the pocket rather small and hydrophobic (18). These characteristics have led to the prediction of the preference for small, hydrophobic residues, ideally cysteine residues, in the P1 position of NS3 substrates. Radiosequencing of the single cleavage products has subsequently confirmed these predictions, yielding the consensus sequence (D/E)XXXXC(A/S) for all trans cleavage sites, with X being any amino acid and the scissile bond being located between Cys and Ala or Ser (2, 18). The homology model has been used to successfully redesign the enzyme's specificity, thereby increasing its validity. Very recentyl the three-dimensional structure of the protease has been solved by two different groups (20, 21), confirming the presence of a phenylalanine ring pointing into the pocket. The sequences of the four NS3 cleavage sites are listed in Table I, indicating that the intramolecular cleavage site between NS3 and NS4A differs from the consensus having a Thr in the P1 position.

Table I.

Sequences of NS3 cleavage sites

P6-P4' residues of the HCV Bk strain polyprotein cleavage sites. P1 and P1' residues are shown in boldface type.


Cleavage site Sequence

NS3/NS4A DLEVVT STWV
NS4A/NS4B DEMEEC ASHL
NS4B/NS5A DCSTPC SGSW
NS5A/NS5B EDVVCC SMSY

Substrate specificity of the NS3 protease has been investigated by several groups in a qualitative way using transient transfection (22, 23), in vitro translation (24), or intracellular processing of fusion proteins in Escherichia coli (25). Availability of quantitative data using peptidic substrates has been so far hampered by difficulties in expressing and purifying sufficient amounts of enzymatically active recombinant NS3 protease. We (17, 26) and others (15, 20, 21, 27-33) have recently described efficient heterologous expression and purification of the enzyme and were able to define optimized conditions for the determination of protease activity (26).

In the present work we investigate the substrate specificity of the NS3 protease domain by introducing systematic modifications in a peptide substrate derived from the sequence of the NS4A/NS4B junction. Activity on modified substrates has been determined both in the presence and absence of the NS4A co-factor. The results are discussed in the light of our previous homology model of the S1 pocket of the enzyme.


MATERIALS AND METHODS

Enzyme Preparation

The protease domain of the HCV Bk strain NS3 protein encompassing residues 1027-1206 of the viral polyprotein was purified from E. coli as described previously (26). The enzyme was homogeneous as judged from silver-stained SDS-polyacrylamide gel electrophoresis and >95% pure as judged from reversed phase HPLC performed using a 4.6 × 250-mm Vydac C4 column. The enzyme preparations were routinely checked by mass spectrometry done on HPLC purified samples using a Perkin Elmer API 100 instrument and N-terminal sequence analysis carried out using Edman degradation on an Applied Biosystems model 470A gas-phase sequencer. Both techniques indicated that in more than 90% of the enzyme molecules the N-terminal methionine and alanine had been removed, yielding an enzyme starting with Pro2. Enzyme stocks were made 50% in glycerol content, quantified by amino acid analysis or by determining their absorbance at 280 nm (17), shock-frozen in liquid nitrogen, and kept in aliquots at -80 °C until use. Control experiments had shown that this freezing procedure does not affect specific activity of the enzyme.

Peptides and HPLC Assays

Peptide synthesis was performed on a NovaSyn Gem flow synthesizer by Fmoc/t-Bu chemistry. Protecting groups were as follows: Nalfa(Fmoc), Asp(Ot-Bu), Glu(Ot-Bu), Tyr(t-Bu), Ser(t-Bu), Thr(t-Bu), His(Trt), Cys(Trt), Gln(Trt), and Trp(Boc). All amino acids were activated by benzotriazole-1-yl-oxy-tris-pyrrolidino-phosphonium hexafluorophosphate, N-hydroxybenzotriazole, and diisopropylethylamine. All peptides were assembled on a NovaSyn PR 500 resin, yielding peptide amides upon cleavage with 88% trifluoroacetic acid, 5% phenol, 2% triisopropylsilane, and 5% water.

Crude peptides were purified by reversed phase-HPLC on a Nucleosyl C18, 250 × 21 mm, 100 Å, 7 µm, using H2O, 0.1% trifluoroacetic acid and acetonitrile, 0.1% trifluoroacetic acid as eluents. Analytical HPLC was performed on a Ultrasphere C18, 250 × 4.6 mm, 80 Å, 5 µm (Beckman). Purified peptides were characterized by mass spectrometry and amino acid analysis.

Concentration of stock solutions of peptides, prepared in Me2SO and kept at -80 °C until use, was determined by quantitative amino acid analysis performed on HCl-hydrolyzed samples.

Cleavage assays were performed in 27 µl of 50 mM Tris, pH 7.5, 2% CHAPS, 50% glycerol, 10 mM dithiothreitol, to which 3 µl of substrate peptide in Me2SO, leading to 10% final Me2SO concentration, were added. Enzyme concentrations (50 nM to 6 µM) and incubation times were chosen to obtain <20% substrate conversion. Where indicated, Pep4A having the sequence GSVVIVGRIILSGR(NH2) was added at a final concentration of 3 µM. Reactions were stopped by addition of 70 µl of 0.1% trifluoroacetic acid. Cleavage of peptide substrates was determined by HPLC using a Merck-Hitachi chromatograph equipped with an autosampler. 90-µl samples were injected on a Lichrospher C18 reversed phase cartridge column (4 × 125 mm, 5 µm, Merck) or on a Beckman Ultrasphere ODS column (4.6 × 250 mm) and fragments were separated using a 3-100% acetonitrile gradient at 2%/min. Peak detection was accomplished by monitoring both the absorbance at 220 nm and fluorescence (lambda ex = 260 nm, lambda em = 305 nm).

Cleavage products were quantitated by integration of chromatograms with respect to appropriate standards. Initial rates of cleavage were determined on samples having <20% substrate conversion. Kinetic parameters were calculated from least-squares fit of initial rates as a function of substrate concentration with the help of a Kaleidagraph software, assuming Michaelis-Menten kinetics. Where individual determination of kcat and Km was not possible kcat/Km values were calculated from the slope of the linear part of the Michaelis-Menten plot at substrate concentrations Km. All experiments were repeated at least three times.

To determine Ki values, initial velocities were determined as a function of substrate concentration in the presence of three fixed competing inhibitor concentrations. Ki values were calculated from least-squares fit using a modified Michaelis-Menten equation (Equation 1),
V=V<SUB><UP>max</UP></SUB> <UP>S</UP>/(K<SUB>m</SUB>(1+I/K<SUB>i</SUB>)+<UP>S</UP>) (Eq. 1)
Alternatively, Ki values were calculated from IC50 values according to Equation 2,
K<SUB>i</SUB>=<UP>IC</UP><SUB>50</SUB>/(1+<UP>S</UP>/K<SUB>m</SUB>) (Eq. 2)

Estimation of Active Site Concentration

To determine the number of active molecules in the enzyme preparation 100 µM of the fluorogenic ester substrate Ac-DED(Edans)EEAbuPsi [COO]ASK(Dabcyl)-NH2 (34) were added to 100 nM enzyme solution added from a 5 µM stock, previously quantitated by amino acid analysis. 50-µl samples were withdrawn at timed intervals and immediately quenched by addition of 50 µl of 1% trifluoroacetic acid. A total of 48 data points were collected within 3 min of reaction. The pre-steady state burst expected to equal the concentration of active sites in the enzyme preparation was determined by extrapolation of product formation to zero time.


RESULTS

Estimation of Active Site Concentration

We first addressed the question of whether the activity of our protease preparation was attributable to a fully active enzyme or only to a fraction of enzymatically active protease molecules. To this purpose we used the fact that serine proteases follow a two-step mechanism: acylation of the active site serine residue with concomitant release of the C-terminal fragment of the substrate is followed by deacylation and the release of the N-terminal fragment. The former acylation reaction is usually the rate-limiting step for amide bond scission, whereas for ester substrates the acylation reaction is usually fast as compared with deacylation. Thus, for ester substrates the pre-steady-state release of the C-terminal leaving group creates an initial burst that equals the number of active sites encountered by the substrate (35).

To determine the number of active sites in our enzyme preparation 100 µM fluorogenic ester substrate was added to 100 nM enzyme, the reaction was stopped at timed intervals and samples were analyzed by HPLC (Fig. 1). Extrapolation of the linear phase of the reaction indicated a burst of 94 ± 10 nM demonstrating that 94% of the protease molecules in our preparation were enzymatically active.


Fig. 1. Estimation of active site concentration. To 100 nM protease in 50% glycerol, 2% CHAPS, 10 mM dithiothreitol, 50 mM Tris, pH 7.5, 100 µM fluorogenic ester substrate Ac-DED(Edans)EEAbuPsi [COO]ASK(Dabcyl)-NH2 was added. The reaction was stopped at timed intervals by addition of 1% trifluoroacetic acid. The amount of cleavage product was quantified by HPLC. Extrapolation to zero time gave 94 ± 10 nM active sites. Control experiments indicated no detectable spontaneous hydrolysis of the ester bond to occur within the incubation time of the experiment.
[View Larger Version of this Image (14K GIF file)]


Substrate Specificity

We investigated the substrate specificity of the NS3 protease by introducing several modifications into a decamer peptide whose sequence was based on the NS4A-NS4B junction. The choice for this sequence was determined by the relative ease of synthesis (the other two NS3 trans-cleavage sites have two cysteine residues that are prone to oxidation), by the fact that the corresponding junction is cleaved with a relatively high efficiency in the context of the polyprotein, and taking into account that previous mutagenesis studies have shown that this junction is intermediate with respect to its sensitivity to mutations (22). All experiments were done in buffer containing 50% glycerol and 2% CHAPS. These conditions have been worked out to yield optimum activity (26). However, glycerol might exert differential effects on the binding of polar and non-polar substrates. Since we found that glycerol activation is a very complex phenomenon, affecting both Km and kcat values of substrates, and in addition having a dramatic effect on the dissociation constant of the NS3-Pep4A complex, no attempts were made to study the influence of this agent on the interaction of the enzyme with different substrates. Our data therfore have to be interpreted with the caveat that they have been obtained in the presence of this not entirely passive solvent.

P1 Substitutions

The consensus sequence of the NS3-dependent junctions within the HCV polyprotein points to conservation of negatively charged residues in the P6 position, a cysteine or a threonine residue in the P1, and an alanine or a serine residue in the P1' positions (Table I). In an attempt to better define the P1 specificity of the enzyme a series of cysteine substitutes, with a special focus on both natural as well as non-natural small hydrophobic residues, were introduced in the P1 position using a decamer NS4A-NS4B substrate spanning from P6 to P4'. This length has been shown by earlier work to be the optimum compromise between number of residues, cleavage efficiency, and ease of detectability by HPLC (26). Most P1 substitutions resulted in uncleaved substrates (Table II). We have estimated that our cleavage detection limit was in the order of kcat/Km = 0.05 M-1 s-1. Among the natural amino acids only threonine was accepted in P1 in the absence of Pep4A while addition of the co-factor also allowed cleavage, albeit at barely detectable levels, of a substrate having a valine residue in P1. The best cysteine substitutes turned out to be homocysteine, having the side chain length increased by 1 -CH2- unit with respect to cysteine, and allylglycine, in which the sulfhydryl group of cysteine is replaced by an ethenyl group. Still, these P1 residues decreased cleavage efficiency by 5-10-fold, both in the absence and presence of Pep4A. Replacement of the SH group by an amino or a hydroxyl group (in diaminopropionic acid and serine, respectively) abolished cleavage, as did incorporation of the SH group in a thiophene ring or its carboxymethylation. Replacement of the cysteine sulfhydryl by a methyl group in aminobutyric acid was compatible with cleavage of the resulting decamer, although at the expense of a 15- (+Pep4A) to 50-fold (-Pep4A) decrease in the respective kcat/Km values. Increasing the length of the side chain by incorporating norvaline in the P1 position resulted in a peptide that was cleaved only in the presence of Pep4A.

Table II.

Kinetic parameters of cleavage of P1-modified decamer peptides corresponding to the NS4A/NS4B cleavage site DEMEEC ASHL

The following abbreviations were used: Cacm, S-acetamidomethyl cysteine; Dapa, diaminopropionic acid; hCys, homocysteine; Hyp, hydroxyproline; NVal, norvaline; Tha, thiazolidino-alanine. Data are mean ± S.D. of at least three different determinations.


P1 residue NS3
NS3 + Pep4A
Km kcat kcat/Km Km kcat kcat/Km

µM min-1 M-1 s-1 µM min-1 M-1s-1
Cys 105  ± 20 0.26  ± 0.03 47.6 42.8  ± 4.1 1.40  ± 0.07 545
hCys 312  ± 59 0.10  ± 0.01 5.30 54.1  ± 12.8 0.31  ± 0.01 95.5
Alg 205  ± 29 0.08  ± 0.01 6.50 140  ± 16 0.53  ± 0.02 63.1
Abu 288  ± 30 0.03  ± 0.01 1.74 96.7  ± 28.5 0.17  ± 0.01 29.6
Thr 244  ± 55 0.005  ± 0.001 0.34 235  ± 37 0.21  ± 0.01 15.1
NVal NDa ND ND 156  ± 27 0.15  ± 0.01 16.2
Val ND a ND ND 153  ± 54 0.009  ± 0.001 0.93
Ala Not cleaved
Pro Not cleaved
Phe Not cleaved
Ser Not cleaved
Hyp Not cleaved
Aib Not cleaved
Gly Not cleaved
Cacm Not cleaved
Leu Not cleaved
Dapa Not cleaved
Tha Not cleaved

a ND, not determined (kcat/Km < 0.05 M-1 s-1).

To determine whether inability of the protease to cleave substrates with certain P1 substituents was due to poor ground state binding or to their inability to proceed through the transition state we determined the Ki values of some selected peptides using the decamer peptide with cysteine in P1 as substrate. For amide substrates of serine proteases the relationship Ki Km ~ Kd usually holds, indicating that Ki values are very good approximations of the true dissociation constants of the enzyme substrate complex (36, 37). We verified this experimentally by determining both for the substrate having Abu as P1 residue and obtained (in the presence of Pep4A): Km = 97 ± 28 µM and Ki = 133 ± 15 µM. Next, we determined the Ki values of decamer peptides having alanine, proline, phenylalanine, or serine in P1, and that were therefore not cleaved (Table III). Interestingly, the Ki values differ only by a factor of 2-8 from the Km value determined for the wild type substrate, indicating that the P1 residue makes relatively minor contributions to ground state binding. This is true even for a bulky substituent such as phenylalanine.

Table III.

Ki values of P1-modified decamer peptides derived from the NS4A/NS4B cleavage site DEMEEC ASHL

Data are mean ± S.D. from three different determinations.


P1 residue NS3 Ki NS3 + Pep4A Ki

µM µM
Ala 323  ± 25 384  ± 23
Pro 217  ± 42 197  ± 21
Phe 202  ± 27 200  ± 22
Ser 91  ± 5 22  ± 2

P6 and P1' Substitutions

To investigate the role of the side chain length of the P6 residue we substituted the aspartic acid residue present in the NS4A-NS4B sequence by a glutamic acid. This substitution, although affecting slightly kcat and Km individually, had no significant effect on the overall cleavage efficiency (Table IV). Neutralization of the negative charge by introduction of an asparagine residue decreased the cleavage efficiency by a factor of 5. This effect was attributable mainly to an increase in Km. When the charge in the P6 position was inverted through introduction of a lysine residue, a pronounced decrease in kcat/Km was observed, which was again attributable to an impairment in ground state binding of the resulting substrate, as judged from the increase of the respective Km values. All these effects were less pronounced in the presence of the Pep4A cofactor.

Table IV.

Kinetic parameters of cleavage of P6 and P1'-modified decamer peptides corresponding to the NS4A/NS4B cleavage site

Residues in boldface type indicate modifications with respect to the wild type sequence. Data are mean ± S.D. of at least three different determinations.


Peptide NS3
NS3 + Pep4A
Km kcat kcat/Km Km kcat kcat/Km

µM min-1 M-1 s-1 µM min-1 M-1 s-1
DEMEECASHL 105  ± 20 0.26  ± 0.03 47.6 42.8  ± 4.1 1.40  ± 0.07 545
EEMEECASHL 254  ± 44 0.64  ± 0.04 42.0 166  ± 15 2.89  ± 0.20 289
NEMEECASHL 630  ± 62 0.37  ± 0.25 9.80 188  ± 44 2.90  ± 0.46 257
KEMEECASHL 1050  ± 430 0.26  ± 0.10 4.10 449  ± 91 2.68  ± 0.02 99.5
DEMEECSSHL 273  ± 21 0.32  ± 0.01 19.5 246  ± 54 1.66  ± 0.22 112
DEMEECFSHL ND ND 0.41 156  ± 54 0.20  ± 0.02 21.3

 ND, not determined.

Substitution of the P1' alanine residue by a serine residue, which is found in this position in other substrates, moderately (2-5-fold) decreased kcat/Km (Table IV). Conversely, introduction of a phenylalanine residue in P1' decreased kcat/Km by 2 orders of magnitude in the absence of the co-factor and 25-fold in its presence.

Substrate Alanine Scanning

To investigate the relative contribution to cleavage efficiency of positions other than P1, P6, and P1' we performed an alanine scanning experiment. The experiment was repeated also in the presence of saturating amounts of Pep4A. Table V summarizes the results. Only the substitution of the P1 cysteine resulted in complete abolishment of cleavage both in the presence and absence of the co-factor. Introduction of alanine residues in other positions had only slight effects on cleavage efficiency. The largest effect (a 7-fold decrease in efficiency) was observed for the P3 position in the absence of Pep4A.

Table V.

Substrate alanine scanning

Residues in boldface type indicate modifications with respect to the wild type sequence. Data are mean ± S.D. of at least three different determinations.


Peptide NS3
NS3 + Pep4A
Km kcat kcat/Km Km kcat kcat/Km

µM min-1 M-1 s-1 µM min-1 M-1 s-1
DEMEECASHL 105  ± 20 0.26  ± 0.03 47.6 42.8  ± 4.1 1.40  ± 0.07 545
AEMEECASHL 125  ± 28 0.10  ± 0.02 13.3 55.0  ± 11 0.90  ± 0.06 271
DAMEECASHL 569  ± 150 0.86  ± 0.46 25.2 143  ± 27 2.73  ± 0.84 317
DEAEECASHL 409  ± 24 0.30  ± 0.02 12.2 124  ± 22 2.13  ± 0.30 286
DEMAECASHL 790  ± 205 0.33  ± 0.06 7.0 190  ± 35 2.81  ± 0.66 246
DEMEACASHL 171  ± 23 0.26  ± 0.01 22.0 148  ± 44 2.59  ± 0.60 292
DEMEEAASHL Not cleaved Not cleaved
DEMEECAAHL 191  ± 11 0.67  ± 0.04 58.4 144  ± 5.6 3.08  ± 0.15 356
DEMEECASAL 231  ± 39 0.85  ± 0.16 61.5 130  ± 15 3.61  ± 0.96 462
DEMEECASHA 260  ± 37 0.58  ± 0.12 37.1 219  ± 10 3.00  ± 0.94 226

Inverse Alanine Scanning of a "Minimalist" Substrate

The picture that emerges from our data confirms the importance of the consensus P1 and P1' residues, and to a lower extent also of the P6 residue, in determining cleavage efficiency of a given substrate. Still, there are remarkable differences in the kinetic behavior of the single cis cleavage sites which, nevertheless contain the same consensus P6, P1, and P1' residues (22, 23). We have recently shown that these differences can be reproduced using decamer peptide substrates (26). Thus, there must exist additional determinants. Failure to detect them in the above mentioned alanine scanning experiment might indicate that it is the sum of several minor contributions that modulates the recognition of substrates containing the consensus residues in P6, P1, and P1'. To start to address this issue we synthesized a polyalanine peptide containing the consensus P6, P1, and P1' residues. Based on the results of the alanine scanning, pointing to the P3 residue as important contribution to cleavage efficiency we decided also to fix a glutamic acid in this postion. We extended the peptide to the P6' residue, which being a tyrosine facilitated HPLC detection of cleavage products via monitoring of tyrosine fluorescence. We further introduced a lysine residue in P7'.

The parent peptide Ac-DAAEACAAAAPYK was cleaved, but cleavage was slowed down 70-85-fold with respect to the wild type sequence. Inspection of the sequences of the natural cleavage sites reveals that a negatively charged residue is conserved in the P5 position of two out of four cleavage sites. Re-introduction of the wild type aspartate in our minimalist substrate indeed resulted in a modest (~2-fold) enhancement of cleavage efficiency (Table VI).

Table VI.

Inverse substrate alanine scanning

Residues in boldface type indicate modifications with respect to the minimalist sequence DAAEACAAAAPYK. Data are mean ± S.D. of at least three different determinations.


Peptide NS3
NS3 + Pep4A
Km kcat kcat/Km Km kcat kcat/Km

µM min-1 M-1 s-1 µM min-1 M-1 s-1
DEMEECASHLPYK 80.1  ± 10.0 0.5  ± 0.1 104 40.5  ± 5.2 3.5  ± 0.5 1460
DAAEACAAAAPYK 1000  ± 216 0.07  ± 0.01 1.2 330  ± 119 0.42  ± 0.08 21
DEAEACAAAAPYK 490  ± 85 0.08  ± 0.01 2.7 130  ± 76 0.38  ± 0.06 48
DAAEACAAAYPYK ND ND 0.8 ND ND 14
DAAEACAAALPYK ND ND 2.3 ND ND 56
DEAEACAAALPYK 249  ± 26 0.34  ± 0.01 22.7 42.6  ± 10 1.26  ± 0.07 492

 ND, not determined.

In the P4' position of natural substrates there appears to be a preference for hydrophobic residues (Table I). As a matter of fact, Leu, Tyr, or Trp residues are found in this position. Introduction of Tyr into the P4' position of the minimalist substrate had no detectable effect, whereas Leu modestly increased cleavage rates. Both P4' substituted peptides were very insoluble, thus not permitting individual determinations of kcat and Km. When both P3 Glu and P4' Leu were re-introduced in the sequence a 20-23-fold enhancement of cleavage efficiency was observed, yielding a substrate that was cleaved with 22-34% efficiency with respect to the wild type sequence.


DISCUSSION

The homology model of the specificity pocket of the NS3 protease predicts that both its shape and its physico-chemical environment be primarily determined by the presence of phenylalanine 213 (according to the chymotrypsin numbering). Furthermore, the pocket was predicted to be very hydrophobic and closed by the aromatic ring of the phenylalanine. While this article was in preparation the crystal structure of NS3 protease has been solved independently by two groups (20, 21). In the published structures the side chain of Phe213 is indeed pointing inside the S1 pocket, thereby confirming the model. As the sulfhydryl group of cysteine has been shown to favorably interact with the aromatic ring of phenylalanine, cysteine has been suggested to be the most reasonable P1 residue. Our kinetic data are in line with these predictions. In fact, P1 substitutions have shown the following order of preferences: Cys > hCys ~ Alg > Abu > Thr > NVal > Val. Attempts at substituting the thiol group of cysteine with a hydroxyl group in serine or an amino group in diaminopropionic acid resulted in uncleaved peptides. Most likely these side chains are too hydrophilic to favorably interact with the hydrophobic milieu of the S1 pocket. Increasing the side chain length of the P1 residue in homocysteine and in allylglycine resulted in substrates that were still reasonably well cleaved. In contrast, incorporation of NVal, having the same side chain length resulted in a more pronounced impairment of cleavage efficiency. This could be related to a favorable interaction of the SH or the allyl group with the phenylalanine ring in the pocket which is expected not to occur in the case of the methyl group in NVal.

The kinetics of Thr and Val substituted peptides, the only two branched residues for which we could detect cleavage, deserve some further comment. The peptide substrate containing Val in P1 was detectably cleaved only in the presence of Pep4A and with a relative efficiency that was 15-fold lower than that observed for Thr in the P1 position. As a matter of fact the isopropyl branch seems detrimental to productive transition state binding as judged by the preference of NVal over Val. Fig. 2 shows a schematic view of the S1 pocket together with a cysteine docked into the pocket and a comparison of the conformation of this cysteine with the conformers of Thr and Val most commonly found in proteins. In this conformation the Val side chain is more likely to encounter steric hindrances in contacting the pocket than would be expected for Thr (Fig. 2). Alternatively, it could be assumed that for both Thr and Val only a methyl group of the side chain will point into the S1 pocket, whereas the other branch (a hydroxyl group in Thr and a methyl group in Val) will point out of the pocket. In this view, the fact that Thr is preferred over Val could indicate that its hydroxyl group will make some contacts outside the pocket that cannot be made by the methyl group of Val.


Fig. 2. Schematic view of the model of the S1 pocket of the NS3 protease. Solid lines indicate the backbone of residues flanking the S1 pocket. Cysteine is docked into the pocket with its side chain pointing in the direction of Phe213. Cys, Thr, and Val are shown in the conformations most commonly found in proteins.
[View Larger Version of this Image (21K GIF file)]


It is interesting to compare our data obtained with peptidic substrates to previous reports in which point mutations were introduced into polyprotein substrates. Kolykhalov and co-workers (22) have shown that susceptibility to mutations depends on the sequence context, the NS4A/NS4B cleavage site being intermediate between the least sensitive NS3/NS4A junction and the most sensitive NS5A/NS5B cleavage site. Among several substitutions, only Arg and Asp in the P1 position of the NS4A/NS4B junction resulted in complete abolishment of cleavage while a gradient following the order Asn < Gly < Ser < Thr < Cys < Leu was observed under conditions of short metabolic labeling pulses. Remarkably, in this experiment Leu proved to be even superior to Cys as P1 residue. In similar experiments Bartenschlager et al. (23) found that in the context of a NS4A/NS4B junction cleavage was reduced but still well detectable upon substitution of the P1 Cys with Phe, Ser, Thr, and Ala. Clearly, these findings are at variance with our data. However, both the experimental context and the nature of the substrates that have been used might explain these differences. In fact, in transient transfection experiments, even using short labeling times, the accumulation of considerable amounts of substrate and cleavage products is inevitable leading to deviation from true initial rates. This fact compresses differences in cleavage efficiencies. Furthermore, it is possible that NS3 is more active on polyprotein substrates than it is using a peptide substrate, thereby being less discriminative against suboptimal P1 residues. Differences in specific activity using either polyprotein or peptidic substrates have been reported for other proteases such as CMV protease (38) or tPa (39). Using in vitro translated substrates based on the NS5A/NS5B junction we have estimated the specific activity of added purified protease to be in the order of kcat/Km = 200,000 M-1 s-1.2 Nevertheless, we wanted to rule out that the relatively low activities we were observing using peptidic substrates were due to an only partially active enzyme. We have shown that we were indeed working with an enzyme population that was composed of more than 90% of enzymatically active molecules, indicating that the activities we measured under our experimental conditions were intrinsic features of the enzyme.

We found that optimized S1 pocket occupancy was a major determinant of kcat, whereas the P1 residue exerted a less pronounced effect on ground state binding of substrate peptides. Incorporation in the P1 position of residues that reduced cleavage of the resulting peptide to undetectable levels (at least 100-fold) resulted in unaltered or only up to 8-fold decreased affinities, as judged from the respective Ki values. Interestingly, this turned out to be true even for residues that were expected to cause significant steric or conformational perturbations such as phenylalanine or proline. This behavior sheds some light on the mechanisms of substrate recognition by the NS3 protease: apparently ground state binding of the substrate is mediated by multiple interactions involving distal residues, whereas the efficiency with which the bound substrate will proceed through the transition state is strongly influenced by the nature of the residue in the P1 position. This dual requirement is probably needed to endow the enzyme with the high degree of specificity necessary to accomplish its physiological role of mediating the generation of the mature HCV replication machinery.

In the P6 position a conserved negative charge is present in all cleavage sites. From our data it appears that, at least for the NS4A/NS4B junction there is a preference but no stringent requirement for this negative charge. This finding is in agreement with what has been found by others through introduction of point mutations in polyprotein substrates (22, 23). In fact, it has been reported that, in the context of different cleavage sites, extensive mutagenesis of the P6 position has little if any effect on cleavage efficiency. The absolute conservation of this residue especially in the light of the pronounced variability of the HCV genome might therefore indicate that it serves some more subtle function.

We attempted to identify additional crucial determinants of recognition by the protease by both "classical" alanine scanning and by "inverse alanine scanning" using a polyalanine substrate with 4 non-alanine positions. Re-introduction of wild type sequence residues in the minimalist polyalanine substrate causes a gradual but apparently, synergistic increase in cleavage efficiencies. Thus, re-introduction of the wild type P5 and P4' residues singularly into the minimalist substrate results in 2-2.5-fold cleavage enhancement, whereas pairwise re-introduction of both residues results in a more than 20-fold increase in kcat/Km.

The picture emerging from this study points to a rather permissive substrate binding site, where the only absolute requirement for cleavage is a small, hydrophobic P1 residue. Several minor contributions arising from contacts with distal residues then cooperate in modulating substrate recognition and cleavage. Our findings can be rationalized in the context of the recently solved structure of the NS3 protease (20, 21): the substrate binding channel is relatively solvent exposed, with all the contributing loops being shorter or absent in comparison with the other serine proteases of similar fold, like chymotrypsin or elastase. Moreover, substrate modelling in the active site suggested that major binding contributions should come from the P6-P2 peptide backbone, with an apparent lack of P5-P2 side chain to enzyme interactions (21). Overall, it appears that a deeper understanding of how substrate recognition by NS3 is fine-tuned, possibly by the use of combinatorial peptide libraries spanning several simultaneous mutations, will be necessary to guide the design of substrate-based inhibitors of NS3.


FOOTNOTES

*   The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Dagger    To whom correspondence should be addressed: IRBM, Via Pontina Km 30,600-00040-Pomezia, Italy. Tel.: 39-6-91093232; Fax: 39-6-91093225; E-mail: Steinkuhler{at}IRBM.it/.
1   The abbreviations used are: HCV, hepatitis C virus; Abu, alpha -aminobutyric acid; Aib, alpha -aminoisobutyric acid; Alg, allylglycine; CHAPS, 3-(3-cholamidopropyl)dimethylammonio-1-propanesulfonate; Fmoc, 9-fluorenylmethyloxycarbonyl; hCys, homocysteine; HPLC, high performance liquid chromatography; Hyp, hydroxyproline; NVal, norvaline.
2   R. De Francesco unpublished observations.

ACKNOWLEDGEMENTS

We are grateful to R. Cortese for continuous support of this work, V. Matassa for valuable discussions and careful revision of the manuscript, S. Acali for peptide synthesis, G. Biasiol for protein purification, R. Petruzzelli for N-terminal sequence analysis, and P. Pucci and F. Naimo for mass spectrometry.


REFERENCES

  1. Tomei, L., Failla, C., Santolini, E., De Francesco, R., and La Monica, N. (1993) J. Virol. 67, 4017-4026 [Abstract]
  2. Grakoui, A., McCourt, D. W., Wychowski, C., Feinstone, S. M., and Rice, C. M. (1993) J. Virol. 67, 2832-2843 [Abstract]
  3. Bartenschlager, R., Ahlborn-Laake, L., Mous, J., and Jacobsen, H. (1993) J. Virol. 67, 3835-3844 [Abstract]
  4. Eckard, M. R., Selby, M., Masiarz, F., Lee, C., Berger, K., Crawford, K., Kuo, C., Kuo, G., Houghton, M., and Choo, Q. L. (1993) Biochem. Biophys. Res. Commun. 192, 399-406 [CrossRef][Medline] [Order article via Infotrieve]
  5. Hijikata, M., Mizushima, H., Tanji, Y., Komoda, Y., Hirowatari, Y., Akagi, T., Kato, N., Kimura, K., and Shimotohno, K. (1993) Proc. Natl. Acad. Sci. U. S. A. 90, 10773-10777 [Abstract]
  6. Komoda, Y., Hijikata, M., Tanji, Y., Hirowatari, Y., Mizushima, H., Kimura, K., and Shimotohno, K. (1994) Gene (Amst.) 145, 221-226 [Medline] [Order article via Infotrieve]
  7. Failla, C., Tomei, L., and De Francesco, R. (1994) J. Virol. 68, 3753-3760 [Abstract]
  8. Lin, C., Pragai, B. M., Grakoui, A., Xu, J., and Rice, C. M. (1994) J. Virol. 68, 8147-8157 [Abstract]
  9. Failla, C., Tomei, L., and De Francesco, R. (1995) J. Virol. 69, 1769-1777 [Abstract]
  10. Tamji, Y., Hijikata, M., Satoh, S., Kaneko, T., and Shimotohno, K. (1995) J. Virol 69, 1575-1581 [Abstract]
  11. Satoh, S., Tamji, Y., Hijikata, M., Kimura, K., and Shimotohno, K. (1995) J. Virol. 69, 4255-4260 [Abstract]
  12. Lin, C., Thomson, J. A., and Rice, C. (1995) J. Virol. 69, 4373-4380 [Abstract]
  13. Bartenschlager, R., Lohmann, V., Wilkinson, T., and Koch, J. A. (1995) J. Virol. 69, 7519-7528 [Abstract]
  14. Tomei, L., Failla, C., Vitale, R. L., Bianchi, E., and De Francesco, R. (1996) J. Gen. Virol. 77, 1065-1070 [Abstract]
  15. Shimizu, Y., Yamaji, K., Masuho, Y., Yokota, T., Inoue, H., Sudo, S., and Shimotohno, K. (1996) J. Virol. 70, 127-132 [Abstract]
  16. Koch, J. O., Lohmann, V., Herian, U., and Bartenschlager, R. (1996) Virology 221, 54-66 [CrossRef][Medline] [Order article via Infotrieve]
  17. Steinkühler, C., Tomei, L., and De Francesco, R. (1996) J. Biol. Chem. 271, 6367-6373 [Abstract/Free Full Text]
  18. Pizzi, E., Tramontano, A., Tomei, L., La Monica, N., Failla, C., Sardana, M., Wood, T., and De Francesco, R. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 888-892 [Abstract]
  19. Failla, C., Pizzi, E., De Francesco, R., and Tramontano, A. (1996) Folding & Design 1, 35-42 [Medline] [Order article via Infotrieve]
  20. Love, R. A., Parge, H. E., Wickersham, J. A., Hostomsky, Z., Habuka, N., Moomaw, E. W., Adachi, T., and Hostomska, Z. (1996) Cell 87, 331-342 [Medline] [Order article via Infotrieve]
  21. Kim, J. L., Morgenstern, K. A., Lin, C., Fox, T., Dwyer, M. D., Landro, J. A., Chambers, S. P., Markland, W., Lepre, C. A., O'Malley, E. T., Harbeson, S. L., Rice, C. M., Murcko, M. A., Caron, P. R., and Thomson, J. A. (1996) Cell 87, 343-355 [Medline] [Order article via Infotrieve]
  22. Kolykhalov, A. A., Agapov, E., and Rice, C. (1994) J. Virol. 68, 7525-7533 [Abstract]
  23. Bartenschlager, R., Ahlborn-Laake, L., Yasargil, K., Mous, J., and Jacobsen, H. (1995) J. Virol. 69, 198-205 [Abstract]
  24. Leinbach, S., Bhat, R., Xia, S. M., Hum, W. T., Stauffer, B., Davis, A., Hung, P. P., and Mizutani, S. (1994) Virology 204, 163-169 [CrossRef][Medline] [Order article via Infotrieve]
  25. Komoda, Y., Hijikata, M., Sato, S., Asabe, S. I., Kimura, K., and Shimotohno, K. (1994) J. Virol. 68, 7351-7357 [Abstract]
  26. Steinkühler, C., Urbani, A., Tomei, L., Biasiol, G., Sardana, M., Bianchi, E., Pessi, A., and De Francesco, R. (1996) J. Virol. 70, 6694-6700 [Abstract]
  27. Kakiuchi, N., Hijikata, M., Komoda, Y., Tanji, Y., Hirowatari, Y., and Shimotohno, K. (1995) Biochem. Biophys. Res. Commun. 210, 1059-1065 [CrossRef][Medline] [Order article via Infotrieve]
  28. Overton, H., McMillan, D., Gillespie, F., and Mills, J. (1995) J. Gen. Virol. 76, 3009-3019 [Abstract]
  29. D'Souza, E. D. A., Grace, K., Sangar, D. V., Rowlands, D. J., and Clarke, B. E. (1995) J. Gen. Virol. 76, 1729-1739 [Abstract]
  30. Suzuki, T., Sato, M., Chieda, S., Shoji, I., Harada, T., Yamakawa, Y., Watabe, S., Matsuura, Y., and Miyamura, T. (1995) J. Gen. Virol. 76, 3021-3029 [Abstract]
  31. Shoji, I., Suzuki, T., Chieda, S., Sato, M., Harada, T., Chiba, T., Matsuura, Y., and Miyamura, T. (1995) Hepatology 22, 1648-1655 [Medline] [Order article via Infotrieve]
  32. Mori, A., Yamada, K., Kimura, J., Koide, T., Yuasa, S., Yamada, E., and Miyamura, T. (1996) FEBS Lett. 378, 37-42 [CrossRef][Medline] [Order article via Infotrieve]
  33. Hong, Z., Ferrari, E., Wright-Minogue, J., Chase, R., Risano, C., Seelig, G., Lee, C., and Kwong, A. D. (1996) J. Virol. 70, 4261-4268 [Abstract]
  34. Taliani, M., Bianchi, E., Narjes, F., Fossatelli, M., Urbani, A., Steinkühler, C., De Francesco, R., and Pessi, A. (1996) Anal. Biochem. 240, 60-67 [CrossRef][Medline] [Order article via Infotrieve]
  35. Fersht, A. (1985) Enzyme Structure and Mechanism, Freeman, New York
  36. Wang, W., and Liang, C. (1994) Biochemistry 33, 14636-14641 [Medline] [Order article via Infotrieve]
  37. Perona, J. J., and Craik, C. S. (1995) Protein Sci. 4, 337-360 [Abstract/Free Full Text]
  38. Yamanaka, G., DiIanni, C. L., O'Boyle, D. R., II, Stevens, J., Weinheimer, S. P., Deckman, I. C., Matusick-Kumar, L., and Colonno, R. J. (1995) J. Biol. Chem 270, 30168-30172 [Abstract/Free Full Text]
  39. Coombs, G. S., Dang, A. T., Madison, E. J., and Corey, D. R. (1996) J. Biol. Chem. 271, 4461-4467 [Abstract/Free Full Text]

©1997 by The American Society for Biochemistry and Molecular Biology, Inc.