1 Departament de Ciències Mèdiques Bàsiques, Facultat de Medicina, Universitat de Lleida, Avda. Rovira Roure 44, 25198 Lleida and 3 Institut de Biotecnologia i Biomedicina, Departament de Bioquímica i Biologia Molecular, Universitat Autònoma de Barcelona, Bellaterra, Barcelona, Spain
2 To whom correspondence should be addressed. E-mail: angel.mozo{at}cmb.udl.es
![]() |
Abstract |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Keywords: electric dipole moment/electrostatic interactions/hydrophobicity/thermostability
![]() |
Introduction |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Of all the interactions that take place within proteins, electrostatic and hydrophobic are the ones that have probably captured the attention of most researchers (Querol et al., 1996; Mozo-Villarías and Querol, 2000
; Sterner and Liebl, 2001
; Vieille and Zeikus, 2001
). As far as electrostatic forces are concerned, an increasing number of papers have established the enormous importance (both in number and constitution) of ion pair networks in maintaining thermal stability (Bashford, 1991
; Honig and Nicholls, 1995
; Vogt et al., 1997
; Chin et al., 1999
; Xiao and Honig, 1999
). The idea that emerges from these observations is that of a protein which works as an electric network, mostly at the surface and around a hydrophobic core. The stronger and more complex the net is, the most resistant the protein is to destabilization. A large number of computer software tools based on different numerical approximations analyzing electrical interactions have demonstrated their utility for computing and establishing electrical characteristics, such as the distributions of electric field and electric potential in proteins (Honig and Nicholls, 1995
). The present work suggests the use of magnitudes that characterize electrical properties of a protein in order to help determine which points of the protein are most sensitive in maintaining its 3D structure.
In this paper, we describe common traits shown by studies of protein thermostabilization involving electrical interactions. It must be noted that when extracting protein thermal information from the literature, it is difficult to choose interesting cases in a coherent manner, owing to the different criteria used by different researchers when reporting their results in terms of the thermal characteristics of the protein under study. The transition temperature, Tm, is considered to be the standard parameter for describing the thermal stability of a protein, yet many researchers prefer to present their results in terms of thermoresistance, which is the half-life of residual activity at a given temperature and therefore not necessarily related to Tm. This is a frequent case in studies involving enzymes for biotechnological purposes. Other authors simply provide information about optimal temperatures at which original organisms live, while there are those who directly provide values for G, that is, the change in free energy associated with thermostabilization.
The analysis presented in this paper provides an approximate but very simple way to monitor the outcome of any prediction of thermostabilization when a given mutation involving charged residues is proposed.
![]() |
Materials and methods |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Hydrophobic density calculations. The center-of-mass of each protein was determined using the coordinates provided by the Protein Data Bank (PDB). The coordinates for each atom were re-calculated and used for subsequent calculations. In this way, the center-of-mass of the protein was the origin of coordinates for all subsequent computations.
Hydrophobic tensors equivalent to the inertia tensor were calculated for positive hydrophobicities (h+i) and negative hydrophobicities (h-i), according to the Eisenberg hydrophobicity scale (Eisenberg et al., 1982). On this scale, positive values indicate hydrophobicity and negative values indicate hydrophylicity. These tensors were subsequently diagonalized:
. The radii of the equivalent ellipsoids were calculated from their respective diagonal elements. Thus, the radii of the ellipsoid corresponding to the positive hydrophobicity amino acids were calculated as
![]() |
where are the diagonal elements of the negative electric charge distribution and
is the sum of all the positive hydrophobicity amino acids in the protein. Subscript i denotes the three dimensions of space x, y, z. Similarly,
for negative hydrophobicity were calculated.
The positive hydrophobicity ellipsoid is of particular interest in this study, since it provides a measure equivalent to the density of the hydrophobic packing of a protein, . This density is computed as
Dipole moment vectors. The classical definition of the electric dipole moment vector or first-order electric moment is given
by , where
is the position vector of residue j with charge qj. Unless the total charge of the protein is zero,
is not independent of the origin of coordinates. Since in most proteins the total charge is not zero, it is useful to define two quasi-dipole moments,
and
, as
and
, where
and
are the total negative and positive charges (
is the total charge) and
![]() |
are the positive and negative centroids of the charge distributions, respectively.
Note that and
differ, as
, and when
.
Both and
are independent of the origin of coordinates and therefore they are intrinsic parameters of the protein. As a consequence, these quasi-dipole moments provide a useful characterization of the dipolar nature of the charge distribution of a protein. Since the information provided by
is essentially
the same as that of , in what follows
is used to characterize the charge distribution in a protein and its modulus is designed as P and expressed in debye. We prefer to use these quasi-moments rather than the origin-dependent true moment
since the purpose of this study was to compare moments among proteins with different thermal properties as described below and the use of D could give rise to meaningless conclusions.
All these magnitudes were easily calculated for the proteins described below using a desktop computer and employing a very simple function written in Mathematica.
Mesostablethermostable protein pairs. Coordinates for mesostablethermostable pairs of proteins, whose thermal properties are described in the literature, were obtained from the PDB. These proteins are listed in Table I. Here we report only those proteins whose sequences are complete. In some cases, proteins in their native state are complexes of several monomers, but only the coordinates of the monomers are reported in the PDB. In such cases, the oligomers were reconstructed by means of the Swiss-PDBViewer software, using their oligomeric homologues as templates. Proteins which lacked important fragments are not reported. In a few cases, mesostablethermostable pairs also included a hyperthermostable member whose parameters were also calculated for comparison.
|
|
Experimental
ß-Glucanase and its mutant N207D were obtained and purified by using experimental procedures described elsewhere (Pons et al., 1997). Transition temperatures (Tm) upon heating were measured spectrofluorimetrically on ß-glucanase wild-type and N207D mutant. Experiments were carried out with a Shimadzu RF2000 spectrofluorimeter.
![]() |
Results |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Sets of mesostablethermostable pairs of proteins were obtained by pooling all cases recently reported in the literature (Vogt et al., 1997; Karshikoff and Ladenstein, 1998
; Xiao and Honig, 1999
; Knochel et al., 2000
; Kumar et al., 2000
; Sterner and Liebl, 2001
; Vieille and Zeikus, 2001
; Klein et al., 2002; Nicholson et al., 2002
; Robic et al., 2002
). In the cases reported here, we did not take into account particular atomic or structural considerations. The only condition imposed was that the proteins reported should not lack any fragments or elements that could be of importance in the interpretation of electrical results. The electrical parameters described in the Materials and methods section were calculated for 37 groups of mesostablethermostable (and/or hyperthermostable) proteins and are shown in Table I
, together with their thermal and mechanical characteristics. For the computation of the electrical parameters, the charged amino acids were used [+1 for arginines and lysines, -1 for aspartates and glutamates and +0.5 for histidines (Spassov et al., 1997
)]. Regardless of the thermal properties of each protein, it was found that the sizes of the equivalent electric ellipsoids (both positive and negative) were, as expected, somewhat larger than those of the inertia ellipsoids, since most charged amino acids tend to be located in the surface of the protein. Of the 37 pairs of proteins reported, 27 (73%) had their quasi-dipole moment value (P) lower than that of their mesostable counterpart. It was also observed that many cases also show an increase in
+ concomitant with higher thermostability. Only two families (CheY and methionine aminopeptidase) showed both opposite effects simultaneously: increase in P and decrease in
+ for higher thermostability.
We estimated the error in these parameters from observed differences in values obtained in different subunits of polymeric crystals. For example, we found that in glutamate dehydrogenase, the variability was of the order of 2% for these calculated parameters.
Point mutations
The considerations explained in the preceding section also apply for the proteins reported in Table II and their respective point mutations (Dao-Pin et al., 1991
; Meiering et al., 1992
; Zhang et al., 1992
; Kanaya et al., 1996
; Predki et al., 1996
; White et al., 1996
; Malakauskas and Mayo, 1998
; Grimsley et al., 1999
; Lebbnik et al., 1999
; Mikami et al., 1999
; Peterson et al., 1999
; Wray et al., 1999
; Zhu et al., 1999
; Arrizubieta and Polaina, 2000
; Chen et al., 2000
; González-Blasco et al., 2000
; Ha et al., 2000
; Hasegawa et al., 2000
; Tchan et al., 2000
; Martin et al., 2001
; Ohmura et al., 2001
; Pedone et al., 2001
; Perl and Schmid, 2001
; Shaw et al., 2001
; Stewart et al., 2001
; Sung et al., 2001
; Trejo et al., 2001
; Almog et al., 2002
; Pokkuluri et al., 2002
). This table reports 55 mutations carried out in 30 proteins involving electric changes. In 43 of the cases reported (78%) mutations showed a decrease in P when thermostability increased or, conversely, an increase in P when thermostability decreased, which is common among reported mutations.
Relative electric dipole moment versus transition temperature
In order to illustrate the sensitivity of the thermal stability of a protein with respect to its quasi-dipole moment, the relative change of P was plotted against the increase in Tm, Tm. This relative change,
P/Pmeso, is calculated by taking the value of P of the mesostable protein as reference, in cases dealing with mesothermostable pairs or the value of the wild-type when dealing with point mutations. Figure 1
shows this relative change in P vs
Tm for those cases whose variations of thermal properties are reported in terms of Tm, for both mesothermostable pairs of proteins and point mutations. Both plots show linear correlations between both increments,
P/Pmeso and
Tm. The linear fit characteristics of both populations are displayed in Table III
. It should be noted that both fits show similar slope values around -0.011. The level of significance for both sets of proteins is p < 0.05 for mesothermo pairs (n = 18) and p < 0.01 for mutated proteins (n = 39).
|
|
of Pp and very similar results were obtained in terms of
P/Pmeso.
Detailed analysis and profiles of some examples
Cold shock protein. In order to illustrate the use of these electric dipole moments in predicting thermostabilizing mutations, profiles of P were calculated throughout the cold shock protein sequence. Figure 2 shows profiles of computed P for the mesostablethermostable pair, when each amino acid is replaced by either a negative charge, a neutral amino acid or a positive charge throughout its sequences. Those substitutions in which wild-type P values were reduced should, in principle, be considered as candidates for stabilizing mutations. For example, mutations E3R and E66L produced a decrease in P. Perl and Schmidt performed these mutations on the mesostable cold shock protein 1csp with a resulting increase in Tm (Perl and Schmidt, 2001). A corresponding decrease in Tm was observed when reversed mutations were carried out on the thermostable cold shock protein 1c90. An increase in P was observed for R3E and L66E mutations.
|
|
![]() |
Discussion |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
We found an inverse linear correlation between the relative change in dipole moment and transition temperature increase. Such a correlation should be expected since the attractive Coulomb force, which is a cohesive force, exerted by both positive and negative centroids depends on the inverse square distance between them. This finding is in good agreement with current ideas about the role played by the electrostatic networks of ion pairs and their optimization in thermostable proteins, as pointed out by several investigators (Spassov et al., 1994; Vogt et al., 1997
; Xiao and Honig, 1999
). According to these authors, such a clustering of opposite charges brings the centers of positive and negative charges close in space and thus reduces the magnitude of the electrostatic dipole moment of the protein. A similar correlation is found when the mesostablethermostable pairs of Table I
are used to compute dipole moments according to the method described by Felder et al.(1998)
by means of the true concept of dipole moment taking the precaution of being systematic with the choice of the origin of coordinates. However, the correlation coefficient found with their dipole moments was lower than those reported in Table III
.
In the calculations of the electrostatic dipole moment, it was assumed that all ionizable groups are fully charged except for histidines that were considered to have +0.5 charge. Given the large set of proteins being considered, we propose that this assumption is acceptable in the light of recent results on the possibility that a point mutation might not change the net charge of the protein. Alexov et al.(2000), working with replacements of a neutral group with ionizable in the reaction centers of several species of Rhodobacter, found that the replacements might not change the charge of the protein simply because the ionizable group could be uncharged owing to the specific environment in the protein or it might alter the charge of its neighbor.
It is worth mentioning that the attractive electrostatic force also depends on the inverse of the dielectric constant of the protein, which is also likely to decrease upon increasing the hydrophobic density. The difficulty in obtaining values for the dielectric constant of a protein has already been pointed out (Pitera et al. 2001; Schutz and Warshel, 2001
), being a function of the hydrophobic amino acid composition of the protein. The link between the electrical and hydrodynamic forces operating in a protein is certainly an unresolved complex issue and one in which determination of the distribution of the dielectric constant in the protein is crucial. It may not even be possible to propose general rules to describe their mutual influence, since they may depend on the context of each protein. It is necessary to rely on the opportunistic nature of this equilibrium in which other forces, like hydrogen bonding, solvation and others that have been excluded from this study, are also present. However, in a first approximation, an increase in positive hydrophobic density,
+, should imply a decrease in the average dielectric constant of the core of the protein and this would contribute to an increase in the strength of the attractive electric force, even in cases in which dipole moment increases. There are some cases reported in this paper in which we see thermal stabilizations in spite of a relative increase in P. At this point we can only suggest that when the core is more densely packed (as expressed by an increase in the positive hydrophobicity density,
+), the effect of increasing the separation between electric centroids (increase in P) may be compensated by an increase in the dielectric constant of the medium.
Another important aspect to be taken into account is the fact that the stabilization factors suggested here (minimization of P and maximization of +) are defined in a specific structural context involving relatively small structural variations within a protein. When major changes are to be considered, such as those simulated by Petrey and Honig (Petrey and Honig, 2000
), where the sequence of a given protein was superimposed on a totally different fold, the effects described in this paper should not be expected necessarily to hold.
The interest in this analysis lies mostly in the way it effectively provides both electric and hydrophobic equivalent spheroids for the study protein, without the need for a detailed knowledge of atomic peculiarities such as steric constraints, the existence of water-filled or empty cavities, etc. This circumstance imposes certain limitations on the applicability of this analysis in the case of very asymmetric proteins, as exemplified in the case of aminotransferase (not reported here). In this protein, the association of an ellipsoid with such an irregular structure (S-shaped) causes a considerable distortion of the model, which considers a protein as a spheroid possessing a compact hydrophobic nucleus, enclosed in an electrical layer.
Although the present state of knowledge does not allow a quantitative prediction of the degree of protein stability from its sequence and/or its 3D structure, or the amount of acquired stabilization (or destabilization) when a given mutation takes place, it is possible to check if putative specific replacements are likely to enhance the thermal stability of a protein.
![]() |
Acknowledgments |
---|
![]() |
References |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Almog,O., Gallagher, D.T., Ladner,J., Strausberg, S., Alexander,P., Bryan,P. and Gililand,G. (2002) J. Biol. Chem., 277, 2755327558.
Arrizubieta,M.J. and Polaina,J. (2000) J. Mol. Biol., 275, 2884328848.
Bashford,D. (1991) Curr. Opin. Struct. Biol., 1, 175184.[CrossRef]
Chen,J., Lu,Z., Sakon,J. and Stites,W. (2000) J. Mol. Biol., 303, 125130.[CrossRef][ISI][Medline]
Chin,K., Sharp,K., Honig,B. and Pyle,A.-M. (1999) Nature Struct. Biol., 6, 10551061.[CrossRef][ISI][Medline]
Dao-Pin,S., Soderlind,E., Baase,W., Wozniak,J., Sauer,U. and Matthews,B. (1991) J. Mol. Biol., 221, 873887.[CrossRef][ISI][Medline]
Eisenberg, D., Weiss, R., Terwilliger, T. and Wilcox, W. (1982) Faraday Symp. Chem. Soc., 17, 109120.
Felder,C., Botti,S., Lifson,S., Silman,I. and Sussman,J. (1998) J. Mol. Graphics Modelling, 15, 318327.[CrossRef][ISI]
González-Blasco,G., Sanz-Aparicio,J., González,B., Hermoso,J. and Polaina,J. (2000) J. Chem. Biol., 275, 1378013712.
Grimsley,G., Shaw,K., Fee,L., Alston,R., Huyghes-Despointes,B., Thurlkill,R., Scholz,R. and Pace,N. (1999) Protein Sci., 8, 18431849.[Abstract]
Ha,N.C., Oh,B., Shin,S., Kim,H.-J., Oh,T.-K., Kim,Y.-O., Choi,K. and Oh,B.-H. (2000) Nature Struct. Biol., 7, 147153.[CrossRef][ISI][Medline]
Hasegawa,J., Uchiyama,S., Tanimoto,Y., Mizutani,M., Kobayashi,Y., Sambongi,Y. and Igarashi, Y. (2000) J. Biol. Chem., 275, 3782437828.
Honig,B. and Nicholls,A. (1995) Science, 268, 11441149.[ISI][Medline]
Juncosa, M., Pons, J., Planas, A. and Querol, E. (1994) BioTechniques, 16, 829824.
Kanaya,S., Oobatake,M. and Liu,Y. (1996) J. Biol. Chem., 271, 3272932736.
Karshikoff, A. and Ladenstein, R. (1998) Protein Eng., 11, 867872.[Abstract]
Klein,C., Georges,G., Kunkele,K.-P., Huber,R., Engh,R. and Hansen,S. (2001) J. Biol. Chem., 276, 3739037401.
Knochel,T., Pappenberger,A., Jansonius,J. and Kirschner,K. (2000). J. Biol. Chem., 277, 86268634.
Kumar,S., Tsai,C.-J. and Nussinov,R. (2000) Protein Eng., 13, 179191.
Lebbnik,J., Knapp,S., van der Oost,J., Rice,D. and de Vos,W. (1999) J. Mol. Biol., 289, 357369.[CrossRef][ISI][Medline]
Malakauskas,S. and Mayo,S. (1998) Nature Struct. Biol. 5, 470475.[ISI][Medline]
Marshall,S., Morgan,C. and Mayo,S. (2000) J. Mol. Biol., 316, 189199.[CrossRef]
Martin,A., Sieber,V. and Schmid,F. (2001) J. Mol. Biol., 309, 717726.[CrossRef][ISI][Medline]
Meiering,E., Serrano,L. and Fersht,A. (1992) J. Mol. Biol., 225, 585589.[ISI][Medline]
Mikami,B., Yoo,H.-J. and Yoshigi,N. (1999) J. Mol. Biol., 285, 12351243.[CrossRef][ISI][Medline]
Mozo-Villarias,A. and Querol,E. (2000) Curr. Top. Biochem. Res., 2, 1931.
Nicholson,E., Mo,H., Prusiner,S., Cohen,F. and Merqusee,S. (2002) J. Mol. Biol., 316, 80715.[CrossRef][ISI][Medline]
Ohmura,T., Ueda,T., Ootsuka,K., Saito,M. and Imoto,T. (2001) Protein Sci., 10, 313320.
Pedone,E., Saviano,M., Rossi,M. and Bartolucci,S. (2001) Protein Eng., 14, 255260.
Perl,D. and Schmid,F. (2001) J. Mol. Biol., 313, 343357.[CrossRef][ISI][Medline]
Peterson,R., Nicholson,E., Tharpar,R., Klevit,R. and Scholtz,M. (1999) J. Mol. Biol., 286, 16091619.[CrossRef][ISI][Medline]
Petrey,D. and Honig,B. (2000) Protein Sci., 9, 21812191.[Abstract]
Pitera, J, Falta,M. and van Gunsteren,W. (2001) Biophys. J., 80, 25462555.
Pokkuluri,P.R., Gu,M., Cai,X., Raffen,R., Stevens,F.J. and Schiffer,M. (2002) Protein Sci., 11, 16871694.
Pons,J., Querol,E. and Planas,A. (1997) J. Biol. Chem., 272, 1300613012.
Predki,P., Agrawal,V., Brunger,A. and Regan,L. (1996) Nature Struct. Biol., 3, 5458.[ISI][Medline]
Querol,E., Perez-Pons,J.A. and Mozo-Villarias,A. (1996) Protein Eng., 9, 265271.[Abstract]
Robic,S., Berger,J. and Marquesee,S. (2002). Protein Sci., 11, 381389.
Schutz,C. and Warshel,A. (2001) Proteins: Struct. Funct. Genet., 44, 400417.[CrossRef][ISI][Medline]
Shaw,K., Grimsley,G., Yakovlev,G., Makarov,A. and Pace,N. (2001) Protein Sci., 10, 12061215.
Spassov,V., Karshikoff,A. and Ladenstein,R. (1994). Protein Sci., 3, 15561569.
Spassov,V.Z., Ladenstein,R. and Karshikoff,A. (1997) Protein Sci., 6, 11901196.
Sterner,R. and Liebl,W. (2001) Crit. Rev. Biochem. Mol. Biol., 36, 39106.
Stewart,R., Varghese,J., Garrett,T., Hog,P. and Fincher,G. (2001) Protein Eng., 14, 245253.
Sung,Y.-H., Hong,H.-D., Cheong,C., Kim,J., Cho,J., Kim,Y.R. and Lee,W. (2001) J. Biol. Chem., 276, 4422944238.
Tchan,M., Choy,K., Mackay,J., Lyons,A., Bains,N. and Weiss,A. (2000) J. Biol. Chem., 275, 3745437461.
Trejo,F., Gelpí,J., Ferrer,A., Boronat,A., Busquets,M. and Cortés,A. (2001) Protein Eng., 14, 911917.
Vieille,C. and Zeikus,G. (2001) Microbiol. Mol. Biol. Rev., 65, 143.
Vogt,G., Woell,S. and Argos,P. (1997) J. Mol. Biol., 269, 631643.[CrossRef][ISI][Medline]
White,P., Squirrell,D., Arnaud,P., Lowe,C. and Murray,J. (1996) Biochem. J., 319, 343350.[ISI][Medline]
Wray,J., Baase,W., Lindstrom,J., Weaver,L., Poteete,A. and Matthews,B. (1999) J. Mol. Biol., 292, 11111120.[CrossRef][ISI][Medline]
Xiao,L. and Honig,B. (1999) J. Mol. Biol., 289, 14351444.[CrossRef][ISI][Medline]
Zhang,J., Liu,Z.-P., Jones,A., Gierasch,L. and Sambrook,J. (1992) Proteins: Struct. Funct. Genet., 13, 8799.[ISI][Medline]
Zhu,G.-P., Xu,C.-X., Teng,M.-K., Tao,L.-M., Zhu,X.-Y., Wu,C.-J., Hang,J., Niu,L.-W. and Wang,Y.-Zh. (1999) Protein Eng., 12, 635638.
Received July 23, 2002; revised January 20, 2003; accepted February 5, 2003.