1 Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Ul. Miklukho-Maklaya, 16/10 Moscow V-437, 117871 GSP-7, Russia and 3 Université des Sciences et Technologies de Lille, CRESIMM, UFR de Chimie, Bâtiment C8, 59655 Villeneuve d'Ascq, Cedex, France
![]() |
Abstract |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Keywords: hydrophilic contacts/hydrophobic contacts/molecular dynamics simulation/protein structure recognition/structure quality
![]() |
Introduction |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
In the present work we show that the value of the similarity score is correlated with the quality of the protein model. To prove that, the -score values are calculated for `good' experimental structures of proteins, and these values were compared with that calculated for `nearly good' models as well as for completely wrong models. The confidence interval of the
scores was also estimated.
![]() |
Materials and methods |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
To calculate the relative strengths and the types of intramolecular contacts (hydrophobic, hydrophilic or unfavorable) between side-chain (S) and backbone (B) segments of amino acid residues in proteins with known spatial structure we used approach described in detail in (Golovanov et al., 1998). The value of MHPXYij, created by the atoms of segment X (S or B) of amino acid residue i (source residue) in geometrical center of the segment Y (S or B) of residue j (target residue, i
j) was calculated according to the formula:
|
The atomic hydrophobicity constants fk (derived from experimental octanolwater partition coefficients of large set of various chemical compounds) of atom k were taken from Viswanadhan et al. (1989) and complemented as described before (Efremov and Alix, 1993). Ni is the number of atoms in segment X of residue i, Rdec is the effective radius of decay of the potential (taken to be 1 Å, as shown by Fauchère et al., 1988
), and rkj is the distance (in Å) between atom k and the geometrical center of segment Y of residue j. The exponential distance function in Equation (1) obviously has a sense only when rkj exceeds a certain value (approximately the sum of two atomic van der Waals radii). This condition is implicitly fulfilled for the protein models which do not have strong steric bumps.
Individual contributions of amino acid residue i to hydrophobic, hydrophilic and unfavorable pairwise interactions were calculated as:
|
where XY is SS, SB or BB. The value of CutoffMHP = 0.001 was chosen (Efremov et al., 1995; Golovanov et al., 1998
) to consider only the strongest hydrophobic, hydrophilic and unfavorable contacts and to discard the weak contacts (originating from residue pairs at distances greater ~7 Å). The values of individual contributions (given in arbitrary units) were multiplied by the factor 106 for convenience.
To characterize the contributions from SS, SB and BB interactions for the whole protein, several parameters (SumXYphob, SumXYphil and SumXYunf) were calculated as the sum of correspondent individual contributions (CXYphob, CXYphil and CXYunf) over all residues, normalized by the number of residues in the protein. Thus the normalized sums of hydrophobic, hydrophilic and unfavorable contributions were calculated as:
|
The similarity score = Sphob + Sphil + Sunf gives the total sum of all favorable (hydrophobic and hydrophilic) and unfavorable (hydrophobic-to-hydrophilic) interactions in the protein, normalized by the number of residues.
The contributions of hydrophobic, hydrophilic and unfavorable interactions for the protein model, as well as similarity score are calculated with our program Hi-EXPO (Golovanov et al., 1998
), which is available from A.P.G.
Representative and misfolded structures
The atomic coordinates of protein models were taken from the Protein Data Bank (PDB), (Bernstein et al., 1977). A representative set of 196 high-resolution (<2.0 Å) protein crystal structures with low homology (less than 25%) was prepared using the set proposed previously (Hobohm and Sander, 1994
). After deletion of the models lacking heavy atoms the set consisted of entries with the following names: 1lkk, 1arb, 5rxn, 1cus, 7rsa, 1ptx, 1aac, 193l, 1hmt, 1xyz, 2ctc, 2olb, 2phy, 3sdh, 1eca, 256b, 1rcf, 2end, 4gcr, 2rn2, 1xnb, 8abp, 1mla, 2hbg, 2mcm, 1ccr, 2prk, 9rnt, 121p, 2cba, 3grs, 1jcv, 1tca, 1tgx, 1arv, 1csh, 1mrj, 1nif, 1phg, 1ppn, 1vcc, 2ayh, 2dri, 2sil, 3pte, 2er7, 4fgf, 153l, 1nfp, 8tln, 2cpl, 1lcp, 1snc, 1sri, 3est, 1cpc, 1cpc, 2hmz, 3chy, 2ccy, 131l, 1bec, 1fkj, 1fnc, 1gca, 1gof, 1knb, 1mls, 1mol, 1onc, 1sbp, 1thx, 1ttb, 1vhh, 2bop, 2gdm, 3dfr, 1bp2, 2alp, 2cyp, 1hpm, 1vsd, 1chd, 1kpt, 1sat, 1thv, 1udh, 3cla, 2acq, 1amp, 1ars, 1atl, 1bdm, 1cmb, 1gad, 1hny, 1hvk, 1hxn, 1ilk, 1len, 1lfa, 1mml, 1nar, 1pgs, 1tml, 1xyl, 2fal, 2gst, 2nac, 2por, 2tgi, 3sic, 2aza, 2cdv, 4fxn, 7pcy, 1isc, 1cel, 1iae, 5tim, 1llo, 2abk, 1dyr, 1hsl, 1afb, 1aoz, 1chm, 1clc, 1dup, 1ede, 1fba, 1gpr, 1lis, 1mld, 1nhk, 1pbe, 1pnk, 1pnk, 1reg, 1slt, 1ubs, 1ukz, 2chs, 2mnr, 2fd2, 3tgl, 4enl, 1pbp, 1cns, 1daa, 1lts, 1dpg, 2pgd, 3pga, 1ade, 1cew, 1cfb, 1dsb, 1fnf, 1gky, 1hur, 1hvd, 1lct, 1lki, 1lld, 1mpp, 1msc, 1nba, 1nhp, 1ora, 1oyc, 1pbn, 1pii, 1pne, 1poc, 1rci, 1rtp, 1rva, 1sac, 1sra, 1trk, 1wht, 1wht, 2cwg, 2ebn, 2hpd, 2kau, 2kau, 2prd, 2scp, 4blm, 4mt2, 8acn, 1bbp, 1i1b, 3rub.
A set of 26 misfolded structures (Holm and Sander, 1992) was taken from the Internet (ftp.embl-heidelberg.de, from the directory /pub/databases/misfolded). This set was obtained by swapping the sequences of pairs of proteins with the same number of residues, but different structures, with further relaxing the coordinates with energy minimization. Thus each deliberately misfolded protein has two `parent' native structures (one for its fold, and the other for its sequence). The coordinates of `parent' structures were taken from PDB. Two misfolded structures for which their parent models were not found in the PDB were deleted from the original set of Holm and Sander (1992). Hydrogen atoms were attached using the standard facilities of the programs SYBYL (Clark et al., 1989
) or CHARMM (Brooks et al., 1983
).
Distorted structures
Seven structures from the representative set with the highest resolution (PDB codes 193l, 1aac, 1arb, 1cus, 1lkk, 5rxn and 7rsa, resolution better than 1.33 Å) were subjected to smooth distortion. For each of these proteins the set of 200 alternative conformations was calculated. The first 13 structures in each set were obtained by unconstrained conformational energy minimization. For the first structure 100 minimization steps were done with the positions of heavy atoms fixed. For the second structure 5 steps of minimization were done with all atoms relaxed. The number of minimization steps for structure number i (3 i
13) was defined as Ni = Ni1 + 5 + (i 2)2. Then each subsequent structure (14
i
200) was obtained from the previous one by short 0.1 ps molecular dynamics (MD) simulation (with temperature T = 15 (i 14) K) followed by 50-step conformational energy minimization. The standard TRIPOS force field from SYBYL (Clark et al., 1989
) in vacuo without electrostatic term was used both for the conformational energy minimization and MD simulations. The root-mean square deviations (r.m.s.d.) were calculated for C
atom positions after best fit superposition of structures.
To obtain the large set of alternative protein models close to the experimental ones, 196 proteins from the representative set were subjected to energy minimization using TRIPOS (30 and 100 steps) and CHARMM (30, 150 and 300 steps of minimization) force fields without the electrostatic energy terms. Thus for each of 196 proteins five alternative models were obtained. The small deviations of these alternative models from the experimental structures were characterized by r.m.s.d. calculated for all heavy atoms after best fit superposition of structures.
Estimation of the -score confidence interval
The confidence interval of the -score was estimated basing on its calculation for different `correct' models of the same protein, i.e. for the pairs of domains related by non-crystallographic symmetry (resolution better 2.5 Å), and for various models of the same protein in NMR-derived entries. The following X-ray set of domain pairs was used (the PDB code and chain identifiers of two compared chains are given): 1apx (A,B), 1bre (A,B), 1buc (A,B), 1deh (A,B), 1dpg (A,B), 1ebg (A,B), 1ebh (A,B), 1gse (A,B), 1ids (A,B), 1les (AB,CD), 1ndp (A,B), 1pvd (A,B), 1pyd (A,B), 1set (A,B), 1smn (A,B), 1tar (A,B), 1wgc (A,B), 2cst (A,B), 2nac (A,B), 2phi (A,B), 2wgc (A,B), 4mdh (A,B), 5p2p (A,B), 7wga (A,B), 8cat (A,B), 9wga (A,B). The set of NMR structures (all NMR entries in PDB searched by keywords `protein' or `toxin' without complexes of short peptides with DNA and entries without explicit protons) consisted of 277 entries (total 5665 protein models). The
-score confidence intervals were estimated as the three mean standard deviations of
-score for the pairs of domains (X-ray) or for different models (NMR) of the same protein.
Molecular dynamics simulations in solvent
MD simulations were performed on the NMR structure (Lubienski et al., 1994) of barstar (PDB entry 1btb) and the crystal structure (Housset et al., 1994
) of scorpion toxin II (PDB entry 1ptx, resolution 1.3 Å) using the program CHARMM (Brooks et al., 1983
). The computational protocols for the two proteins were the same. The proteins were solvated in the spheres of water molecules (50 Å in diameter, ~1800 water molecules). Solvent shells were pre-equilibrated by energy minimization with subsequent short (20 ps, 300 K) MD simulation with the protein atoms fixed. Then the entire systems were minimized, and the corresponded `minimized in solvent' protein structures were taken as the references. The systems were then heated to 300 K and equilibrated. The length of the subsequent MD run was 1600 ps for barstar and 600 ps for toxin II. Stochastic boundary conditions were applied for both systems. The structures were sampled every 1 ps, both for the heating stage and MD simulation. The details of MD simulations will be published elsewhere.
![]() |
Results and discussion |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
The similarity score characterizes intramolecular contacts in proteins with known three-dimensional structures. It favors both hydrophobic and hydrophilic contacts and penalizes those that are hydrophobic-to-hydrophilic. Each amino acid residue is subdivided into two segments (side-chain and backbone) and pairwise contacts between these segments are summed all over the protein. The type of contact (hydrophobic, hydrophilic or unfavorable) and its relative strength is calculated using the molecular hydrophobicity potential approach as described before (Golovanov et al., 1998). The definitions of various types of contacts are illustrated on Figure 1
. If both groups of atoms (segments of amino acid residues) are turned to each other by their hydrophobic sides (i.e., they create positive, hydrophobic potential in the geometrical center of each other) the contact is hydrophobic. If both groups of atoms are turned to each other by their hydrophilic sides (i.e., they create a negative, hydrophilic potential in the geometrical center of each other) the contact is hydrophilic. But if two groups are turned to each other by the sides of different polarity (i.e., they create the potentials of different signs in the geometrical center of each other) the contact is unfavorable. The `strength' of the individual contact is equal to the product MHPijMHPji, which is positive for both favorable hydrophobic and hydrophilic contacts, and negative for unfavorable hydrophobic-to-hydrophilic contacts. The similarity score is the sum of hydrophobic, hydrophilic and unfavorable contacts in the protein, and its value decreases as the favorable contacts decrease, or unfavorable contacts increase. Thus the expansion of the protein model leading to the loss of intramolecular contacts causes a decrease in the similarity score. The form of the similarity score permits one to consider separately the contributions from hydrophobic, hydrophilic and unfavorable interactions, or contributions from side-chainside-chain (SS, mainly hydrophobic interactions), backbonebackbone (BB, mainly hydrophilic interactions, related to secondary structure) and side-chainbackbone (SB) interactions. While the similarity score reflects the `overall' quantity and quality of contacts in the protein model, its constituents give additional information about particular types of interactions in this model.
|
Two sets of proteins were used to reveal the typical values of -score and its constituents for `correct' and `wrong' models: the representative set of 196 high-resolution protein structures and the set of 26 deliberately misfolded proteins, obtained by threading of amino acid sequence of one protein on the spatial scaffold of another one (see Materials and methods). The calculated values of unnormalized
-scores and its unnormalized constituents for two sets of proteins were roughly proportional to the number of amino acid residues (with the correlation coefficients >0.95, data not shown). Therefore for further analysis we used
-scores and its constituents (see Materials and methods) normalized by the number of residues in protein.
The values of -score and its constituents calculated for the representative and misfolded sets of protein models are presented in Table I
. It can be seen that the values of similarity scores
calculated for the misfolded set are generally less than for the representative set, which means that the similarity of packing for the native proteins is higher than that for the misfolded models. As the different terms contributing to the total
-score reflect different kinds of intramolecular interaction, it is interesting to compare the typical values of these parameters for the `normal' and misfolded proteins to understand the differences between them. The value SumBBphil contributes mostly to the total favorable hydrophilic interactions Sphil. This value generally should not depend on the specific amino acid sequence, but reflects favorable polar backbonebackbone interactions due to formation of secondary structure elements and main chain hydrogen bonds. The values SumBBphil and Sphil for the set of misfolded proteins are only slightly lower than for the representative set, but this result is expected as the misfolded proteins conserve the secondary structure of their parent models, although it is distorted. The value of favorable hydrophobic interactions between the side chains SumSSphob contributes to the total hydrophobic interactions Sphob most of all (Table I
). This value also has the strongest discriminating power between the two sets of proteins. For the misfolded set this value is small, which means that misfolded proteins have a deficit of favorable hydrophobic interactions. The contributions of unfavorable interactions do not differ much between these two sets.
|
|
|
![]() |
Estimation of confidence interval of ![]() |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
Analysis of distorted protein structures |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
The dependencies of r.m.s.d. versus structure number in each set of distorted structures (Figure 4A) are very smooth, and are very similar for different sets. Figure 4B
displays the dependencies of gyration radii on the structure number. It can be seen that on the initial stage of distortion (corresponding approximately to the first 20 structures which were obtained by energy minimization and MD at low temperatures) the radii of gyration do not increase, showing that the change of protein conformation is due to slight structural rearrangements, but not to the expansion of the protein.
|
|
It is also worth to mention that for some proteins in the test sets the first few steps of minimization made structures with even better -scores than for the experimental ones, while the corresponded r.m.s.d. values (for C
atoms) were negligibly small (less 0.1 Å). None of structures with r.m.s.d. larger than ca. 0.1 Å have better scores. This agrees with the idea that it is possible to find `more correct' structure (which will be closer to `ideal' than the experimental one) in the close vicinity to the experimental structure.
It is interesting to compare the performance of -score for the sets of distorted proteins with that of other commonly used scores which are able to recognize the native structure. Figure 6
illustrates the dependencies of S-score of 3D-1D profile method (Lüthy et. al., 1992), shown to be very useful in protein threading, on the r.m.s.d. of C
atoms from the experimental structures for the same sets of proteins. In the majority of cases the values of S-scores are not sensitive to small distortions (r.m.s.d. less than 2.5 Å) of protein structure (see Figure 6
). S-scores decrease considerably only with r.m.s.d. greater 3 Å, showing that the structures become `less correct'. These dependencies differ from those for
-scores, which are very sensitive to the small deviations from the experimental structure, and are rather insensitive to the deviations greater 5 Å (see Figure 5D
). This means that both types of scores work well in different r.m.s.d. ranges, and proposed
-score probably will be inappropriate for crude threading, but rather can be useful for the `fine tuning' of the structure in the close vicinity to the `correct' solution.
|
Another test was made with the representative set of high-resolution protein models, which were supposed to be very close to `correct'. For this set of structures several runs of unconstrained energy minimization in vacuo with different number of steps were made, using two different force fields (TRIPOS and CHARMM) without an electrostatic energy term. Each time the respective all-heavy-atom r.m.s.d. from the experimental structure was calculated, as well as the difference between similarity scores of minimized and experimental structures min
exp. The results of these calculations are presented on Figure 7
. Although in most cases when the minimization caused the increase of similarity score the difference
min
exp falls into the estimated confidence interval (calculated for this difference as ±34.2÷2 = ±49.2), we cannot exclude that some minimized structures are `more correct' than the experimental ones. For the majority of analysed models, the deviation from the experimental structure (heavy atom r.m.s.d. from 0 to 1.0 Å) caused the decrease of similarity score
. That means that, in the majority of cases, for the representative set of proteins unconstrained energy minimization in vacuo distorts the structures and leads to the noticeable loss of similarity of intramolecular packing: the overall number and strength of favorable hydrophobic and hydrophilic contacts decreased, and the number and strength of unfavorable hydrophobic-to-hydrophilic contacts increased. This finding cannot be attributed to the properties of the specific force field used for minimization, as two different force fields (TRIPOS and CHARMM) demonstrate similar results. The parameter
appeared to be very sensitive even to very small distortions (heavy atom r.m.s.d. less than 1 Å) of the structures relative to the experimental `correct' ones.
|
The ability of -score to discriminate the `correct' structures from decoys generated by MD at 300 K was demonstrated as previously proposed by Huang et al. (1996). We used MD simulations of NMR structure of barstar and X-ray structure of scorpion toxin II in water. To minimize the effect of the differences in computational protocols and force field parameters used for refinement of experimental structures and that used during current MD simulations we took as reference the experimental structures after additional energy minimization in solvent (see Materials and methods). These reference structures were assumed to be `native'. The dependencies of
-scores on the heavy atom r.m.s.d. from the reference structures for barstar (Figure 8A
) and scorpion toxin II (Figure 8B
) reveal that the reference structures have nearly the greatest
-scores, and only a few alternative models have slightly larger similarity score values. The similarity scores calculated for the structures after energy minimization in vacuo are considerably smaller, although the models are very similar (heavy atom r.m.s.d. ~1 Å). The alternative structures obtained during the heating stage and subsequent MD simulation reveal a clear correlation between r.m.s.d. and
-score (Figure 8
). This funnel-like distribution that has a wide dispersion of scores for conformations far from the correct structure and approaches a linear relationship between the score and r.m.s.d. as the structure approaches to correct one is very similar to that predicted earlier for `good' energy function (Park and Levitt, 1996
). Figures 5 and 7
reveal similar tendencies in distributions of
-scores versus r.m.s.d. for different test cases. All these data demonstrate the remarkable sensitivity of the similarity score to very small deviations of the structure from the correct one.
|
What can be wrong with the similarity score?
Although the proposed similarity score can easily penalize the expansion of the protein (directly, due to exponential dependencies on interresidue distances, and indirectly, due to mutual cancellation of hydrophobic and hydrophilic contributions of various atoms at larger distances), this score cannot penalize contraction of the protein. If the protein model is more compact than the normal one (e.g., obtained via calculation with smaller van der Waals radii), then the higher similarity score will not mean necessarily that this structure is better. Fortunately, such situations can be recognized by other available techniques (like estimation of energy using `standard' potentials). In most cases the protein models are obtained using molecular modelling (with or without experimental constrains) with the standard parameters for bond lengths, angles and van der Waals radii, and in the vast majority of cases the protein models are not too contracted (otherwise it would cause strong steric bumps). Using the -scores we implicitly assume that the protein structures were obtained by conventional modelling techniques, and thus the models are not too compact. As the similarity score is the `attracting' term, its incorporation into conventional force fields cannot be straightforward. The similarity score ignores the electrostatic interactions, effect of crystal packing and proteinsolvent contacts.
Another problem is the validation of -score performance for a greater number of examples. In the present work we limited our consideration to the high-resolution crystal structures, assuming that they are close to `ideal correct' structures. As far as the similarity score works close to the current limits of accuracy of experimental structures, additional studies of the influence of structure refinement protocols on the quality of spatial models are needed. Up to now, the most reasonable algorithm of structure refinement (energy minimization in solvent with the experimental restraints) is not widely used due to its complexity.
Concluding remarks
The assumption-based similarity score proposed in the current work is a measure of similarity of intramolecular packing and predominance of favorable contacts over unfavorable ones in the protein models. A model lacking favorable interactions will have a lower score. Unlike other approaches, the current approach takes into account both hydrophobic and hydrophilic contacts (including hydrogen bonds) in the protein in the same fashion and in the same terms, enabling simultaneously to favour formation of hydrophobic contacts and hydrogen bonds (and hence, secondary structure) and to penalize unfavorable hydrophobic-to-hydrophilic contacts (e.g., formation of unsatisfied hydrogen bond donors or acceptors). The
-score can successfully discriminate between the native and deliberately misfolded (i.e., with completely wrong chain topology), and between the native and distorted (i.e., with the similar chain topology) protein models. Proteins additionally stabilized by a large number of disulfide bridges, prosthetic groups and/or complexed ions have lower similarity score values than the proteins mainly stabilized by hydrophobic interactions. The remarkable feature of
-score is its sensitivity to the small distortions (heavy atom r.m.s.d. less than 1 Å) of the experimental ( `correct') structure.
It should be noted that the current -score is not based on the analysis of any statistical preferences, where some training set is used to obtain parameters, which are further statistically tested with other sets. On the contrary, this approach is based on a simple assumption (similar likes similar), and is tested by statistical analysis. The atomic hydrophobicities used as the quantitative criterion of `similarity' are derived from the experimental octanolwater partition coefficients, which indirectly include all the information about non-covalent interactions, including electrostatics, hydrophobic effect and the change of entropy.
The possible application of the proposed -score can be the quality assessment of the protein spatial models and recognition of the `correct' structure(s) among very close alternatives. As parameters Sphob, Sphil, Sunf and
-score are very sensitive measures of the intramolecular contacts, strength and similarity of these parameters can also be useful for characterization of models simulating protein folding/unfolding. The further direction of the work could be also including of
-score as the additional term in the conventional force fields for the energy minimization and molecular dynamics simulations.
![]() |
Acknowledgments |
---|
![]() |
Notes |
---|
![]() |
References |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Bernstein,F.C., Koetzle,T.F., Williams,G.J.B., Meyer,E.F.,Jr., Brice,M.D., Rodgers,J.R., Kennard,O., Shimanouchi,T. and Tasumi,M. (1977) J. Mol. Biol., 112, 535542.[ISI][Medline]
Brasseur,R. (1991) J. Biol. Chem., 266, 1612016127.
Brooks,B.R., Bruccoleri,R.E., Olafson,B.D., States,D.J., Swaminathan,S. and Karplus,M. (1983) J. Comput. Chem., 4, 187217.[ISI]
Caflisch,A. and Karplus,M. (1995) J. Mol. Biol., 252, 672708.[ISI][Medline]
Casari,G. and Sippl,M.J. (1992) J. Mol. Biol., 224, 725732.[ISI][Medline]
Clark,M., Cramer III,R.D. and Van Opdenbosch,N. (1989) J. Comput. Chem, 10, 9821012.[ISI]
Daggett,V. and Levitt,M. (1993) J. Mol. Biol., 232, 600619.[ISI][Medline]
Delarue,M. and Koehl,P. (1995) J. Mol. Biol., 249, 675690.[ISI][Medline]
Dill,K.A. (1990) Biochemistry, 29, 71337155.[ISI][Medline]
Efremov,R.G. and Alix,A.J.P. (1993) J. Biomol. Struct. Dyn., 11, 483507.[ISI][Medline]
Efremov,R.G., Gulyaev,D.I., Vergoten,G. and Modyanov,N.N (1992) J. Protein Chem., 11, 665675.[ISI][Medline]
Efremov,R.G., Golovanov,A.P., Vergoten,G., Alix,A.J.P., Tsetlin,V.I. and Arseniev,A.S. (1995) J. Biomol. Struct. Dyn., 12, 971991.[ISI][Medline]
Eisenberg,D. and McLachlan,A.D. (1986) Nature, 319, 199203.[ISI][Medline]
Fauchère,J.-L., Quarendon,P. and Kaetterer,L. (1988) J. Mol. Graphics, 6, 203206.[ISI]
Furet,P., Sele,A. and Cohen,N.C. (1988) J. Mol. Graphics, 6, 182189.[ISI]
Gaillard,P., Carrupt,P.-A., Testa,B. and Boudon,A. (1994) J. Comput.-Aided Mol. Des., 8, 8396.
Golovanov,A.P., Efremov,R.G., Jaravine,V.A., Vergoten,G. and Arseniev,A.S. (1995) FEBS Lett., 375, 162166.[ISI][Medline]
Golovanov,A.P., Efremov,R.G., Vergoten,G., Jaravine,V.A., Kirpichnikov,M.P. and Arseniev,A.S. (1998) J. Biomol. Struct. Dyn., 15, 673687.[ISI][Medline]
Hobohm,U. and Sander,C. (1994) Protein Sci., 3, 522524.
Holm,L. and Sander,C. (1992) J. Mol. Biol., 225, 93105.[ISI][Medline]
Housset,D., Habersetzer-Rochat,C., Astier,J.-P. and Fontecilla-Camps,J.C. (1994) J. Mol. Biol., 238, 88103.[ISI][Medline]
Huang,E.S., Subbiah,S. and Levitt,M. (1995) J. Mol. Biol., 252, 709720.[ISI][Medline]
Huang,E.S., Subbiah,S., Tsai,J. and Levitt,M. (1996) J. Mol. Biol., 257, 716725.[ISI][Medline]
Hünenberger,P.H., Mark,A.E. and van Gunsteren,W.F. (1995) Proteins Struct. Funct. Genet., 21, 196213.[ISI][Medline]
Kellogg,G.E., Semus,S.F. and Abraham,G.J. (1991) J. Comput.-Aided Mol. Des., 5, 545552.
Kurochkina,N. and Lee,B. (1995) Protein Engng, 8, 437442.[Abstract]
Lubienski,M.J., Bycroft,M., Freund,S.M.V. and Fersht,A.R. (1994) Biochemistry, 33, 88668877.[ISI][Medline]
Luthardt,G. and Frömmel,C. (1994) Protein Engng, 7, 627631.[Abstract]
Lüthy,R., Bowie,J.U. and Eisenberg,D. (1992) Nature, 356, 8385.[ISI][Medline]
Miyazawa,S. and Jernigan,R.L. (1996) J. Mol. Biol., 256, 623644.[ISI][Medline]
Novotny,J., Bruccoleri,R. and Karplus,M. (1984) J. Mol. Biol., 177, 787818.[ISI][Medline]
Novotny,J., Rashin,A.A. and Bruccoleri,R.E. (1988) Proteins Struct. Funct. Genet., 4, 1930.[ISI][Medline]
Park,B. and Levitt,M. (1996) J. Mol. Biol., 258, 367392.[ISI][Medline]
Park,B.H., Huang,E.S. and Levitt,M. (1997) J. Mol. Biol., 266, 831846.[ISI][Medline]
Ptitsyn,O.B. (1992) In Creighton,T.E. (ed.), Protein Folding. W.H.Freeman, San Francisco, pp. 243300
Sansom,M.S.P. and Kerr,I.D. (1993) Protein Engng, 6, 6574.[Abstract]
Tirado-Rives,J. and Jorgensen,W.L. (1993) Biochemistry, 32, 41754184.[ISI][Medline]
Viswanadhan,V.N., Ghose,A.K., Revankar,G.R. and Robins,R.K. (1989) J. Chem. Inf. Comput. Sci., 29, 163172.[ISI]
Wang,Y., Zhang,H., Li,W. and Scott,R.A. (1995a) Proc. Natl Acad. Sci. USA, 92, 709713.[Abstract]
Wang,Y., Zhang,H. and Scott,R.A. (1995b) Protein Sci., 4, 14021411.
Received April 28, 1998; revised June 22, 1998; accepted September 18, 1998.