Department of Biological Sciences, University of Delaware
![]() |
Abstract |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
Introduction |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Most of the protein sequence comparisons of mesophiles with thermophiles have examined a small number of proteins from a broad range of organisms (Argos et al. 1979
; Menéndez-Arias and Argos 1989
; Vogt, Woell, and Argos 1997
). Some consistent patterns have been evident, such as arginine being preferred over lysine at higher temperatures, but the broad taxonomic samples have made it difficult to know whether any general patterns of thermal adaptation are being obscured or exaggerated by taxon-specific asymmetries that may or may not be thermally adaptive. Recent whole-genome sequence projects have made it possible to examine a large number of protein sequences from a single pair of mesophilic versus thermophilic taxa. Haney et al. (1999)
compared 115 protein sequences from the mesophilic archaea Methanococcus maripaludis, Methanococcus vannielii, and Methanococcus voltae with the thermophilic Methanococcus jannaschii. They found 26 of the 190 pairs of amino acids to show significant (P < 0.01) asymmetry, suggesting that adaptation to temperature had an effect on a substantial proportion of amino acid substitutions. McDonald, Grasso, and Rejto (1999)
examined a similar data set from the same species of Methanococcus and compared the results with the patterns of asymmetry between the mesophilic bacterium Bacillus subtilis and the thermophilic Bacillus stearothermophilus. The Bacillus comparison displayed significant (P < 0.05) substitutional asymmetry at 54 pairs of amino acids, and most of the asymmetry was consistent in direction with that seen in the Methanococcus comparison. However, several pairs of amino acids showed patterns of asymmetry that were significantly different and opposite in direction in the Methanococcus and Bacillus data sets, suggesting that taxon-specific processes were indeed important. Most of these differences consisted of an amino acid with a more G+C-rich codon being favored in thermophilic Bacillus but less common in thermophilic Methanococcus, consistent with the higher genomic G+C content of B. stearothermophilus compared with B. subtilis. Because mesophilic and thermophilic Methanococcus differ little in genomic G+C content, McDonald, Grasso, and Rejto (1999)
suggested that the asymmetrical substitution patterns seen there gave a better indication of which amino acid substitutions were adaptive at different temperatures.
Even when mesophiles and thermophiles have the same genomic G+C content, it would be hasty to interpret all asymmetrical substitution patterns between them as evidence for thermal adaptation, because there are other processes besides changes in G+C content that could cause taxon-specific patterns of substitutional asymmetry. Amino acids vary in bioenergetic cost, and amino acids with lower costs presumably will be favored over functionally equivalent amino acids (Craig and Weber 1998
; Craig et al. 1999
). The relative bioenergetic costs of different amino acids may vary among species, depending on the availability for uptake of each amino acid in the environment, the biosynthetic pathways used to synthesize each amino acid, the abundance of raw materials for biosynthesis, and the effect of temperature and other environmental variables on the biosynthetic pathways. Environmental variables other than temperature, such as salinity, pH, and hydrostatic pressure, might also cause adaptive substitutional asymmetry that is unrelated to temperature.
Only patterns of substitutional asymmetry that are repeatedly observed in comparisons of mesophiles paired with related thermophiles will be robust evidence for thermal adaptation, while inconsistent patterns of asymmetry could have a variety of possible explanations. Here, I compared sequences from the mesophilic bacterium Deinococcus radiodurans, whose genome has been completely sequenced (White et al. 1999
), with Thermus thermophilus, a thermophilic bacterium that is related to Deinococcus (Hensel et al. 1986
; Weisburg, Giovannoni, and Woese 1989
). The resulting patterns were then compared with those observed earlier in comparisons of mesophilic and thermophilic Methanococcus and Bacillus to determine which asymmetries were consistent and which differed among pairs of species.
Little is known about the natural history of D. radiodurans (Murray 1992
). It can survive remarkable amounts of gamma radiation, which may be a byproduct of adaptation to desiccation resistance (Mattimore and Battista 1996
), and it can also withstand intense ultraviolet radiation (Minton 1994
) and desiccation (Sanders and Maxcy 1979
). Deinococcus radiodurans has a genomic G+C content of 66.6% (White et al. 1999
) and an optimal growth temperature of 2530°C (Murray 1992
). Thermus thermophilus lives in hot springs and artificial hot water environments. The type strain HB-8, which is used for most sequences, has an optimal growth temperature of 73°C (Williams and da Costa 1992
) and a G+C content of 64.7% (Manaia et al. 1994
). While T. thermophilus is sometimes considered a junior synonym of Thermus aquaticus (Degryse, Glansdorff, and Pierard 1978
), T. thermophilus and T. aquaticus have low similarity in genomic DNA : DNA hybridization (Manaia et al. 1994
; Williams et al. 1995
) and 16s sequences (Saul et al. 1993
), and T. thermophilus can grow in media containing 3% NaCl and has a higher maximum growth temperature than T. aquaticus (Manaia and da Costa 1991
). Here, I compared only T. thermophilus with D. radiodurans, because T. thermophilus has a slightly higher optimum growth temperature and has more sequences publicly available than does T. aquaticus.
![]() |
Materials and Methods |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Matching sequences were aligned using CLUSTAL W (Thompson, Higgins, and Gibson 1994
). Ambiguously aligned sites adjacent to gaps were omitted, with the omitted sites extending from the gap to the nearest pair of adjacent sites that were both identical in the two sequences. The total data set consisted of 49,337 aligned amino acid sites, of which 18,041 were different between the species. The number of aligned sites exhibiting each of the 190 possible pairwise patterns of difference was then counted. For each pair of amino acids, the significance of the deviation from the expected 50:50 ratio was tested using the log likelihood ratio test (G-test) with the Williams correction for continuity (Sokal and Rohlf 1981
); if the total number of sites was less than 50, Fisher's exact test was used. The ratio for D. radiodurans versus T. thermophilus was compared with the ratio for previously published Bacillus and Methanococcus comparisons (McDonald, Grasso, and Rejto 1999
) using a 2 x 2 contingency table test with the Williams correction (Sokal and Rohlf 1981
); if the total number of sites was less than 50 in either comparison, Fisher's exact test was used.
The information from the 190 pairwise comparisons was summarized into a single ranking of the amino acids from least preferred to most preferred at higher temperatures by assigning a thermal asymmetry index (TAI) reflecting the direction and magnitude of the asymmetries involving that amino acid (McDonald, Grasso, and Rejto 1999
). TAI values were assigned to minimize the difference between the predicted asymmetry for each pair of amino acids (a function of the difference in TAI values) and the observed asymmetry. Because only the difference in TAI values between amino acids was relevant, the TAI values were standardized so that the average value was 1.
To estimate the amount of divergence between pairs of taxa, 17 proteins were identified that were present in the Bacillus, Methanococcus, and Deinococcus/Thermus data sets. The six sequences for each protein were aligned using CLUSTAL W (Thompson, Higgins, and Gibson 1994
), and the sites containing gaps were eliminated. In those sites present in all six species, the proportion of identical sites was calculated for each mesophile-versus-thermophile pair.
![]() |
Results |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
|
|
|
|
|
|
![]() |
Discussion |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Difference in G+C Content
While there is no overall correlation of higher G+C content with higher habitat temperature across all prokaryotes (Galtier and Lobry 1997
), thermophiles exhibit higher G+C content than mesophiles within some taxa, such as Bacillus (Claus and Berkeley 1986
) and Methanobacterium (Whitman, Bowen, and Boone 1992
). It is not known whether higher G+C content reflects selection for greater DNA stability or a change in the mutation process (Mooers and Holmes 2000). Organisms with higher G+C content generally have greater abundances of those amino acids with G+C-rich codons (Lobry 1997
and references therein). In the comparison of the mesophile B. subtilis (G+C content 43.5%) with the thermophile B. stearothermophilus (G+C content 52%), the majority of significantly asymmetrical pairs of amino acids favor amino acids with codons with higher G+C content in B. stearothermophilus, while a very small number decrease the G+C content (McDonald, Grasso, and Rejto 1999
). Of the 15 pairs of amino acids that differ significantly in substitution ratio between Methanococcus and Bacillus and are in the opposite direction, all but two have an amino acid with higher G+C content being favored in thermophilic Bacillus and less common in thermophilic Methanococcus. This suggests that some of the asymmetrical substitution patterns observed in Bacillus are due to the differing G+C content of the amino acids' codons, not their biochemical properties. For the Methanococcus and Deinococcus/Thermus comparisons, where the mesophiles and the thermophiles are similar in G+C content, this should not play a major role in producing the asymmetry.
A more subtle way in which G+C content might affect patterns of adaptive asymmetry could result from the differences in the G+C content of the mesophile/thermophile pairs. Imagine a site with amino acid A in a mesophile where either amino acid B or amino acid C would have equally adaptive biochemical properties in a thermophile. If the codons for B have higher G+C content than the codons for C, the A-to-B substitution might be more common in a mesophile/thermophile pair with high G+C content (such as Deinococcus/Thermus), while the A-to-C substitution might be more common in a mesophile/thermophile pair with low G+C content (such as Methanococcus). It will be necessary to compare multiple mesophile/thermophile pairs with similar G+C contents to evaluate the importance of this effect.
Relaxed Constraint
Relaxation of selective constraint such that many sites were constrained to have one amino acid in the ancestral species but could have more than one adaptively equivalent amino acid in one descendant lineage could also lead to asymmetry. Amino acids that increased in frequency in the lineage where they were newly neutral would be difficult to distinguish from those that increased due to positive selection. Because some Deinococcus species (Ferreira et al. 1997
) and all known Thermus species are thermophilic, the common ancestor of the Deinococcus/Thermus group was probably a thermophile, as were the common ancestors of the Methanococcus spp. (Keswani et al. 1996
) and B. subtilis and B. stearothermophilus (Ochi 1994
). If there are many sites with less selective constraint at moderate temperatures than at high temperatures, similar patterns of asymmetry could have arisen in all three comparisons of mesophiles with thermophiles. Comparing a pair of species in which thermophily is the derived state would help to test this possibility.
Unequal Numbers of Mutations
The neutral model yielding symmetrical protein substitution assumes that the numbers of mutations (the number of generations times the mutation rate per generation) are equal on the lineages connecting a mesophile and a thermophile to their common ancestor (McDonald, Grasso, and Rejto 1999
). If there are more generations or more mutations per generation on one lineage than on the other, asymmetrical patterns of substitution may result. There is no evidence for the dramatic difference in substitution rate between mesophiles and thermophiles required for this process to yield asymmetrical substitution patterns. In addition, this process would not change the overall frequencies of the amino acids in proteins, provided that the frequencies of amino acids in the ancestral species were at mutation/drift equilibrium. The significant changes in frequency of several amino acids in the Deinococcus/Thermus (table 5
), Methanococcus, and Bacillus data sets (McDonald, Grasso, and Rejto 1999
) indicate that unequal mutation rates are not the sole cause of the asymmetry.
|
If the slow accumulation of neutral substitutions were causing different amounts of asymmetry in the different pairs of taxa, one would expect to see more asymmetry in the less diverged pairs of species. Mesophilic Methanococcus and M. jannaschii have about the same amount of divergence (65.7% identity for the 17 proteins analyzed) as D. radiodurans versus T. thermophilus (64.6% identity), so the amount of divergence is unlikely to be the cause of the different patterns of asymmetry in these two comparisons. Bacillus subtilis and B. stearothermophilus reveal considerably less divergence (79.6% identity) than the other mesophile-versus-thermophile comparisons. Of those pairs of amino acids with significantly different asymmetries between the Bacillus and Deinococcus/Thermus data sets that cannot be explained by differences in G+C content, most have less asymmetry in Bacillus than in Methanococcus or Deinococcus/Thermus (table 3 ). Thus, there is no evidence that reduced asymmetry due to slowly accumulating neutral substitutions is important in explaining the differences among the three data sets.
Different Sets of Proteins
It is possible that certain categories of proteins (membrane proteins, highly expressed proteins, ribosomal proteins, enzymes, etc.) exhibit different patterns of adaptive asymmetry from other categories of proteins. A mesophile/thermophile comparison that included mostly proteins from one category could then exhibit different patterns of asymmetry from a mesophile/thermophile comparison including mostly proteins from a different category. None of the three data sets compared here has an obvious preponderance of proteins from one functional category that was rare in one of the other data sets, so it seems unlikely that this is a major cause of the different patterns of asymmetry among the data sets. Detailed examination of the patterns of substitutional asymmetry among functional categories will require much larger data sets for any statistical power.
Selection Due to Environmental Factors Other than Temperature
The environments of mesophiles and thermophiles often differ in other environmental variables; if any of these variables favor some amino acids over others, they might also cause substitutional asymmetry. For example, the mesophilic Methanococcus were isolated from shallow and intertidal marine and estuarine sediments (Stadtman and Barker 1951
; Jones, Paynter, and Gupta 1983
), while the thermophilic M. jannaschii was isolated from a depth of 2,600 m (Jones et al. 1983
). Salinity, hydrostatic pressure, pH, and the abundance of different amino acids and their precursors in the environment are among the environmental variables that could cause selection favoring different amino acids in different environments. To identify those asymmetries caused by temperature differences, one ideally would want to compare two species whose environments are identical except for temperature. Realistically, it will be necessary to compare a large number of mesophile/thermophile pairs, so that temperature is the only environmental variable that consistently differs between them and could therefore explain any consistent asymmetries.
Selection Due to Bioenergetic Costs of the Amino Acids
In addition to biochemical properties and the G+C content of their codons, amino acids differ in their cost of uptake or synthesis, and if these bioenergetic costs vary among species, substitutional asymmetry could result (Craig and Weber 1998
; Craig et al. 1999
). For example, the mesophilic M. voltae lives in marine muds rich in organic material (Whitman et al. 1986
), assimilates all amino acids tested (Ekiel, Jarrell, and Sprott 1985
), and is heterotrophic for leucine and isoleucine (Whitman, Ankwanda, and Wolfe 1982
); presumably, it obtains much of its amino acids from its environment. The thermophile M. jannaschii is autotrophic and has limited ability to assimilate amino acids (Sprott, Ekiel, and Patel 1993
). At sites in protein sequences where two or more amino acids are functionally equivalent, the one that was most abundant in its environment presumably would be favored in M. voltae, while the one with the lowest cost of biosynthesis would be favored in M. jannaschii. This could appear to be temperature-related substitutional asymmetry, although the adaptation might not be caused by the temperature difference.
To summarize, the dramatic substitutional asymmetries observed between proteins from mesophiles and thermophiles are inconsistent with a simple neutral model of protein evolution, but they may not all be the result of temperature adaptation due to biochemical properties of the amino acids. The significant differences among taxa in amount and direction of asymmetry suggest that other processes, such as adaptation to environmental variables other than temperature, selection based on bioenergetic costs of amino acids, or (for taxa such as Bacillus) changes in G+C content, play an important role. As more data become available, those patterns of asymmetry that remain consistent across pairs of mesophiles/thermophiles will become more definitely related to temperature adaptation.
![]() |
Footnotes |
---|
1 Keywords: protein adaptation
thermophile
Deinococcus
Thermus.
2 Address for correspondence and reprints: John H. McDonald, Department of Biological Sciences, University of Delaware, Newark, Delaware 19716. mcdonald{at}udel.edu
![]() |
literature cited |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Altschul, S. F., T. L. Madden, A. A. Schaffer, J. H. Zhang, Z. Zhang, W. Miller, and D. J. Lipman. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:33893402.
Argos, P., M. G. Rossmann, U. M. Grau, H. Zuber, G. Frank, and J. D. Tratschin. 1979. Thermal stability and protein structure. Biochemistry 25:56985703.
Benjamini, Y., and Y. Hochberg. 1995. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B 57:289300.
Claus, D., and R. C. W. Berkeley. 1986. Genus Bacillus Cohn 1872. Pp. 11051139 in P. H. A. Sneath, ed. Bergey's manual of systematic bacteriology. Vol. 2. Williams and Wilkins, Baltimore, Md.
Craig, C. L, M. Hsu, D. Kaplan, and N. E. Pierce. 1999. A comparison of the composition of silk proteins produced by spiders and insects. Int. J. Biol. Macromol. 24:109118.[ISI][Medline]
Craig, C. L., and R. S. Weber. 1998. Selection costs of amino acid substitutions in ColE1 and ColIa gene clusters harbored by Escherichia coli. Mol. Biol. Evol. 15:774776.
Degryse, E., N. Glansdorff, and A. Pierard. 1978. A comparative analysis of extreme thermophilic bacteria belonging to the genus Thermus. Arch. Microbiol. 117:189196.
Ekiel, I., K. F. Jarrell, and G. D. Sprott. 1985. Amino acid biosynthesis and sodium-dependent transport in Methanococcus voltae, as revealed by 13C NMR. Eur. J. Biochem. 149:437444.[Abstract]
Ferreira, A. C., M. F. Nobre, F. A. Rainey, M. T. Silva, R. Wait, J. Burghardt, A. P. Chung, and M. S. da Costa. 1997. Deinococcus geothermalis sp. nov. and Deinococcus murrayi sp. nov., two extremely radiation-resistant and slightly thermophilic species from hot springs. Int. J. Syst. Bacteriol. 47:939947.
Galtier, N., and J. R. Lobry. 1997. Relationships between genomic G+C content, RNA secondary structures, and optimal growth temperature in prokaryotes. J. Mol. Evol. 44:632636.[ISI][Medline]
Haney, P. J., J. H. Badger, G. L. Buldak, C. I. Reich, C. R. Woese, and G. J. Olsen. 1999. Thermal adaptation analyzed by comparison of protein sequences from mesophilic and extremely thermophilic Methanococcus species. Proc. Natl. Acad. Sci. USA 96:35783583.
Hensel, R., W. Demharter, O. Kandler, R. M. Kroppenstedt, and E. Stackebrandt. 1986. Chemotaxonomic and molecular-genetic studies of the genus Thermus: evidence for a phylogenetic relationship of Thermus aquaticus and Thermus ruber to the genus Deinococcus. Int. J. Syst. Bacteriol. 36:444453.[ISI]
Jones, W. J., J. A. Leigh, F. Mayer, C. R. Woese, and R. S. Wolfe. 1983. Methanococcus jannaschii sp. nov., an extremely thermophilic methanogen from a submarine hydrothermal vent. Arch. Microbiol. 136:254261.
Jones, W. J., M. J. B. Paynter, and R. Gupta. 1983. Characterization of Methanococcus maripaludis sp. nov., a new methanogen isolated from salt marsh sediment. Arch. Microbiol. 135:9197.
Keswani, J., S. Orkand, U. Premachandran, L. Mandelco, M. J. Franklin, and W. B. Whitman. 1996. Phylogeny and taxonomy of mesophilic Methanococcus spp. and comparison of rRNA, DNA hybridization, and phenotypic methods. Int. J. Syst. Bacteriol. 46:727735.[Abstract]
Lobry, J. R. 1997. Influence of genomic G+C content on average amino acid composition of proteins from 59 bacterial species. Gene 205:309316.
McDonald, J. H., A. M. Grasso, and L. K. Rejto. 1999. Patterns of temperature adaptation in proteins from Methanococcus and Bacillus. Mol. Biol. Evol. 16:17851790.
Manaia, C. M., and M. S. da Costa. 1991. Characterization of halotolerant Thermus isolates from shallow marine hot springs on S. Miguel, Azores. J. Gen. Microbiol. 137:26432648.
Manaia, C. M., B. Hoste, M. C. Gutierrez, M. Gillis, A. Ventosa, K. Kersters, and M. S. da Costa. 1994. Halotolerant Thermus strains from marine and terrestrial hot springs belong to Thermus thermophilus (ex Oshima and Imahori, 1974) nom. rev. emend. Syst. Appl. Microbiol. 17:526532.
Mattimore, V., and J. R. Battista. 1996. Radioresistance of Deinococcus radiodurans: functions necessary to survive ionizing radiation are also necessary to survive prolonged desiccation. J. Bacteriol. 178:633637.[Abstract]
Menéndez-Arias, L., and P. Argos. 1989. Engineering protein thermal stability: sequence statistics point to residue substitutions in alpha helices. J. Mol. Biol. 206:397405.[ISI][Medline]
Minton, K. W. 1994. DNA repair in the extremely radioresistant bacterium Deinococcus radiodurans. Mol. Microbiol. 13:915.
Mooers, A. Ø., and E. C. Holmes. 2000. The evolution of base composition and phylogenetic inference. Trends Ecol. Evol. 15:365369.[ISI][Medline]
Murray, R. G. E. 1992. The family Deinococcaceae. Pp. 37323744 in A. Balows, H. G. Trüper, M. Dworkin, W. Harder, and K.-H. Schleifer, eds. The prokaryotes. Springer-Verlag, New York.
Ochi, K. 1994. Phylogenetic diversity in the genus Bacillus and comparative ribosomal protein AT-L30 analyses of the genus Thermoactinomyces and relatives. Microbiology 140:21652171.
Sanders, S. W., and R. B. Maxcy. 1979. Isolation of radiation-resistant bacteria without exposure to irradiation. Appl. Environ. Microbiol. 38:436439.[ISI][Medline]
Saul, D. J., A. G. Rodrigo, R. A. Reeves, L. C. Williams, K. M. Borges, H. W. Morgan, and P. L. Bergquist. 1993. Phylogeny of 20 Thermus isolates constructed from 16s ribosomal RNA gene sequence data. Int. J. Syst. Bacteriol. 43:754760.[Abstract]
Sokal, R. R., and F. J. Rohlf. 1981. Biometry. 2nd edition. W. H. Freeman, San Francisco.
Sprott, G. D., I. Ekiel, and G. B. Patel. 1993. Metabolic pathways in Methanococcus jannaschii and other methanogenic bacteria. Appl. Environ. Microbiol. 59:10921098.[Abstract]
Stadtman, T. C., and H. A. Barker. 1951. Studies on the methane fermentation. X. A new formate-decomposing bacterium, Methanococcus vannielii. J. Bacteriol. 62:269280.
Thompson, J. D., D. G. Higgins, and T. J. Gibson. 1994. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:46734680.[Abstract]
Vogt, G., S. Woell, and P. Argos. 1997. Protein thermal stability, hydrogen bonds, and ion pairs. J. Mol. Biol. 269:631643.[ISI][Medline]
Weisburg, W. G., S. J. Giovannoni, and C. R. Woese. 1989. The Deinococcus-Thermus phylum and the effect of rRNA composition on phylogenetic tree construction. Syst. Appl. Microbiol. 11:128134.[ISI][Medline]
White, O., J. A. Eisen, J. F. Heidelberg et al. (29 co-authors). 1999. Genome sequence of the radioresistant bacterium Deinococcus radiodurans R1. Science 286:15711577.
Whitman, W. B., E. Ankwanda, and R. S. Wolfe. 1982. Nutrition and carbon metabolism of Methanococcus voltae. J. Bacteriol. 149:852863.[ISI][Medline]
Whitman, W. B., T. L. Bowen, and D. R. Boone. 1992. The methanogenic bacteria. Pp. 719767 in A. Balows, H. G. Trüper, M. Dworkin, W. Harder, and K.-H. Schleifer, eds. The prokaryotes. Springer-Verlag, New York.
Whitman, W. B., J. Shieh, S. Sohn, D. S. Caras, and U. Premachandran. 1986. Isolation and characterization of 22 mesophilic methanococci. Syst. Appl. Microbiol. 7:235240.[ISI]
Williams, R. A. D., and M. S. da Costa. 1992. The genus Thermus and related microorganisms. Pp. 37453753 in A. Balows, H. G. Trüper, M. Dworkin, W. Harder, and K.-H. Schleifer, eds. The prokaryotes. Springer-Verlag, New York.
Williams, R. A. D., K. E. Smith, S. G. Welch, J. Micallef, and R. V. Sharp. 1995. DNA relatedness of Thermus strains, description of Thermus brockianus sp. nov., and proposal to reestablish Thermus thermophilus (Oshima and Imahori). Int. J. Syst. Bacteriol. 45:495499.[Abstract]