Department of Zoology, Brigham Young University, Provo, Utah
Institute of Statistical Mathematics, Tokyo, Japan, and Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Japan
Institute of Arctic Biology, and Department of Biology and Wildlife, University of Alaska, Fairbanks, Alaska
![]() |
Abstract |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
Introduction |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Past studies of the molecular evolution of cyt-b indicate that the functional domains evolve at different rates of amino acid replacement (e.g., Irwin, Kocher, and Wilson 1991
; DeWalt et al. 1993
; Griffiths 1997
). The common hypothesis for this phenomenon is that the selection responsible for maintaining the Q-cycle mechanism greatly influences rates of amino acid replacement. However, this hypothesis has yet to be critically evaluated in terms of evolutionary changes in the physicochemical properties that determine the phenotype of the protein. If the functional properties of the domains are being maintained by selection, very few amino acid replacements would be expected in those regions with the most residues implicated in the Q-cycle mechanism.
There are many physicochemical properties associated with amino acid residues (e.g., Sneath 1966
; Woese et al. 1966
; Alff-Steinberger 1969
; Grantham 1974
; Kyte and Doolittle 1982
). Several of these have been shown to correlate with the evolution of the genetic code, the amino acid composition of polypeptides, and rates of amino acid replacement (e.g., Grantham 1974
; Hughes, Ota, and Nei 1990
; Haig and Hurst 1991
; Xia and Li 1998
). As a result, similar codons generally specify amino acids with similar physicochemical properties (Sonneborn 1965
; Epstein 1966
; Goldberg and Wittes 1966
; Alff-Steinberger 1969
). Furthermore, these properties contribute to a vast array of structural and functional characteristics of proteins that can be influenced directly by natural selection.
We evaluated the physicochemical changes that result from the molecular evolution of the cyt-b functional domains in pocket gophers and cetartiodactyls. Six amino acid properties shown to correlate with rates of amino acid replacement (Xia and Li 1998
) were considered: composition of the side chain, polarity, and molecular volume (Grantham 1974
), as well as polar requirement (Woese et al. 1966
), hydropathy (Kyte and Doolittle 1982
), and isoelectric point (Alff-Steinberger 1969
). To evaluate the selective influences affecting the molecular evolution of cyt-b, we first calculated the goodness of fit between an observed distribution of physicochemical changes and an expected distribution based on codon composition (briefly outlined in McCracken et al. 1999
) to measure how well the data fit neutral expectations. Next, we compared the inferred numbers of conservative, moderate, radical, and very radical physicochemical changes relative to the number of possible evolutionary pathways using a normal distribution to obtain relative measures (z-scores) of the selective influence on each amino acid property.
The goodness-of-fit score (GF-score) and the set of three z-scores can be interpreted as a measure of the selective influences on each gene region, with the magnitude of the GF-score indicative of the intensity of selection and the z-scores indicative of the direction of selection. For example, an observed distribution with a GF-score > 7.815 (df = 3, = 0.05) and a z-score > 1.645 (
= 0.05) between the conservative and moderate magnitude classes with the conservative class being greater can be said to have experienced negative selection that favors conservative change. Alternatively, an observed distribution with a GF-score > 7.815 and a z-score > 1.645 between radical and very radical magnitude classes with the very radical class being greater can be said to have experienced positive selection that favors very radical change at those sites of amino acid replacement.
![]() |
Materials and Methods |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
|
Substitution Analysis
Ancestral character state reconstructions were optimized using the GTR maximum-likelihood model, with the same parameter estimates used to construct the tree. The sequence at each branch terminus on the tree was compared with its immediate ancestral sequence to estimate the number and types of substitutions (similar to Li 1993
; see also DeWalt et al. 1993
; Xia, Hafner, and Sudman 1996
; Xia 1998
; McClellan 2000
).
Information pertaining to each inferred nonsynonymous substitution was recorded, including codon position, type of base exchange (transition or transversion), and exact location in the protein. Substitutions were categorized further by the functional domain in which they occurred (Degli Esposti et al. 1993
; Zhang et al. 1998
). The magnitudes of physicochemical changes in composition, polarity, and molecular volume (Grantham 1974
), as well as polar requirement (Woese et al. 1966
), hydropathy (Kyte and Doolittle 1982
), and isoelectric point (Alff-Steinberger 1969
), resulting from each inferred nonsynonymous substitution were also recorded.
Expected Distribution of Physicochemical Changes
Distributions of changes in physicochemical properties of inferred amino acid replacements were compared with distributions calculated based on the assumption of completely random amino acid replacement (McCracken et al. 1999
; Xia 2000
) expected under the condition of selective neutrality. Expected frequencies per physicochemical property magnitude class were calculated by
|
The calculation of this expected distribution differs from that presented in McCracken et al. (1999)
in that the magnitude class frequencies are not transformed by the nonsynonymous substitution frequencies expected under the codon degeneracy model (McClellan 2000
). This is a null model of amino acid replacement and, as it applies to the present study, assumes completely random replacement. Thus, modification of the expected distribution of physicochemical changes with regard to patterns of codon degeneracy and an inferred transition bias is unnecessary.
The distributions of physicochemical changes inferred from the phylogenetic analysis of the pocket gopher cyt-b DNA sequences were compared with the expected distribution calculated with equation (1) using a 2 GF test of four magnitude categories: conservative, moderate, radical, and very radical. These categories are similar to those used by Wyckoff, Wang, and Wu (2000
). Good fit (P > 0.05) between expected and observed distributions indicates that observed evolution of the DNA sequences is consistent with the assumption of selective neutrality. This is not to say, however, that the evolution of the DNA sequences is not influenced by selection. Good fit means only that the preponderance of sites experiencing amino acid replacements are accumulating changes as predicted by the model.
Detecting the Causes of Deviation from Expectations
The method developed by Hughes, Ota, and Nei (1990)
extends Nei and Gojobori's (1986)
method of estimating rates of synonymous and nonsynonymous nucleotide substitution. Nei and Gojobori (1986)
categorized each nucleotide site as either synonymous, nonsynonymous, or a fraction thereof. Hughes, Ota, and Nei (1990)
took this one step further by dividing each nonsynonymous site (or fractional nonsynonymous site) into either conservative, radical, or a fraction thereof based on the extent to which the possible evolutionary pathways result in either conservative or radical changes relative to a certain amino acid property. The number of inferred conservative substitutions is then divided by the mean number of conservative sites, and the number of inferred radical substitutions is divided by the mean number of radical sites. If these two ratios are equal, the amino acid replacements are said to take place at random relative to that particular property. If the ratio for more conservative replacements is greater than the ratio for more radical replacements, however, the property is being conserved, as would be expected under conditions of purifying selection. If the conservative ratio is less than the radical ratio, the replacements are said to promote radical changes, as would be expected under conditions of positive selection (Hughes, Ota, and Nei 1990
).
One questionable aspect of this line of thought is the fractional assignment of a single nucleotide site as part conservative and part radical. We prefer to consider potential evolution by the number of possible nonsynonymous evolutionary pathways available to a given codon (Xia 1998
; Xia and Li 1998
). Another questionable aspect of the Hughes, Ota, and Nei (1990)
model is the consideration of just two categories of amino acid replacement (conservative and radical) based on the qualitative properties described by Taylor (1986)
. We prefer to use four categories of replacement with respect to quantifiable amino acid properties (as in Wyckoff, Wang, and Wu 2000
). Considering four categories (conservative, p1; moderate, p2; radical, p3; and very radical, p4) allows a more detailed analysis of more dynamic evolutionary trends. Each ratio is calculated by
|
|
![]() |
Results and Discussion |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
The matrix domain, which is located entirely within the inner surface of the inner mitochondrial membrane, is composed of 65 amino acid residues and has a relatively high proportion of polar and basic amino acid residues (Griffiths 1997
). There are relatively few residues in the matrix domain that are conserved or have been implicated in the proton-input function of a Qi redox center (Degli Esposti et al. 1993
). Most regions of the matrix domain have no known function (Irwin, Kocher, and Wilson 1991
) or conserved properties in metazoans (Degli Esposti et al. 1993
), and thus variable sites within this domain are expected to evolve in a nearly neutral manner relative to the other two functional domains.
The transmembrane domain, which consists of that portion of the cyt-b protein that transverses the inner mitochondrial membrane, is composed of 209 amino acid residues in mammals (Zhang et al. 1998
) and is characterized by its hydrophobic properties (Irwin, Kocher, and Wilson 1991
; Griffiths 1997
). Furthermore, most of the observed amino acid replacements in the transmembrane region are among the hydrophobic amino acids leucine, isoleucine, and valine (Irwin, Kocher, and Wilson 1991
; Kornegay et al. 1993
). In addition, about 19% of the residues that compose the transmembrane domain are completely or nearly conserved in the vast majority of metazoans (Degli Esposti et al. 1993
). Many of these residues have been implicated in the function of heme ligation, redox activity, or structural stability.
Summary of Cyt-b Data
Rates of synonymous substitution, rsyn, did not differ significantly between domains (table 1
), with the exception being the matrix domain in cetartiodactyls. Rates of nonsynonymous substitution, rns, however, did differ significantly. Both the matrix and the transmembrane domains showed amino acid replacements significantly more frequently than the intermembrane domain in both geomyids and cetartiodactyls. The proportion of invariable amino acid sites, pinvar, corresponded with the pattern of rates of nonsynonymous substitution among the functional domains. The intermembrane domain exhibited a greater proportion of invariable sites than either the matrix or the transmembrane domain in both geomyids and cetartiodactyls, indicating that purifying selection is much greater in the intermembrane than in the other two domains. The number of nonsynonymous substitutions per variable site, however, did not appear to be greatly affected by selection in either group. These results suggest that variable amino acid sites in the intermembrane domain are influenced by much less selective constraint than the invariable sites, which experience an extreme form of purifying selection.
|
Evolution Relative to Individual Amino Acid Properties
With regard to changes in physicochemical properties, the evolution of cyt-b in the two mammalian groups was similar (figs. 2 and 3 ). The matrix domain exhibited a good fit to expected magnitude distributions (P > 0.05) in relation to every property considered. The transmembrane domain, however, exhibited a poor fit (P < 0.05) to every expected distribution. Intermembrane domain evolution was not nearly as conservative across taxonomic groups. Pocket gophers exhibited a good fit to expected distributions for every amino acid property, whereas cetartiodactyls exhibited a good fit for polarity and isoelectric point, but not composition of the side chain, molecular volume, polar requirement, or hydropathy. The good fit of the pocket gopher data may be somewhat artificial, being the result of the extremely low rate of nonsynonymous substitution in the intermembrane domain (table 1
).
|
The relatively high rates of nonsynonymous substitution in the transmembrane domain (table 1 ) are not necessarily indicative of low levels of selection. The poor fit of the resultant amino acid replacements to neutral expectations supports this assertion, especially when compared with the GF scores calculated for the matrix domain, which exhibits similar rates of substitution. Poor GF scores with high rates of substitution in the transmembrane domain probably are due to the effect of codon composition.
Each codon possesses an inherent suite of possible amino acid changes precluding all others. As a result, some codons are inherently conservative by nature, whereas others are inherently radical. Codons that encode phenylalanine, leucine, isoleucine, methionine, and valine (codons with thymine at the second position) are examples of inherently conservative amino acids. Of the 104 possible evolutionary pathways for these 16 codons, only 12 result in nonconservative (i.e., moderate, radical, or very radical) changes with regard to composition of the side chain, 19 with regard to polarity, 16 with regard to molecular volume, 12 with regard to polar requirement, 26 with regard to hydropathy, and 5 with regard to isoelectric point. Collectively, these codons make up more than 52% of the transmembrane domain in both pocket gophers and cetartiodactyls. Phenylalanine, leucine, isoleucine, methionine, and valine are each hydrophobic and thus act to anchor the transmembrane domain into position through the membrane. The codons for these amino acids were the targets of a disproportionate number of nonsynonymous substitutions (72.5%) in the transmembrane domain, not because there are necessarily more mutations at these sites, but most probably because mutations at these sites are more likely to result in conservative changes, which in turn are more likely to become fixed. The correspondence between the magnitude of changes in amino acid properties and the general rate of amino acid replacement noted in the previous section may be the result of the genetic code playing a role that has been called "a guardrail set near the sharp ridge of the protein landscape" (Aita, Urata, and Husimi 2000). The parameters of the genetic code may constrain evolution in some regions, yet promote high, nearly neutral rates of evolution in others, and in both cases the effects of negative selection tend to amplify the results, in this case maintaining a rate of nonsynonymous substitution in the transmembrane domain that is comparable to the nonsynonymous rate of the matrix domain, which is affected by selection relatively very little (figs. 2 and 3 ).
As stated above, the pocket gopher intermembrane domain exhibited good fit to neutral expectations; the only possible exception was isoelectric point (fig. 2
). Just one of the 17 amino acid replacements in this domain was "very radical," whereas the rest were conservative. Although these numbers were insufficient to constitute a lack of global fit, the z-scores comparing magnitude classes indicated a significant difference between conservative and moderate categories and between radical and very radical categories. This may be interpreted as a state of general negative selection, with enough positive selection for the replacement glutamine (Q) arginine (R) at amino acid position 137 to become fixed in a single taxon (Geomys bursarius) contrary to neutral expectations and conserved cyt-b primary structure (Degli Esposti et al. 1993
).
The evolution of the cetartiodactyl intermembrane domain exhibited good fit for polarity and isoelectric point, although the latter appeared somewhat constrained. Composition, molecular volume, polar requirement, and hydropathy exhibited poor fits to expected distributions, indicating that there have been other influences on the evolution of these properties. Comparisons of observed magnitude classes suggested that conservative changes in molecular volume, as well as moderate changes in both composition and hydropathy, have been promoted by selection in this functional domain (fig. 3
). Analysis of inferred changes in polar requirement suggested that not only has there been negative selection promoting conservative changes, but there has also been enough positive selection to promote the same "very radical" amino acid replacement, aspartic acid (D) threonine (T) at amino acid site 159, in two separate lineages (Camelus bactrianus and, most notably, all cetaceans). Although this amino acid residue has not been implicated in the Q-cycle mechanism (Degli Esposti et al. 1993
), this change represents a large local decrease in polar requirement and may be of substantial evolutionary importance. The location of this residue is near the amino-terminal end in the second helix of the large cd-loop, which interacts directly with the other external surface domains of the bc1 complex. The effect of such a replacement has yet to be determined. However, a T at this position is a common characteristic shared by many vertebrates, whereas D is shared by relatively few (Degli Esposti et al. 1993
).
|
![]() |
Conclusions |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
Acknowledgements |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
Footnotes |
---|
1 Keywords: codon composition
physicochemical properties
amino acid replacement
cytochrome b functional domains
Geomyidae
Cetartiodactyla
2 Address for correspondence and reprints: David A. McClellan, Department of Zoology, 574 WIDB, Brigham Young University, Provo, Utah 84602. david_mcclellan{at}byu.edu
![]() |
literature cited |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Aita, T., S. Urata, and Y. Husimi. 2000. From amino acid landscape to protein landscape: analysis of genetic codes in terms of fitness landscape. J. Mol. Evol. 50:313323[ISI][Medline]
Alff-Steinberger, C.. 1969. The genetic code and error transmission. Proc. Natl. Acad. Sci. USA. 64:584591[Abstract]
Degli Esposti, M., S. De Vries, M. Crimi, A. Ghelli, T. Paternello, and A. Meyer. 1993. Mitochondrial cytochrome b: evolution and structure of the protein. Biochem. Biophys. Acta. 1143:243271[ISI][Medline]
DeWalt, T. S., P. D. Sudman, M. S. Hafner, and S. K. Davis. 1993. Phylogenetic relationships of pocket gophers (Cratogeomys and Pappogeomys) based on mitochondrial DNA cytochrome b sequences. Mol. Phylogenet. Evol. 2:193204[Medline]
Epstein, C. J.. 1966. Role of the amino-acid code and of selection for conformation in the evolution of proteins. Nature. 210:2528[ISI][Medline]
Goldberg, A. L., and R. E. Wittes. 1966. Genetic code: aspects of organization. Science. 153:420424[ISI][Medline]
Grantham, R.. 1974. Amino acid difference formula to help explain protein evolution. Science. 185:862864[ISI][Medline]
Graybeal, A.. 1993. The phylogenetic utility of cytochrome b: lessons from bufonid frogs. Mol. Phylogenet. Evol. 2:256269[Medline]
Griffiths, C. S.. 1997. Correlation of functional domains and rates of nucleotide substitution in cytochrome b.. Mol. Phylogenet. Evol. 7:352365[ISI][Medline]
Hafner, M. S., P. D. Sudman, F. X. Villablanca, T. A. Spradling, J. W. Demastes, and S. A. Nadler. 1994. Disparate rates of molecular evolution in cospeciating hosts and parasites. Science. 265:10871090[ISI][Medline]
Haig, D., and L. D. Hurst. 1991. A quantitative measure of error minimization in the genetic code. J. Mol. Evol. 33:412417[ISI][Medline]
Hughes, A. L., T. Ota, and M. Nei. 1990. Positive Darwinian selection promotes charge profile diversity in the antigen-binding cleft of class I major-histocompatibility-complex molecules. Mol. Biol. Evol. 7:515524[Abstract]
Irwin, D. M., T. D. Kocher, and A. C. Wilson. 1991. Evolution of the cytochrome b gene of mammals. J. Mol. Evol. 32:128144[ISI][Medline]
Kornegay, J. R., T. D. Kocher, L. A. Williams, and A. C. Wilson. 1993. Pathways of lysozyme evolution inferred from the sequences of cytochrome b in birds. J. Mol. Evol. 37:367379[ISI][Medline]
Kyte, J., and R. F. Doolittle. 1982. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157:105132[ISI][Medline]
Li, W.-H.. 1993. Unbiased estimation of the rates of synonymous and nonsynonymous substitution. J. Mol. Evol. 36:9699[ISI][Medline]
McClellan, D. A.. 2000. The codon-degeneracy model of molecular evolution. J. Mol. Evol. 50:131140[ISI][Medline]
McCracken, K. G., J. Harshman, D. A. McClellan, and A. D. Afton. 1999. Data set incongruence and correlated character evolution: an example of functional convergence in the hind-limbs of stifftail diving ducks. Syst. Biol. 48:683714[ISI][Medline]
Nei, M., T. Gojobori. 1986. Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol. Biol. Evol. 3:418426[Abstract]
Nikaido, M., A. P. Rooney, and N. Okada. 1999. Phylogenetic relationships among cetartiodactyls based on insertions of short and long interspersed elements: hippopotamuses are the closest extant relatives of whales. Proc. Natl. Acad. Sci. USA. 96:1026110266
Shedlock, A. M., M. C. Milinkovitch, and N. Okada. 2000. SINE evolution, missing data, and the origin of whales. Syst. Biol. 49:110[ISI]
Shedlock, A. M., and N. Okada. 2000. SINE insertions: powerful tools for molecular systematics. BioEssays. 22:148160[ISI][Medline]
Sneath, P. H. A.. 1966. Relations between chemical structure and biological activity. J. Theor. Biol. 12:157195[ISI][Medline]
Sonneborn, T. M.. 1965. Degeneracy of the genetic code: extent, nature, and genetic implicationsPp. 377397 in V. Bryson and H. J. Vogel, eds. Evolving genes and proteins. Academic Press, New York
Swofford, D. L.. 1998. PAUP*Phylogenetic analysis using parsimony (*and other methods). Version 4. Sinauer, Sunderland, Mass
Taylor, W. R.. 1986. The classification of amino acid conservation. J. Theor. Biol. 119:205218[ISI][Medline]
Woese, C. R., D. H. Dugre, S. A. Dugre, M. Kondo, and W. C. Saxinger. 1966. On the fundamental nature and evolution of the genetic code. Cold Spring Harb. Symp. Quant. Biol. 31:723736[ISI][Medline]
Wyckoff, G. J., W. Wang, and C.-I. Wu. 2000. Rapid evolution of male reproductive genes in the descent of man. Nature. 403:304308[ISI][Medline]
Xia, X.. 1998. The rate heterogeneity of nonsynonymous substitutions in mammalian mitochondrial genes. Mol. Biol. Evol. 15:336344[Abstract]
.. 2000. Phylogenetic relationships among horseshoe crab species: effect of substitution models on phylogenetic analysis. Syst. Biol. 49:87100[ISI][Medline]
Xia, X., M. S. Hafner, and P. D. Sudman. 1996. On transition bias in mitochondrial genes of pocket gophers. J. Mol. Evol. 43:3240[ISI][Medline]
Xia, X., and W.-H. Li. 1998. What amino acid properties affect protein evolution?. J. Mol. Evol. 47:557564[ISI][Medline]
Zhang, Z., L. Huang, V. M. Shulmeister, Y.-I. Chi, K. K. Kim, L.-W. Hung, A. R. Crofts, E. A. Berry, and S.-H. Kim. 1998. Electron transfer by domain movement in cytochrome bc1. Nature. 392:677684[ISI][Medline]