*Departamento de Genética, Universidad de Sevilla, Spain;
Instituto de Biología y Genética Molecular, Universidad de Valladolid, Spain
![]() |
Abstract |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
Introduction |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Studies about modern human genetic diversity (Foley 1998
; Jorde et al. 2000
) assume that Neandertals were not related to modern humans, although such assumptions have been extensively debated (Nordborg 1998
; Hawks and Wolpoff 2001
; Relethford 2001
).
Five mitochondrial DNA D-loops have been recovered from Neandertal bones (Krings et al. 1997
; Krings et al. 1999
; Krings et al. 2000
; Ovchinnikov et al. 2000
). Henceforth, we will refer to these research groups as the Neandertal sequencing groups (NSGs). The phylogenetic analyses of these sequences located Neandertal DNA at the base of modern human diversity (fig. 2C
), suggesting that the Neandertal genes probably did not contribute to the modern human genetic pool.
|
|
![]() |
Data and Methods |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
We carried out two analyses: the first analysis concerns the HVI region and includes three Neandertal sequences. The second analysis concerns the HVI plus HVII regions including two Neandertal sequences. The HVII region of the Mezmaiskaya specimen has not been recovered.
In the first analysis (HVI region) we included human entries containing at least positions 16056 to 16378 of the Anderson et al. (1981)
reference sequence (fig. 1
). This segment covers the minimal region common to the three Neandertal entries. Human sequences, duplicated or ambiguous, were deleted. This selection rendered 1,905 human sequences (328 Africans, 471 Asians, 211 Australians-Oceanians, 475 Europeans, and 420 Americans). Three Pan paniscus and three Pan troglodytes sequences were added as outgroup. Also, we discarded alignment columns where one sequence showed an insertion, whereas the rest did not show any insertion. The final alignment contains 3 Neandertal, 1,905 human, and 6 chimpanzee sequences. The range used for analysis covers positions 1602316400, that correspond to the longest Neandertal sequence (fig. 1
). In the second analysis we included individuals where both HVI and HVII regions were sequenced at least from positions 1605616378 and 57343, respectively (fig. 1
). These ranges cover the region common to the two Neandertal sequences. Repeated or ambiguous entries were deleted. A total of 377 human entries was selected (79 Africans, 59 Asians, 84 Australians-Oceanians, 123 Europeans, and 32 Americans). The final alignment contains 2 Neandertal, 377 human, and 6 chimpanzee sequences that cover positions 1602316400 and 57343. We deleted alignment columns where most sequences showed a gap.
|
Pairwise Distances
Uncorrected and maximum-likelihood pairwise distances have been computed using PAUP* 4.0 (Swofford 1998
). Maximum-likelihood distances were calculated using the best model fitting the data.
Phylogenetic Analysis and Bootstrapping
Phylogenetic analyses were carried out using PAUP* 4.0 (Swofford 1998
). For each data set the procedure was: (1) the sequence order was randomized, (2) the maximum-likelihood distances (using the selected model and the best parameters) were computed, and (3) a neighbor-joining tree (Saitou and Nei 1987
) was constructed for 100 bootstrap replicates (Felsenstein 1985
). Interior branch tests (Sitnikova, Rzhetsky, and Nei 1995
) were carried out with the program PHYLTEST 2.0 (Kumar 1996
) using the most complex model available in this program: the Kimura two-parameter model (Kimura 1980
) with gamma correction.
![]() |
Results and Discussion |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
The parameters estimated in our work (table 2
) are similar to those obtained by other authors (Excoffier and Yang 1999
; Meyer, Weiss, and von Haeseler 1999
). There is a strong difference between transitions and transversions, and a low value for the gamma distribution shape parameter, which indicates a strong heterogeneity in the substitution rate among sites. The value of the shape parameter estimated here is greater than other estimations (Excoffier and Yang 1999
; Meyer, Weiss, and von Haeseler 1999
) because we have introduced a parameter for the proportion of invariable sites.
|
Given that neither the bootstrap method (Felsenstein 1985
) nor the likelihood mapping method used by Krings et al. (1997
, 1999
, 2000)
have statistical meaning, we performed the interior branch test. It has been shown that the confidence value of this test is the complement of the P values in the standard statistical test (Sitnikova, Rzhetsky, and Nei 1995
). We divided our data sets into four clusters: (1) chimpanzees, (2) Neandertals, (3) extant human sequences located at the base of the HVI + HVII tree (fig. 2B
) that correspond to 10 !Kung entries of the MOUSE database, and (4) the rest of humans. We then determined whether the lengths of the interior branches of the three possible trees for the four sequence clusters are significantly different from 0. The result of this test is not significant for either the HVI or HVI + HVII regions. Thus, like the bootstrap analyses, the interior branch test suggests that a polytomy is the best representation for the evolutionary tree relating Neandertal and extant humans.
Pairwise Distances
The NSG reported that the pairwise comparisons between the Neandertal and human sequences demonstrate that Neandertals are outside of modern human D-loop variability. In particular, Krings et al. (1997)
stated that a total of 0.002% of the pairwise comparisons between human mtDNA sequences were larger than the smallest difference between the Neandertal and the humans. We think that this point merits further analysis. The current database is biased because of the overrepresentation of some populations and the underrepresentation of others. For instance, the MOUSE database contains 6,012 entries for the HVI region, but 31% of the entries belong to only 20 populations out of 206 populations represented (10% of the total of populations). The extreme cases are 306 Koreans, 126 Yaps, 120 Cayapa Amerindians, 119 Mandeka, 115 Palau, and 100 white British. There are also 1,417 entries of undetermined population (40% of them are from North America and 23% European, but only 9% are from Africa). Thus, African populations containing the most ancient lineages and the highest variation are underrepresented in the database.
Because of the database overrepresentation of some human populations, the distribution of pairwise distances is biased. A large part of pairwise comparisons are made between individuals belonging to the same population. Likewise, it is expected that most individuals from a single population will show similar distances to a given outgroup (Neandertal, in this case). To overcome this problem, we considered another sample of the human variation. We first sorted the HVI sequences in our data set according to its uncorrected distance to the reference sequence (Anderson et al. 1981
). Then we grouped them into 171 classes, containing equidistant sequences (considering four decimals), and chose one sequence at random from each class. The computation of pairwise distances between the 171 randomly selected sequences and the Neandertals rendered 1.6% of human-human comparisons larger than the smallest difference between Neandertals and humans. Likewise, 27% of the comparisons are lower than the largest human-human difference. This result suggests that Neandertals sequences are not so different from those of extant humans, in contrast to the NSG claims.
Final Considerations
A main conclusion can be extracted from our analyses: the phylogenetic position of the ancient DNA sequences recovered from Neandertal bones is sensitive to the phylogenetic methods employed. It depends on the model of nucleotide substitution, the branch support method, and the set of data used. Adcock et al. (2001)
recovered HVI sequences of archaic human bones from Australia, and their phylogenetic analysis showed that two of the specimens were outgroups even for the most ancient African lineages. They concluded that this is an evidence supporting the multiregional hypothesis. However, a second analysis carried out by Cooper et al. (2001)
that took into account the heterogeneity of rates between sites and a large sample of modern humans, showed that both HVI sequences are located among extant humans. This case illustrates the influence of the nucleotide substitution model on the phylogenetic reconstruction of the human D-loop region.
The NSG studies used poor parameter models of nucleotide substitution for their analyses, whereas we opted for complex (parameter rich) models following the likelihood ratio test. Both alternatives have pros and cons. Yang (1997)
and Posada and Crandall (2001)
encourage the use of the best-fit model, although they reported some examples where simple models can recover the true phylogenetic tree better than complex models. Several authors highlight the importance of using the best nucleotide substitution model for a given data set (Huelsenbeck and Hillis 1993
; Sullivan and Swofford 1997
; Cunningham, Zhu, and Hillis 1998
). The same considerations apply to the choice of the branch support method. The bootstrap method has some caveats (Cummings, Otto, and Wakeley 1995
), and the quartet puzzling (likelihood mapping) has also been criticized for overestimating branch support values (Cao, Adachi, and Hasegawa 1998
).
Secondly, we believe that the likelihood mapping values supporting Neandertals as a different species might be artifactually increased. To illustrate this point, we took two human HVI sequences showing maximal divergence between them, a chimpanzee, and the Feldhofer Neandertal. The three possible quartets were evaluated with the program TREE-PUZZLE (Strimmer and von Haeseler 1996
) with the same model and parameter used by Krings et al. (1997)
. It can be seen (fig. 3
) that the chimpanzee branch is very long, and that the Neandertal branch is longer than the two humans, opening the possibility of a LBA artifact. It has been reported that quartet methods are very prone to LBA (Huelsenbeck 1998
). Thus, the best-scored topology is usually the one joining long branches in exhaustive quartet evaluations. Another issue relates to the significance of the log-likelihood values. In the trees drawn in figure 3
, the best score is for the tree that joins Neandertal and chimpanzee sequences. However, the log-likelihoods are very similar between the alternative topologies. The Kishino-Hasegawa test (Kishino and Hasegawa 1989
) yielded no significant differences between them at the 5% level. This example illustrates that likelihood mapping can select as the best topology one that is not significantly better than the other two. A third argument against the use of likelihood mapping in this study has to do with the database population bias mentioned earlier. It is not advisable to perform likelihood mapping with a database where most human sequences are very similar among themselves.
|
The low bootstrap values obtained in our work indicate the lack of resolution of the HVI and HVII sequences to determine the phylogenetic position of Neandertals because no tree topology is favored. In part, this is caused by short sequences with a high proportion of invariable sites. Ingman et al. (2000)
noted that the D-loop region, in spite of its wide use for human phylogenetic analysis, is not an appropriate region of the mitochondrial genome for answering phylogenetic questions.
Another interesting issue is the possibility that the Neandertal sequences were artifactual. Caldararo and Gabow (2000)
noticed that some of the nucleotide substitutions found in the Feldhofer Neandertal sequence matched those found in other human ancient HVI sequences. They attributed these similarities to diagenetic changes that occurred in fossil DNA. Hansen et al. (2001)
studied different clones of ancient DNA amplifications and concluded that PCR can introduce errors caused by the miscoding lesions of ancient DNA. In the absence of diagenetic changes, the mean distance between an ancient sequence and an extant one ought to be larger than the distance between a more ancient sequence and the extant one. So, the more ancient a sequence, the shorter its distance to the extant sequences (fig. 4
). We tested this prediction by computing the average distances between each Neandertal HVI sequence and our data set of extant HVI human sequences (table 3
). It can be seen that the most recent specimen (Mezmaiskaya) shows a shorter distance than the oldest one (Feldhofer). This result can be explained by a different nucleotide substitution rate among Neandertal lineages (populations) or by miscoding DNA lesions in Neandertal fossils. Had DNA damage increased the differences between Neandertal and modern humans, Neandertals would be more akin to modern humans than what recent claims suggest.
|
|
![]() |
Acknowledgements |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
Footnotes |
---|
Keywords: Neandertal
ancient DNA
D-loop
Address for correspondence and reprints: Gabriel Gutiérrez, Departamento de Genética. Universidad de Sevilla, Apartado 1095, 41080 Sevilla, Spain. ggpozo{at}us.es
![]() |
References |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Adcock G. J., E. S. Dennis, S. Easteal, G. A. Huttley, L. S. Jermiin, W. J. Peacock, A. Thorne, 2001 Mitochondrial DNA sequences in ancient Australians: implications for modern human origins Proc. Natl. Acad. Sci. USA 98:537-542
Anderson S., A. T. Bankier, B. G. Barrell, et al. (14 co-authors) 1981 Sequence and organization of the human mitochondrial genome Nature 290:457-465[ISI][Medline]
Brocchieri L., 2001 Phylogenetic inferences from molecular sequences: review and critique Theor. Popul. Biol 59:27-40[ISI][Medline]
Burchardt F., A. von Haeseler, S. Meyer, 1999 HvrBase: compilation of mtDNA control region sequences from primates Nucleic Acids Res 27:138-142
Caldararo N., S. Gabow, 2000 Mitohondrial DNA analysis and the place of Neandertals in Homo Ancient Biomol 3:135-158
Cann R. L., M. Stoneking, A. C. Wilson, 1987 Mitochondrial DNA and human evolution Nature 325:31-36[ISI][Medline]
Cao Y., J. Adachi, M. Hasegawa, 1998 Comment on the quartet method for finding maximum-likelihood tree topologies Mol. Biol. Evol 15:87-89
Cooper A., A. Rambaut, V. Macaulay, E. Willerslev, A. J. Hansen, C. Stringer, 2001 Human origins and ancient human DNA Science 292:1655-1656
Cummings M. P., S. P. Otto, J. Wakeley, 1995 Sampling properties of DNA sequence data in phylogenetic analysis Mol. Biol. Evol 12:814-822[Abstract]
Cunningham C. W., H. Zhu, D. M. Hillis, 1998 Best-fit maximum-likelihood models for phylogenetic inference: empirical tests with known phylogenies Evolution 52:978-987[ISI]
Excoffier L., Z. Yang, 1999 Substitution rate variation among sites in mitochondrial hypervarible region I of humans and chimpanzees Mol. Biol. Evol 16:1357-1368[Abstract]
Felsenstein J., 1985 Confidence limits on phylogenies: an approach using the bootstrap Evolution 39:783-791[ISI]
Foley R., 1998 The context of human genetic evolution Genome Res 8:339-347
Hansen A. J., E. Willerslev, C. Wiuf, T. Mourier, P. Arctander, 2001 Statistical evidence for miscoding lesions in ancient DNA templates Mol. Biol. Evol 18:262-265
Hawks J., M. Wolpoff, 2001 Paleoanthropology and the population genetics of ancient genes Am. J. Phys. Anthropol 114:269-272[ISI][Medline]
Huelsenbeck J. P., 1998 Systematic bias in phylogenetic analysis: is the Strepsiptera problem solved? Syst. Biol 47:519-537[ISI][Medline]
Huelsenbeck J. P., D. M. Hillis, 1993 Success of phylogenetic methods in the four-taxon case Syst. Biol 42:247-264[ISI]
Huelsenbeck J. P., B. Rannala, 1997 Phylogenetic methods come of age: testing hypothesis in an evolutionary context Science 276:227-232
Ingman M., H. Kaessmann, S. Pääbo, U. Gyllensten, 2000 Mitochondrial genome variation and the origin of modern humans Nature 408:708-713[ISI][Medline]
Jorde L. B., W. S. Watkins, M. J. Bamshad, M. E. Dixon, C. E. Ricker, M. T. Seieltad, M. A. Batzer, 2000 The distribution of human genetic diversity: a comparison of mitochondrial, autosomal, and Y-chromosome data Am. J. Hum. Genet 66:979-988[ISI][Medline]
Jukes T. H., C. R. Cantor, 1969 Evolution of protein molecules Pp. 21132 in H. N. Munro, ed. Mammalian protein metabolism. Academic Press, New York
Kimura M., 1980 A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences J. Mol. Evol 16:111-120[ISI][Medline]
Kishino H., M. Hasegawa, 1989 Evaluation of the maximum likelihood estimation of the evolutionary tree topologies from DNA sequence data, and the branching order in Hominoidea J. Mol. Evol 29:170-179[ISI][Medline]
Krings M., C. Capelli, F. Tschentscher, H. Geisert, S. Meyer, A. von Haeseler, K. Grossschmidt, G. Possnert, M. Paunovic, S. Pääbo, 2000 A view of Neandertal genetic diversity Nat. Genet 26:144-145[ISI][Medline]
Krings M., H. Geisert, R. W. Schmitz, H. Krainitzki, S. Pääbo, 1999 DNA sequence of the mitochondrial hypervariable region II from the Neandertal type specimen Proc. Natl. Acad. Sci. USA 96:5581-5585
Krings M., A. Stone, R. W. Schmitz, H. Krainitzki, M. Stoneking, S. Pääbo, 1997 Neandertal DNA sequences and the origin of modern humans Cell 90:19-30[ISI][Medline]
Kumar S., 1996 PHYLTEST: phylogenetic hypothesis testing software. Version 2.0 Pennsylvania State University, University Park, Pa
Maddison D. R., M. Ruvolo, D. L. Swofford, 1992 Geographic origins of human mitochondrial DNA: phylogenetic evidence from control region sequences Syst. Biol 41:111-124[ISI]
Meyer S., G. Weiss, A. von Haeseler, 1999 Pattern of nucleotide substitution and rate heterogeneity in the hypervariable regions I and II of human mtDNA Genetics 152:1103-1110
Nordborg M., 1998 On the probability of Neanderthal ancestry Am. J. Hum. Genet 63:1237-1240[ISI][Medline]
Ovchinnikov I. V., A. Götherström, G. O. Romanova, V. M. Kharitonov, K. Lidén, W. Goodwin, 2000 Molecular analysis of Neanderhal DNA from the northern Caucasus Nature 404:490-493[ISI][Medline]
Philippe H., J. Laurent, 1998 How good are deep phylogenetic trees? Curr. Opin. Genet. Dev 8:616-623[ISI][Medline]
Posada D., K. A. Crandall, 1998 MODELTEST: testing the model of DNA substitution Bioinformatics 14:817-818[Abstract]
. 2001 Simple (wrong) models for complex trees: a case from retroviridae Mol. Biol. Evol 18:271-275
Relethford J. H., 2001 Absence of regional affinities of Neandertal DNA with living humans does not reject multiregional evolution Am. J. Phys. Anthropol 115:95-98[ISI][Medline]
Saitou N., M. Nei, 1987 The neighbor-joining method: a new method for reconstructing phylogenetic trees Mol. Biol. Evol 4:406-425[Abstract]
Sitnikova T., A. Rzhetsky, M. Nei, 1995 Interior-branch and bootstrap tests of phylogenetic trees Mol. Evol. Biol 12:319-333[Abstract]
Strimmer K., A. von Haeseler, 1996 Quartet puzzling: a quartet maximum likelihood method for reconstructing tree topologies Mol. Biol. Evol 13:964-969
Stringer C. B., P. Andrews, 1988 Genetic and fossil evidence for the origin of modern humans Science 239:1263-1268[ISI][Medline]
Sullivan J., D. L. Swofford, 1997 Are guinea pigs rodents? The importance of adequate models in molecular phylogenetics J. Mamm. Evol 4:77-86
Swofford D. L., 1998 PAUP*: phylogenetic analysis using parsimony (*and other methods). Version 4 Sinauer Associates, Sunderland, Mass
Swofford D. L., G. J. Olsen, P. J. Waddell, D. M. Hillis, 1996 Phylogenetic inference Pp. 407514 in D. M. Hillis, C. Moritz, and B. K. Mable, eds. Molecular systematics. Sinauer, Sunderland, Mass
Tamura K., M. Nei, 1993 Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees Mol. Biol. Evol 10:512-526[Abstract]
Wakeley J., 1993 Substitution rate variation among sites in hypervariable region 1 of human mitochondrial DNA J. Mol. Evol 37:613-623[ISI][Medline]
Wiens J. J., 1998 Combining data sets with different phylogenetic histories Syst. Biol 47:568-581[ISI][Medline]
Wolpoff M. H., J. Hawks, D. W. Frayer, K. Hunley, 2001 Modern human ancestry at the peripheries: a test of the replacement theory Science 291:293-297
Yang Z., 1997 How often do wrong models produce better phylogenies? Mol. Biol. Evol 14:105-108
Yang Z., N. Goldman, A. Friday, 1994 Comparison of models for nucleotide substitution used in maximum-likelihood phylogenetic estimation. 1994 Mol. Biol. Evol 11:316-324[Abstract]