Department of Ecology and Evolutionary Biology, Biosciences West Building, University of Arizona, Tucson
![]() |
Abstract |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
Introduction |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
In addition to leaving a signature on the target of selection, positive selection should affect patterns of linked neutral variation to an extent determined by the local recombination rate and the strength of selection (Maynard Smith and Haigh 1974
). Two modes of positive selection, directional and balancing, are expected to impact molecular diversity at linked regions in different ways. Positive directional selection can lead to an overall reduction of linked variation and an excess of alleles, given the observed heterozygosity (Maynard Smith and Haigh 1974
; Watterson 1978
; Kaplan, Hudson, and Langley 1989
). In contrast, balancing selection can lead to an elevation of linked polymorphism and a deficit of alleles, given the observed heterozygosity (Watterson 1978
; Hudson, Kreitman, and Aguade 1987
).
The idea that selection can lead to a skew in the frequency distribution at linked loci suggests a way to identify individual genomic regions experiencing positive selection. With a set of loci throughout the genome mapped at sufficiently high density, clusters of loci exhibiting similar skews in the frequency distribution of alleles may pinpoint genomic regions that have recently experienced positive selection. In principle, such an approach could be used to identify regions with either an excess of rare alleles (indicative of directional selection) or an excess of intermediate frequency alleles (indicative of balancing selection). Because most human populations are not at equilibrium and have undergone recent expansions (e.g., Rogers and Harpending 1992
), localized skews in the frequency distribution for particular genomic regions must be interpreted against the background of the genomic average skew in the frequency distribution created by population-level processes.
In addition to localized skews in the frequency distribution, positive selection may cause different patterns of variability at X-linked and autosomal loci (Begun and Whitley 2000)
. If adaptive mutations are recessive on an average, fixation rates are expected to be higher for the X chromosome relative to the autosomes because recessive X-linked mutations will be visible to selection at every generation in males (Charlesworth, Coyne, and Barton 1987
). In addition to having faster fixation rates, X-linked beneficial mutations may experience shorter transit times than their autosomal counterparts. Because X-linked mutations spend one-third of their time in a haploid state, their fitness variance is larger than that of autosomal mutations, making selection more effective and the increase in frequency more rapid for X-linked beneficial mutations (Avery 1984
). Thus, there are several reasons why beneficial mutations arising on the X chromosome may be affected more dramatically by selection than mutations arising on the autosomes (discussed in Begun and Whitley 2000)
. However, if selection acts on standing variation that is in mutation-selection balance, fixation rates for X-linked mutations are not expected to be faster (Orr and Betancourt 2001)
. Assuming similar recombinational environments for the X chromosome and the autosomes, the frequency distributions at neutral loci can be used to test the hypothesis that X-linked beneficial mutations experience faster fixation rates or shorter transit times (or both) than autosomal mutations. This hypothesis predicts that X-linked loci may show a greater excess of rare alleles on average.
In this article, we use published polymorphism data from 5,257 genetically mapped microsatellites to search the human genome for evidence of positive selection in two ways. First, we use a sliding-window approach to look for genomic regions showing unusually high numbers of loci deviating from equilibrium. Second, we compare the frequency distributions of alleles at X-linked and autosomal microsatellites.
![]() |
Materials and Methods |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
|
We calculated the expected allele numbers and heterozygosities by assuming a stepwise mutation model (SMM; Ohta and Kimura 1973
). The SMM appears to be appropriate for many human dinucleotide repeat microsatellites (Valdes, Slatkin, and Friemer 1993
; Weber and Wong 1993
). In the data used in this study (Dib et al. 1996
), about 90% of the observed mutations produced repeat changes of size one or two.
We investigated departures from the frequency distribution expected using the SMM in two ways. First, we used the formula of Ohta and Kimura (1973)
to estimate the expected number of alleles, given the observed levels of heterozygosity, for each locus separately. Second, we used the coalescent method (and BOTTLENECK v. 1.2.02 software) of Cornuet and Luikart (1996)
. This approach simulates microsatellite evolution under a specified mutational model (in this case, 1,000 replicates per locus using the SMM), assuming a constant population size at mutation-drift equilibrium, and calculates the expected heterozygosity at a locus, given the observed number of alleles. By completing many replicate simulations, the difference between observed and expected heterozygosity can be assigned a probability. This approach is formally equivalent to Watterson's (1978)
homozygosity test (in reverse). The critical value for the probabilities was set at 0.05.
Clustering of Nonequilibrium Loci
Positive selection may induce a nonrandom spatial distribution of loci with unusual frequency distributions. We tested this prediction in two ways. First, we conducted simple spatial autocorrelation analyses, asking whether the statistics of neighboring loci (i.e., lag of one locus) are more similar than those expected by chance. Autocorrelations were estimated separately for each chromosome and in analyses including all chromosomes. Autocorrelation tests were performed using the observed minus expected heterozygosity, as well as the number of alleles, heterozygosity, and variance in allele size.
Second, we identified genomic regions exhibiting clustering of significant loci. We defined a sliding window, 5 cM in size, beginning with each microsatellite. Huttley et al. (1999)
demonstrated that linkage disequilibrium rarely extends beyond 5 cM for this data set; hence, this genetic distance provides a reasonable upper bound for selection effects mediated by linkage. Next, we counted the number of loci in each window showing significant differences between observed and expected heterozygosity using coalescent simulations (Cornuet and Luikart 1996
). We then used the genome-wide average proportion of significant loci (or the chromosomal average) to predict the proportion of significant markers that would be found in each window if significant loci were randomly distributed throughout the genome (or along an individual chromosome). The probability of the observed (or more extreme) proportion was estimated using the binomial distribution as
|
The computation of more than 5,000 binomial tests suggests that the statistical critical value should be reduced. We employed two independent corrections for multiple tests. First, we applied a Bonferroni correction (Sokal and Rohlf 1995
, p. 240). This suggested a critical value of 9.5 x 10-6 = (0.05/5,257). We also used the randomization procedure of Churchill and Doerge (1994)
, which was designed to deal with a similar multiple test problem: the identification of regions of the genome related to quantitative trait variation (QTL mapping). Holding the number of loci per window and their genetic positions constant, the significance state (significant vs. not significant) was shuffled across loci throughout the genome. Binomial P-values for all resulting windows were calculated, and the minimum P-value in the genome was recorded. We repeated this procedure 10,000 times to generate a distribution of minimum P-values under the null hypothesis that significant loci are randomly distributed throughout the genome. Binomial probabilities observed for actual windows were then compared with this distribution. We conducted a comparable test using the chromosome as the level of the experiment (23 times). Here, the null distribution was formulated on the basis of 1,000 replicates, and a separate mean proportion of significant loci per window was used for each chromosome (rather than the genome-wide mean).
This binomial test assumes that adjacent loci are independent under a neutral model. Because closely linked loci will have correlated histories, this assumption is not strictly correct. Therefore, this test may overestimate the significance of some genomic regions. However, autocorrelation analyses of the skew in the frequency distribution and of the measures of variation suggest little evidence for shared histories among neighboring loci overall (see Results).
The Frequency Distribution of Polymorphisms and Recombination Rate
Theory predicts that positive selection will affect nearby neutral polymorphism most strongly in regions of reduced recombination (Maynard Smith and Haigh 1974
). Therefore, we also assessed the relationship between measures of skew in the frequency distribution and local recombination rate. Recombination rates were estimated for a subset of the microsatellites considered in this article by comparing integrated genetic and physical maps of the human genome (Payseur and Nachman 2000)
. We used Kendall's correlation tests to evaluate the association between measures of skew in the frequency distribution and recombination rate.
Comparisons Between the X Chromosome and the Autosomes
We compared the frequency distributions of the X-linked and autosomal microsatellites using Mann-Whitney U-tests. Results from both Ohta and Kimura's (1973)
formulae and from Cornuet and Luikart's (1996)
method were analyzed in these tests.
![]() |
Results |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
|
|
Sliding-Window Analysis
The sliding-window binomial probabilities for a deficit of heterozygosity are plotted against genetic map position for all the 23 chromosomes in figure 2
. Several patterns emerge from this analysis. First, there is considerable variance in binomial P-values within individual chromosomes. Chromosomal regions with multiple adjacent windows containing low P-values may represent portions of the genome recently subject to positive directional selection. Second, the X chromosome, in general, harbors a greater proportion of windows with low P-values than do the autosomes. For example, 7.7% of all windows on the X chromosome are significant at the 0.01 level compared with 0.5% of the windows on the autosomes. Finally, when either a Bonferroni correction or the randomization method of Churchill and Doerge (1994)
using genome-wide means is used to account for 5,257 tests, none of the individual sliding-window probabilities remain significant for either an excess or deficit of heterozygosity. When the randomization method of Churchill and Doerge (1994)
is used with chromosome means, one window contains a significant deficit of heterozygosity, located at 100.4 cM on chromosome 17 (although the observation that chromosome 17 exhibits a significant degree of autocorrelation suggests caution in the interpretation that this window contains a target of selection). These corrections for multiple tests may be conservative and may overlook regions of biological importance, as discussed below. They also do not take into account adjacent clusters of windows with low P-values.
|
|
|
|
|
The hypothesis that genomic windows with the most severe skews in the frequency distribution represent candidate regions for positive directional selection also predicts that microsatellites in these windows will show reduced variation. The mean heterozygosity is reduced in these extreme windows relative to the genome-wide average, and this effect is significant (Mann-Whitney U; P < 0.0003). However, extreme windows do not show reduced variance in allele size (P > 0.05) and do exhibit a higher mean number of alleles than other windows (P < 0.02). This discrepancy between different measures of variation may suggest that microsatellite heterozygosity is a more sensitive indicator of recent selective sweeps than allele number or variance in allele size. Although we cannot offer a simple demographic scenario that could produce this discordance, these results may also indicate that some of the identified windows are not tracking selective events.
Recombination Rates and the Frequency Spectrum
Previous work revealed no correlation between recombination rate and levels of microsatellite polymorphism in these data (Payseur and Nachman 2000)
. We compared variation in the recombination rate with measures of the skew in the frequency distribution. Overall, recombination rate is not correlated with the difference between observed and expected heterozygosity (P > 0.05) or with the difference between observed and expected number of alleles (P > 0.05). However, there is evidence that windows with extreme skews toward a deficit of heterozygosity exhibit reduced recombination rates relative to the genome-wide average (table 4
; extreme windows: average = 0.8 cM/Mb; genome-wide: average = 1.5 cM/Mb; Mann Whitney U, P < 0.05). This effect is most pronounced for windows on the X chromosome, where all the windows with extreme skews are found in regions with the lowest rates of recombination. This nonrandom association between skews in the frequency distribution and recombination rate on the X chromosome is highly significant (table 4
, Fisher's Exact Test, P < 10-10). An analysis of this pattern using only nonoverlapping windows or using individual loci gives the same result (Fisher's Exact Test, P < 0.05 for both).
|
|
![]() |
Discussion |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Population Structure and the Frequency Distribution of Polymorphisms
The average microsatellite in this population is about 7% less heterozygous than expected when heterozygosity is predicted with allele-based coalescent simulations assuming the SMM. Over 25% (1,447/5,257) of the tests on individual loci show significant deficits of heterozygosity. As expected, most of these loci also show an excess of rare alleles using Ohta and Kimura's (1973)
formula. It seems unlikely that most of these loci are tracking individual events of directional selection or that the excess of rare alleles indicates widespread maintenance of slightly deleterious alleles. Such a genome-wide pattern is more parsimoniously interpreted as evidence of a recent population expansion in Europe. An alternative explanation is that the SMM is inappropriate for many human microsatellites and that it (systematically) overpredicts the expected heterozygosity. However, given the genetic evidence (Kimmel et al. 1998
; Torroni et al. 1998
) for the recent European population size increase and the support for the SMM in humans (Valdes, Slatkin, and Friemer 1993
; Weber and Wong 1993
), the demographic explanation seems most reasonable.
Sliding-Window Analysis
The sliding-window analysis is predicated on the notion that the spatial distribution of markers can provide information that is useful in identifying regions of the genome under selection. There are a number of biological and statistical issues that must be confronted in interpreting the results of these analyses, including the background signature of a population expansion, the likelihood that selection will impact patterns of microsatellite variability, and the statistical problems of assuming independence among linked loci and of conducting multiple tests.
First, it is challenging to identify locus-specific skews in the frequency distribution against the background of a genome-wide skew in the same direction. Thus, it is likely to be more difficult to detect positive directional selection in an expanding population than in one of a constant size.
Second, although many models of selection deal with simple, strong effects (e.g., Maynard Smith and Haigh 1974
; Simonsen, Churchill, and Aquadro 1995
), weak or fluctuating selection may be important in shaping patterns of variation at the molecular level (Gillespie 1991
, p. 142; Wayne and Simonsen 1998
; Przeworski, Hudson, and di Rienzo 2000
). A simple, catastrophic selective sweep in which a newly arising adaptive mutation is quickly driven to fixation is expected to leave a clear signature on the frequency spectrum of segregating polymorphisms, and this has been best studied with nucleotide polymorphisms (Braverman et al. 1995
; Simonsen, Churchill, and Aquadro 1995
). In these studies, the power to detect a sweep from the frequency spectrum is high only during a fairly brief interval after the sweep. Outside of this interval, or if selection is weak, the power to detect a sweep is quite low. Although similar power analyses have not been carried out with microsatellite data, it is likely that the power to detect sweeps will be similarly low outside of a narrow time interval. An added concern is that the high mutation rate of microsatellites will quickly obscure past selective events. In a simple hitchhiking model with s = 0.01 and N = 104, Wiehe (1998)
and Schlötterer and Wiehe (1999)
have shown that a reduction in microsatellite variability is expected only if microsatellites are tightly linked (<0.15 cM) and if microsatellite mutation rates are low (<0.01). Thus, we might expect, a priori, that it will be difficult to detect a skew in the frequency distribution, except in regions of low recombination where any given microsatellite will presumably be linked to more genes. For example, the recombinational distance of less than approximately 0.15 cM required to detect hitchhiking may correspond to a physical distance of less than 25 kb in high-recombination regions of the genome and a distance of 1 Mb or more in low-recombination regions. The average gene density across the human genome is approximately 10 genes per Mb (International Human Genome Sequencing Consortium 2001
). It is noteworthy that we observe windows with extreme skews in the frequency distribution mostly in regions of the genome with little recombination (table 2
). This suggests that these windows are tracking selective events. The power to detect selection may be even lower under more complex models of selection. For example, some models of selection in fluctuating environments can result in little skew in the frequency distribution of polymorphisms (Gillespie 1994
).
Statistical issues may also complicate our attempts to identify genomic regions under selection. First, the binomial test we applied assumes statistical independence among loci. Even under a neutral model, however, closely linked loci may share histories, leading to correlations among their frequency spectra. As a result, our analyses may overestimate the statistical significance of some genomic windows. However, the observation that measures of variation are not autocorrelated and that frequency spectra are only weakly autocorrelated suggests that this problem is unlikely to be severe.
Second, given the corrections for multiple tests, a sliding window will be significant only if its binomial probability is less than about 10-5. Because of the genome-wide skew toward a deficit of heterozygosity, this occurs only if approximately 90% of the loci within the window individually show significant skews in the frequency distribution. One way to improve the power of the test would be to increase the window size. Although this approach would be statistically defensible, it is less biologically reasonable because selection is unlikely to exert effects over long distances. Therefore, the identification of genomic regions affected by selection when large numbers of loci are analyzed remains a difficult problem.
The distribution of extreme windows is highly nonrandom, both with respect to X-autosome differences and with respect to recombination rate. As noted previously, these observations suggest that many or most of these windows are tracking selective events. Further indication that some of the windows in table 2
are tracking selection comes from studies of human nucleotide polymorphism in these same regions. For example, the two regions of low recombination near the X chromosome centromere that are identified in table 2 also contain genes showing reduced nucleotide heterozygosity and a skew in the frequency spectrum with negative values of Tajima's (1989)
D statistic (Nachman 2001
). Furthermore, one of these regions contains a microsatellite (at 99.7 cM) that shows an unusually high level of divergence in allele frequencies between European Americans and African Americans (Smith et al. 2001)
. These observations are consistent with recent positive directional selection.
Recombination Rates and the Frequency Spectrum
Theory predicts that positive selection will impact linked neutral variation most severely in regions of reduced recombination (Maynard Smith and Haigh 1974
). Although we do not observe an overall correlation between recombination rate and measures of the frequency distribution, windows with extreme skews are found predominantly in low recombination regions. The mean recombination rate for windows with extreme skews is significantly below the genomic average. This effect is particularly pronounced on the X chromosome (table 4
).
Comparisons Between the X Chromosome and the Autosomes
There is a greater skew in the frequency distribution for X-linked loci than for autosomal loci. The existence of this difference is not sensitive to the mechanism of mutation: a similar result is obtained when an infinite alleles model is assumed (results not shown). Assuming that the X chromosome and the autosomes experience similar average recombination rates, this result has both demographic and selective explanations. First, differences in effective population size may cause X-linked and autosomal loci to be differentially affected by demographic events. For example, Fay and Wu (1999)
showed that a similar discordance in the frequency spectrum between mitochondrial and autosomal loci in humans could be caused by a population bottleneck. The effect is expected to be less severe for X-autosome comparisons because the X chromosome has three-fourths of the effective population size of autosomes, whereas the comparable value for the mitochondrial genome is one-fourth. Differences between the sexes in demographic factors, such as migration rate and age structure, may also leave different signatures on X-chromosomal and autosomal frequency spectra.
Alternatively, this discrepancy may indicate a relatively higher fixation rate or shorter transit time for beneficial mutations on the X chromosome (Avery 1984
; Charlesworth, Coyne, and Barton 1987
; Begun and Whitley 2000
). Weak support for the hypothesis that fixation rates are higher or transit times are lower for adaptive mutations on the X chromosome comes from the observation that the overall levels of variability are slightly lower on the X chromosome than on the autosomes. From the observed heterozygosity and the observed variance in allele size, we estimated the ratio
X/
A as 0.71 and 0.68, respectively, both slightly below the expected value of 0.75. Similarly, the density of single nucleotide polymorphisms is lower on the X chromosome than on the autosomes (International SNP Map Working Group 2001
). Reduced levels of variation on the X chromosome relative to the autosomes have also been noted in several other species, including mice (Hedrick and Parker 1997
) and Drosophila simulans (Begun and Whitley 2000)
.
At present, we cannot clearly distinguish between the competing demographic and selective explanations for the discrepancy in frequency spectra between the X chromosome and the autosomes. Additional theoretical work could help resolve which model(s) best explain(s) this pattern.
Predictions
The availability of the complete genome sequence of humans may make it possible to test some predictions from this study. For example, we might expect to find some genes in the windows in table 2
that show high ratios of nonsynonymous to synonymous substitutions in interspecific comparisons. There is an inherent difficulty, common to mapping studies, in identifying the underlying genes: it is easiest to detect a signal in regions of reduced recombination specifically because the number of genes contained in these regions is expected to be large, but the large number of genes makes it more difficult to pinpoint individual genes of interest. For this reason, it may be most promising to look first at the few windows in table 2
that show high rates of recombination. The markers in these windows may lie close to the genes under selection.
![]() |
Acknowledgements |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
Footnotes |
---|
Keywords: positive selection
directional selection
balancing selection
genetic hitchhiking
microsatellites
human genome
Address for correspondence and reprints: Bret A. Payseur, Department of Ecology and Evolutionary Biology, Biosciences West Building, University of Arizona, Tucson, Arizona 85721. payseur{at}email.arizona.edu
![]() |
References |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Avery P. J., 1984 The population-genetics of haplo-diploids and X-linked genes Genet. Res 44:321-341[ISI]
Begun D. J., P. Whitley, 2000 Reduced X-linked nucleotide polymorphism in Drosophila simulans Proc. Natl. Acad. Sci 97:5960-5965
Braverman J. M., R. R. Hudson, N. L. Kaplan, C. H. Langley, W. Stephan, 1995 The hitchhiking effect on the site frequency spectrum of DNA polymorphims Genetics 140:783-796
Charlesworth B., J. A. Coyne, N. H. Barton, 1987 The relative rates of evolution of sex chromosomes and autosomes Am. Nat 130:113-146[ISI]
Churchill G. A., R. W. Doerge, 1994 Empirical threshold values for quantitative trait mapping Genetics 138:963-971
Cornuet J.-M., G. Luikart, 1996 Description and power analysis of two tests for detecting recent population bottlenecks from allele frequency data Genetics 144:2001-2014
Dib C., S. Faure, C. Fizames, et al. (14 co-authors) 1996 A comprehensive genetic map of the human genome based on 5,264 microsatellites Nature 380:152-154[ISI][Medline]
Ewens W. J., 1972 The sampling theory of selectively neutral alleles Theor. Popul. Biol 3:87-112[ISI][Medline]
Fay J. C., C.-I. Wu, 1999 A human population bottleneck can account for the discordance between patterns of mitochondrial versus nuclear DNA variation Mol. Biol. Evol 16:1003-1005
Fu Y.-X., 1997 Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection Genetics 147:915-925
Fu Y.-X., W.-H. Li, 1993 Statistical tests of neutrality of mutations Genetics 133:693-709
Gillespie J. H., 1991 The causes of molecular evolution Oxford University Press, Oxford
. 1994 Alternatives to the neutral theory Pp. 117 in B. Golding, ed. Non-neutral evolution: theories and molecular data. Chapman and Hall, New York
Hedrick P. W., J. D. Parker, 1997 Evolutionary genetics and genetic variation of haplodiploids and X-linked genes Ann. Rev. Ecol. Syst 28:55-83[ISI]
Hudson R. R., M. Kreitman, M. Aguade, 1987 A test of neutral molecular evolution based on nucleotide data Genetics 116:153-159
Huttley G. A., M. W. Smith, M. Carrington, S. J. O'Brien, 1999 A scan for linkage disequilibrium across the human genome Genetics 152:1711-1722
International Human Genome Sequencing Consortium. 2001 Initial sequencing and analysis of the human genome Nature 409:860-921[ISI][Medline]
International SNP Map Working Group. 2001 A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms Nature 409:928-933[ISI][Medline]
Kaplan N. L., R. R. Hudson, C. H. Langley, 1989 The "hitchhiking effect" revisited Genetics 123:887-899
Kimmel M., R. Chakraborty, J. P. King, M. Bamshad, W. S. Watkins, L. B. Jorde, 1998 Signatures of population expansion in microsatellite repeat data Genetics 148:1921-1930
Kreitman M., H. Akashi, 1995 Molecular evidence for natural selection Ann. Rev. Ecol. Syst 26:403-422[ISI]
Maynard Smith J., J. Haigh, 1974 The hitch-hiking effect of a favorable gene Genet. Res 23:23-35[ISI][Medline]
McDonald J. H., M. Kreitman, 1991 Adaptive protein evolution at the Adh locus in Drosophila Nature 351:652-654[ISI][Medline]
Moran P. A. P., 1975 Wandering distributions and the electrophoretic profile Theor. Popul. Biol 8:318-330[ISI][Medline]
Nachman M. W., 2001 Single nucleotide polymorphisms and recombination rate in humans Trends Genet 17:481-485[ISI][Medline]
Ohta T., M. Kimura, 1973 The model of mutation appropriate to estimate the number of electrophoretically detectable alleles in a genetic population Genet. Res 22:201-204[ISI][Medline]
Orr H. A., A. Betancourt, 2001 Haldane's sieve and adaptation from the standing genetic variation Genetics 157:875-884
Payseur B. A., M. W. Nachman, 2000 Microsatellite variation and recombination rate in the human genome Genetics 156:1285-1298
Pritchard J. K., M. Przeworski, 2001 Linkage disequilibrium in humans: models and data Am. J. Hum. Genet 69:1-14[ISI][Medline]
Przeworski M., R. R. Hudson, A. di Rienzo, 2000 Adjusting the focus on human variation Trends Genet 16:296-302[ISI][Medline]
Rogers A. R., H. Harpending, 1992 Population-growth makes waves in the distribution of pairwise genetic-differences Mol. Biol. Evol 9:552-569[Abstract]
Schlötterer C., T. Wiehe, 1999 Microsatellites, a neutral marker to infer selective sweeps Pp. 238248 in D. B. Goldstein and C. Schlötterer, eds. Microsatellites: evolution and applications. Oxford University Press, Oxford
Simonsen K. L., G. A. Churchill, C. F. Aquadro, 1995 Properties of statistical tests of neutrality for DNA polymorphism data Genetics 141:413-439
Smith M. W., J. A. Lautenberger, H. D. Shin, J.-P. Chretien, S. Shrestha, D. A. Gilbert, S. J. O'Brien, 2001 Markers for mapping by admixture linkage disequilibrium in African American and Hispanic populations Am. J. Hum. Genet 69:1080-1094.[ISI][Medline]
Sokal R. R., F. J. Rohlf, 1995 Biometry W. H. Freeman, New York
Tajima F., 1989 Statistical method for testing the neutral mutation hypothesis by DNA polymorphism Genetics 123:585-595
Torroni A., H.-J. Bandelt, L. D'Urbano, et al. (11 co-authors) 1998 MtDNA analysis reveals a major late Paleolithic population expansion from southwestern to northeastern Europe Am. J. Hum. Genet 62:1137-1152.[ISI][Medline]
Valdes A. M., M. Slatkin, N. B. Friemer, 1993 Allele frequencies at microsatellite loci: the stepwise mutation model revisited Genetics 133:737-749
Watterson G. A., 1978 The homozygosity test of neutrality Genetics 88:405-417
Wayne M. L., K. L. Simonsen, 1998 Statistical tests of neutrality in the age of weak selection Trends Ecol. Evol 13:236-240[ISI]
Weber J. L., C. Wong, 1993 Mutation of short human tandem repeats Hum. Mol. Genet 2:1123-1128[Abstract]
Wiehe T., 1998 The effect of selective sweeps on the variance of the allele distribution of a linked multiallele locus: hitchhiking of microsatellites Theor. Popul. Biol 53:272-283[ISI][Medline]