Departamento de Ecología Genética y Evolución, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
![]() |
Abstract |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Key Words: Drosophila buzzatii inversion polymorphism Esterase-A gene flow natural selection nucleotide variation
![]() |
Introduction |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
The utilization of sequencing techniques in population surveys has caused renewed interest in the study of inversion polymorphisms (Andolfatto, DePaulis, and Navarro 2001). Several issues of their evolutionary history, such as mutational origin, extent to which different recombination environmentsnear the breakpoints or in the center more recombining parts of the rearranged segmentaffect nucleotide variation and the mechanisms involved in the maintenance of inversion polymorphisms, may be revisited within the framework of coalescent theory (Hudson 1990).
In recent years a great effort has been devoted to study nucleotide variation in coding and noncoding regions with different degrees of linkage with inversion complexes in well-known Drosophila model systems such as In(3L)Payne (Wesley and Eanes 1994; Hasson and Eanes 1996; Verrelli and Eanes 2000) and In(2L)t (Bénassi et al. 1993, 1999; Andolfatto, Wall, and Kreitman 1999; DePaulis, Brazier, and Veuille 1999; Andolfatto and Kreitman 2000; DePaulis et al. 2000) in D. melanogaster; the third chromosome inversion system (Popadic, Popadic, and Anderson 1995) and the sex ratio complex (Babcock and Anderson 1996) in D. pseudoobscura; and the O (Rozas et al. 1999) and A chromosome inversion complexes (Munté, Aguadé, and Segarra 2000) in D. subobscura. Three main conclusions can be drawn from these studies. First, patterns of distribution of nucleotide variation within and between arrangements are not compatible with the hypothesis that inversions are old balanced polymorphisms (Andolfatto, DePaulis, and Navarro 2001). Second, inversions have a monophyletic origin, even in those cases, such as D. melanogaster In(2L)t and D. buzzatii inversion 2J, in which the involvement of transposable elements in the origin of inversions has been demonstrated, (Andolfatto, Wall, and Kreitman 1999; Cáceres et al. 1999; Cáceres, Puig, and Ruiz 2001; and see also Lyttle and Haymer 1992; Ladeveze et al. 1998). Third, genetic exchange is greatly impeded in the vicinity of inversion breakpoints, while in the middle of inversions genetic exchange by means of double crossovers and/or gene conversion can lead to a complete homogenization of the genetic content between inverted and noninverted chromosomes (Navarro, Barbadilla, and Ruiz 2000; Andolfatto, DePaulis, and Navarro 2001).
In the cactophilic D. buzzatii, previous studies have shown that populations are strongly structured for the inversion polymorphism but less structured for variation at electrophoretic loci (Rodríguez et al. 2000). On one hand, the concordant latitudinal clines in inversion frequencies reported in original South American and recently colonized Australian populations are indicative of strong adaptive forces (Hasson et al. 1995; Rodríguez et al. 2000). On the other hand, clinal variation for allozymes linked to inversions can be mainly accounted for by hitchhiking with inversions, with the exception of the Esterase-2 locus (Rodríguez et al. 2000). Moreover, linkage disequilibrium between inversions and allozymes (Rodríguez et al. 2001) and patterns of population structure with different genetic markers (Rossi et al. 1996; Rodríguez et al. 2000) suggest that population expansions may have been important in the recent history of D. buzzatii.
In this article, we analyze intraspecific nucleotide variation at the Esterase-A (Est-A) locus in samples of three different second chromosome inversion arrangements segregating in natural populations of D. buzzatii and interspecific divergence with the closely related D. koepferae. This gene, which has been isolated, sequenced and described (East, Graham, and Whitington 1990) as a putative member of the ß-cluster of the carboxyl/cholinesterases family specific for carboxylester substrates (Oakeshott et al. 1999), is located in the center of the segments rearranged by two overlapping inversions, 2J and 2Z3. The former arose from a ST ancestral chromosome and the latter occurred on a 2J chromosome and defines the 2JZ3 arrangement that differs from 2J by a single overlapping inversion.
We address the following questions: (1) Are nucleotide sequence data consistent with the cytogenetic phylogeny accepted for the species? (2) Are closely linked regions like those inside the inversion J complex monophyletic? (3) Is nucleotide variation at Est-A compatible with neutral expectations? (4) Have derived arrangements reached mutation-drift-flux equilibrium? (5) Are the patterns of distribution of nucleotide variation within and between arrangements compatible with the hypothesis that the inversion polymorphism is an old balanced polymorphism? (6) Does the analysis of nucleotide variation allow us to infer the evolutionary forces that shaped the history of the inversion polymorphism?
![]() |
Materials and Methods |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
In Situ Hybridization
Salivary glands of third instar larvae grown in uncrowded cultures were dissected and processed according to the method of Montgomery, Charlesworth, and Langley (1987) to obtain polytene chromosome slides. A polymerase chain reaction (PCR) amplified fragment of 1.1 kb of Est-A was used as probe labeled with digoxigenin 11 dUTP (Boehringer Mannheim) using a random priming reaction. Prehybridization solutions and hybridization washes were performed according to the instructions of the manufacturer.
DNA Extraction, Amplification, and Sequencing
Genomic DNA was extracted from individual flies using the kit Puregene (Gentra systems). The published sequence of the Est-A gene (East, Graham, and Whitington 1990) was employed to design specific PCR and sequencing primers.
Initially a fragment of 1.1 kb uncovering most of the coding region of the gene was PCR amplified using primers (numbers refer to the nucleotide site in the published sequence) 913: 5'-CTACGTGGACGCGATAAAGG-3' (forward primer) and 2014: 5'-CGCTGAAGTTCCTATAGCCG-3' (reverse primer). All PCR reactions were carried out in final volumes of 50 µl containing 40 µM of each dNTP, 2 units of Taq DNA polymerase (Promega), and 90 ng of direct and reverse primers, 1.5 mM of MgCl2 and 1 µl of genomic DNA. Amplifications were performed according to the following PCR profile: 1 min at 94°C and 35 cycles of denaturation (1 min at 94°C), annealing (1 min at 60°C) and extension (2 min at 72°C), and a final extension step of 5 min at 72°C. All PCR products were gel purified from 1% low melting point agarose (GIBCO) gels and used as templates for the amplification of two shorter and overlapping fragments using two primer combinations 913-1615 (5'-CGATGCTTTGCGACTGTCAC-3') and 1391 (5'-TCGGTGGAGATCCAGAGAAC-3')-2014. PCR products were excised from low melting point agarose gels and purified using the QIAquick gel extraction kit (QIAGEN Inc.). Purified DNAs were cycle sequenced using fmol sequencing system (Promega). Both strands were sequenced using a set of primers spaced 300 nucleotides. Reactions were run on acrylamide gels and electrophoresed for 37 h depending on the length of the fragment.
DNA of D. koepferae was also extracted from individual flies and subjected to the procedures as described above, with minor modifications. Two additional primers were needed to complete the sequencing.
Sequences were aligned manually. All sequences reported in this study were deposited in GenBank under accession numbers AY113076AY113105.
Data Analysis
Nucleotide polymorphism at all sites, synonymous and replacement sites, were estimated as the number of segregating sites (S) and the average number of pairwise differences (k), the average number of pairwise differences per site () and the expected heterozygosity per site under the mutation drift-equilibrium given the observed S value (
S) (Watterson 1975).
Genetic differentiation between arrangements or between groups of populations was estimated as the average number of pairwise differences per site between arrangements/populations (dxy) and as the net number of nucleotide differences per site (da), according to equations 10.20 and 10.21 in Nei (1987), respectively. The extent of genetic differentiation was also analyzed by means of FST and KSTstatistics, two different measures of the proportion of nucleotide diversity attributable to variation between arrangements (or populations), (equations 3 and 9 in Hudson, Slatkin, and Maddison 1992; Hudson, Boos, and Kaplan 1992). Statistical significance of FST and KST was tested by means of permutations tests (Hudson, Boos, and Kaplan 1992) implemented in the program Proseq 2.9 (Filatov, D., http://www.biosciences.bahm.ac.uk/labs/filatov/proseq.html).
Linkage Disequilibrium and Recombination
The significance of pairwise associations between informative polymorphic sites was determined by means of Fisher Exact tests and adjusted for multiple comparisons by sequential Bonferroni tests (Rice 1989). In addition, the sign test on D (Lewontin 1995) was applied to search for overall evidence of linkage disequilibrium. This test allows analysis of linkage disequilibrium between independent pairs of polymorphic loci. For simplicity we considered adjacent pairs of sites as independent. The data set consisted of all polymorphic sites including singletons. The rationale of this test is to determine whether the number of positive and negative D values between pairs of sites differ from expectations under the null hypothesis of site independence.
The recombination parameter C (C = 4Nc, where N is the effective population size and c the recombination rate per generation between the most distant sites) was estimated using the Hudson and Kaplan (1985) method, based on the minimum number of recombination events determined using the four gamete rule. Sites with more than two segregating variants were excluded from the analysis. We also employed the method of Hudson (1987) to obtain estimates of the recombination parameter.
Tests of Neutrality
The tests of Tajima (1989), Fu and Li (1993), and Fu (1997) were applied to determine whether intraspecific patterns of variation detected in the Est-A region were in agreement with neutral predictions. Tajima's D test statistic is based on the standardized difference between two estimates of the neutral parameter ( = 4Nµ), one based on the observed average number of pairwise differences (k) and the other on the observed number of segregating sites in the sample (S). Fu and Li (1993) proposed tests that compare different estimators of the neutral parameter. Statistic D is based on the standardized difference between the total number of mutations (
) in all branches of the genealogy and the number of external mutations, and F is the standardized difference between k and the number of external mutations. External mutations are those occurring in external branches of the genealogy and can be defined as singletons absent in the sequence of an outgroup.
The test devised by Fu (1997), which examines the significance of the probability S' of having no fewer than k0 haplotypes in a random sample given that =
was also applied to the entire sample and to the subsamples of ST and J chromosomes.
DNAsp version 3.53 (Rozas and Rozas 1999) was employed to obtain estimates of nucleotide polymorphism, detection of conversion tracts, linkage disequilibrium and recombination, and to perform most neutrality tests. This program was also used to infer the statistical significance of Tajima, Fu, and Li's statistics by means of simulations of the coalescent process (10000 replications) assuming no recombination.
The direct relationship between polymorphism and divergence predicted by the neutral theory was tested by means of the average sliding G test (McDonald 1998). This test examines the distribution of polymorphic relative to diverged sites along a DNA sequence. The data set consists of an ordered series of variable sites classified as polymorphic or fixed differences. Values of the test statistics and their significance estimated by means of Monte Carlo simulations of a coalescent model incorporating both sampling and phylogenetic variation were estimated using the program DNASLIDER (McDonald 1998).
Neighbor-Joining trees (Saitou and Nei 1987) were obtained using MEGA version 2.1 (Kumar et al. 2001).
The distribution of observed and expected nucleotide polymorphism along the sequence was investigated using sliding window graphs obtained with DNAsp version 3.53.
![]() |
Results |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Nucleotide Variation
We sequenced a fragment of 1.1 kb in 30 D. buzzatii strains, 14 ST, 12 J, and 4 JZ3. These numbers are proportional to the average inversion frequencies observed in South American populations: ST = 0.45, J = 0.45 and JZ3 = 0.10, though the latter is slightly overrepresented in our sample. However, strains were randomly chosen within each chromosomal class with respect to the gene studied.
A total of 126 polymorphic sites were observed in the sample of 30 alleles (fig. 1); 8 sites were segregating for 3 nucleotide variants which implies that the total number of mutations () in our sample was 134. No indel mutations were detected. Nucleotide variation in the D. buzzatii Est-A gene region is summarized in table 1, in terms of the usual measures of nucleotide polymorphism
S and
. Sixty-four (69 mutations) synonymous sites and 62 (65 mutations) replacement sites were polymorphic, and 50% of those were unique mutations (fig. 1; table 1).
|
|
Nucleotide Variation Within and Between Chromosomal Arrangements
Levels of variation were estimated for the complete data set and for each chromosomal class (table 2). Comparisons among arrangements revealed that nucleotide variation at synonymous sites was 40% to 45% higher in ST chromosomes than in J and JZ3, and 20% to 34% higher at replacement sites.
|
Low levels of nucleotide differentiation among gene arrangements are consistent with the virtual absence of fixed differences (table 3). Moreover, 30% of the sites segregating in two arrangements were polymorphic for the same variants in both gene orders (shared polymorphisms). If we assume that inversions are unique events, i.e., are monophyletic, which in the case of inversion J seems to be valid (see Cáceres et al. 1999), there are only two possible explanations for the presence of shared polymorphism: (1) same mutations arising independently in both arrangements or (2) exchange of genetic information between gene arrangements by means of double crossovers and/or gene conversion. There were 24 (11 synonymous, 13 nonsynonymous) shared polymorphisms among the three chromosomal classes. We employed the hypergeometric distribution method to evaluate the probability of a certain number of shared polymorphic sites arising independently by parallel mutation in each chromosomal class. According to the number of synonymous polymorphisms detected in ST (48), J (33), and JZ3 (19), no more than two shared polymorphic sites (P = 0.068) are expected by chance. Similarly, given the observed number of nonsynonymous polymorphic sites in each chromosomal arrangement (42 in ST, 36 in 2J, and 18 in JZ3), no shared polymorphic sites are expected.
|
Analysis of linkage disequilibrium between pairs of informative sites using Fisher's Exact test for the full sample showed that 205 (14%) of 1,431 comparisons for 54 informative sites were significant. Similar analysis for each arrangement showed that for 37 informative sites in the sample of ST chromosomes, 63 of 666 comparisons (9%) were significant, whereas for J, 29 sites were informative and 80 of 406 comparisons (20%) yielded significant results. In both ST and J the proportion of significant comparisons was near the nominal rejection probability of 5%. However, in the case of arrangement J, the higher percentage of pairs of informative sites with significant linkage disequilibrium might be a result of the detected conversion tracts. Pairwise linkage disequilibrium between arrangements and each one of the informative sites in a subsample including only ST and J chromosomes yielded 14 of 55 significant comparisons, none according to Bonferroni criteria. However, as pointed out by Lewontin (1995), some tests of association cannot give significant results even with extreme values of disequilibrium, particularly when a very large fraction are represented in the data by singletons or only a very few copies of the rare variant. Therefore, we also analyzed linkage disequilibrium using the sign test on D (Lewontin 1995). In ST we performed 76 independent comparisons and, after pooling cells with low expected values in two classes, the goodness of fit test revealed a certain excess of negative D values (2 = 4.9, df = 1, P = 0.02), suggesting an excess of repulsion linkage in this arrangement. In contrast, in the 73 independent comparisons performed in J we detected an excess of positive D values (i.e., of coupling gametes) after pooling cells with low expected values in a single class (
2 = 5.8, df = 1, P = 0.01).
Est-A lies in a potentially high recombining region, and in the middle of the segments rearranged by inversions J and Z3. The minimum number of recombination events (MNRE) for the full sample was 25, and the estimate of the recombination parameter per site according to the method of Hudson (1997) was 0.072 (the method of Hey and Wakeley 1997 gave very similar results). However, estimates of the recombination parameter for each arrangement revealed important differences. On the one hand, C per site was high in ST (0.15) with a minimum number of 19 recombination events, whereas in J it was 78 times smaller (0.019), with a minimum number of recombination events of 11.
In figure 2 we present a Neighbor-Joining (NJ) tree based on synonymous variation including the 30 D. buzzatii alleles plus five D. koepferae Est-A sequences used as outgroups. Although recombination, which as shown above is high, may obscure the genealogical relationships among alleles, the aim of the inclusion of a NJ analysis is simply an exploratory method. There are two features in the tree that deserve attention. One is the absence of geographic structuring. Alleles of North Western (Chu, Til, Cat, Trh, and Qui), North Eastern (Ber and Tir), and Southern (Ota) populations of Argentina, which were shown to be highly differentiated for the inversion polymorphism and some electrophoretic loci (Rodríguez et al. 2000), are not grouped according to geographic origin. Indeed, genetic differentiation among populations was not significant either for synonymous (Fst = 0.0037, P = 0.61; Kst= 0.0059, P = 0.40) or nonsynonymous variation (Fst= 0.0132, P = 0.35; Kst = 0.0032, P = 0.36). In addition, differentiation among populations within arrangement classes was also not significant (not shown). The second aspect is that, although ST and derived arrangements do not form clearly defined clusters, nine J plus one JZ3 form a cluster (with low bootstrap support) that is also present in parsimony trees (not shown). The remaining J and JZ3 alleles, specifically those involved in events of genetic exchange, are associated with different ST alleles (fig. 2).
|
Fu's (1997) FS statistic was negative and significant for the complete dataset for both synonymous (FS = -18.48, P < 0.0001) and replacement substitutions (FS = -17.36, P < 0.0001) and for the subsamples of ST (FS = -4.38, P = 0.012 and FS = -4.92, P = 0.007 for silent and replacement substitutions, respectively) and J (FS = -4.39, P = 0.007 and FS = -4.68, P = 0.009 for silent and replacement substitutions, respectively). Significant negative values of FS indicate an excess of haplotypes relative to those expected under neutrality.
Neutrality Tests Based on the Correlation Between Polymorphism and Divergence
The traditional HKA test (Hudson, Kreitman, and Aguadé 1987) designed to test the fit of nucleotide polymorphism and interspecific divergence to the neutral model of evolution was not applied, because in D. buzzatii we do not have a locus that would serve as a neutral standard. Several tests have recently been reported aimed to analyze the distribution of polymorphic sites relative to diverged sites along a DNA sequence (McDonald 1998). One such test, the average sliding G test, which is among the most powerful of those analyzed in McDonald (1998), yielded significant results (G = 4.15) when applied to synonymous variation in the subsample of ST chromosomes, assuming no recombination (P = 0.02), high recombination (C = 100), which is close to the value estimated from the data (P = 0.03), and an intermediate level of recombination (C = 10), which is the value that maximized the probability (P = 0.05). Arrangement J was not considered, because nucleotide variation does not appear to be in mutation-drift-flux equilibrium (sensu Navarro, Barbadilla, and Ruiz 2000).
A complementary approach to test for heterogeneity in the distribution of silent polymorphism is to calculate a goodness-of-fit test (Kreitman and Hudson 1991), comparing levels of variation among a priori defined regions with those expected under a hypothesis of random distribution of segregating sites. The 1,061 nucleotides were then divided into four regions of equal size, and for each subregion the expected number of segregating sites was calculated on the basis of the number of sites segregating in the sample and the size of each region. Goodness-of-fit tests showed that the distributions of both synonymous (2 = 14.8 df = 3, P = 0.002) and replacement (
2 = 8.1 df = 3, P = 0.044) variation were significantly heterogeneous, mainly because of an excess of variation in the first half of the sequenced region. In contrast, similar tests for interspecific divergence showed that synonymous (
2 = 0.24, df = 3, P = 0.97) and replacement (
2 = 1.0, df = 3, P = 0.80) diverged sites are distributed homogeneously along the studied gene region.
Graphical Representation of the Patterns of Nucleotide Distribution
The significant results of the average sliding G test suggest the presence of regions of elevated polymorphism relative to neutral expectations. Thus, we explored the pattern of distribution of synonymous polymorphism along Est-A by means of a sliding window approach using a window of size of 25 base pairs and steps of 1 bp (essentially identical results were obtained varying window size).
The first graph shows the distribution of observed nucleotide heterozygosity at synonymous sites compared to neutral expectations (fig. 3a). Expected values were obtained by dividing the number of synonymous differences between D. buzzatii and D. koepferae by T + 1 (Hudson 1990; Kreitman and Hudson 1991). T + 1 ( 5) was calculated according to the formulae given in Hudson, Kreitman, and Aguadé (1987), after dividing Est-A sequence in two regions of equivalent numbers of silent sites and using one D. koepferae as the outgroup. T is the time since divergence of the two species measured in units of 2N generations (Hudson 1990). T + 1 can be thought of as a scaling factor between polymorphism and divergence (Kreitman and Hudson 1991).
|
These peaks of elevated polymorphism do not seem to be due to linkage of Est-A to the inversion polymorphism, because the mean number of synonymous differences between arrangements ST and J (12.70 ± 1.95, standard deviation computed by the bootstrap method) is not substantially larger than the mean number of differences between alleles within ST (12.58 ± 1.97) and greater than within J (8.46 ± 1.54). Moreover, the average number of differences between ST and J chromosomes at the sites where polymorphism appears to exceed neutral expectations is not higher than within arrangements (fig. 3b; JZ3 alleles were excluded from this analysis).
The comparison between D. buzzatii and D. koepferae sequences revealed the presence of 12 shared polymorphisms, 6 synonymous (at sites 9, 45, 81, 450, 456, and 459) and 6 replacement (at sites 14, 32, 77, 94, 421, and 452) (fig. 1), i.e., segregating for the same variants in both species (transpecific polymorphisms). Most of these shared replacement polymorphisms are located in regions of elevated synonymous polymorphism, and may be the putative targets of balancing selection. Particularly, amino acid polymorphisms Y/F at 26, H/N at 32, and A/D/N at 151 may be good candidates because variants are segregating at relatively high frequencies and consist of nonconservative substitutions for at least one of the amino acid physico-chemical properties usually employed to evaluate relatedness in protein sequences (Taylor 1986).
In the sliding window plot the average numbers of synonymous differences between allele families, at or near the putative selected sites, are substantially higher than between sequences within allele families at both amino acid sites 32 (average numbers of pairwise synonymous differences within the allelic classes bearing H and N are 10.40 ± 1.93 and 8.68 ± 1.82, respectively, and between classes is 12.05 ± 2.09) (fig. 3c) and 151 (average numbers of pairwise synonymous differences within the allelic classes bearing D and A are 9.55 ± 1.56 and 11.76 ± 2.10, respectively and between classes the average is 14.24 ± 2.42) (fig. 3d; sequences with N at site 151 were not included). Sliding window graphs in which Est-A alleles are classified according to amino acid sites 26 and 31 showed patterns similar to those in figure 3c. This is not unexpected given the association observed between variants at sites 5, 26, and 32 for which 6 of 8 possible haplotypes were present in our sample. In contrast, polymorphic amino acid sites, not shared between species and segregating for variants in intermediate frequencies, such as sites 28 and 77, did not show such peaks of elevated synonymous variation (data not shown). Likewise, we did not detect patterns of distribution of synonymous polymorphism similar to the one depicted in figure 3d when sequences were grouped according to amino acid variants segregating at sites close to residue 151 (results not shown).
Replacement Polymorphism and Amino Acid Variation
Ample replacement polymorphism was detected in our sample (fig. 4). At the level of the amino acid sequence some sites segregate for more than two amino acids, which in general were due to substitutions at different positions of the same codon, except codons 32, 73, and 215, in all of which the presence of three amino acid variants involved more than one substitution at the same position. Half of the amino acid substitutions were nonconservative changes for different properties such as charge, polarity, and volume. Replacement heterozygosity is extraordinarily high in the 5' half and in certain parts similar to nonsynonymous divergence between D. buzzatii and its sibling D. koepferae. In contrast, in the 3' half heterozygosity is markedly lower than interspecific divergence (see above). Most sites segregating for replacement variants are singletons or low frequency variants, suggesting that most amino acid variants may be slightly deleterious.
|
![]() |
Discussion |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Low functional constraint can only partially account for the high level of nucleotide diversity at Est-A, because divergence between D. buzzatii and D. koepferae at both synonymous and (mainly) replacement sites (KS = 0.153 and KR = 0.046) is also relatively high when compared to published data for Xdh (Rodríguez-Trelles, Alarcón, and Fontdevila 2000) (KS = 0.123 and KR = 0.022). Recent or ancient population subdivision may also be ruled out as an alternative explanation because geographic structure is not apparent either in the clustering of alleles in the NJ tree (fig. 2) or in the estimates of genetic differentiation among populations grouped according to geographic origin. Furthermore, South American D. buzzatii populations are only structured for the inversion polymorphism and certain adaptive allozyme polymorphisms (Rodríguez et al. 2000), whereas mt-DNA (Rossi et al. 1996) failed to reveal geographic subdivision.
The third possible explanation is balancing selection. Population genetic theory predicts a window of elevated silent polymorphism near the selected site, the width of the window depending on the local recombination rate (Hudson 1990), leading to a local departure from the predicted correlation between silent polymorphism and divergence.
The result of the average sliding G test seems to give support to this hypothesis. The presence of two apparent peaks of synonymous variation, which seems to be coincident with nonconservative amino acid substitutions in the sliding window plot (fig. 3a), is also consistent with this explanation. However, it is worth noting that the graphs should only be considered as an illustration of the McDonald (1998) test. The first peak, near the 5' end of the region sequenced, would be associated with amino acid polymorphisms at sites 26 and/or 32, whereas the second peak seems to be centered on amino acid site 151. Moreover, the six alleles carrying alanine (A) at codon 151 (Cat8, Tir20, Chu74, Chu45, Til3, and Ber36) appear ancestral in the NJ tree, whereas 10 of the 16 alleles that carry glutamic acid (E) at codon 31 and histidine (H) at codon 32 (Chu6, Chu7, Trh6, Chu2, Til1, Qui19, Chu118, Ota72, Ota73, and Qui7), which all happen to be on J or JZ3 chromosomes, form a cluster in NJ (fig. 2). Peaks of elevated polymorphism appear to be due to a greater average number of pairwise differences between alleles carrying different amino acids at sites 31 and/or 32 and 151 than between alleles within families (fig. 3c, 3d).
However, there are two issues that cast some doubts on our conclusion that the peaks of elevated polymorphism may be the footprints of balancing selection. The first is related to the problem of the use of several recombination rates, which raises the issue of multiple comparisons in the average sliding G test. If we correct for multiple comparisons using Bonferroni's correction, which in the present case is overconservative due to nonindependence, none of the tests would be significant. The second issue is that most of our analyses are applied on intra-allelic subsamples that are nonrandom subsamples of the total. In particular, under the hypothesis of balancing selection, we expect a lower polymorphism within allelic classes compared to the total sample and genetic differentiation between allelic classes (Innan and Tajima 1999). However, these effects are expected to vanish with the genetic distance from the selected site in a model with recombination, as can be observed in figure 3c and 3d. In fact, the values of the ratio of the sum of the average number of pairwise differences within allelic classes, as compared to the average pairwise number of differences between all sequences, were slightly lower than 2 (1.8 and 1.9 for sites 32 and 151, respectively), which is the expected value under neutrality and moderate rates of recombination (Innan and Tajima 1999). It should be noted, however, that under neutrality a similar intra-allelic effect to that under balancing selection is also expected (Innan and Tajima 1997, 1999).
The hypothesis that peaks of elevated silent polymorphism may be the footprints of balancing selection is also supported by the fact that polymorphic variants at the putative targets of selection, along with variants at neighboring sites, appear to be rather old because they are segregating for the same variants in D. koepferae, i.e., shared or transpecific polymorphisms. Well-known examples of polymorphisms shared between species are the Mhc in mammals (reviewed in Nei and Kumar 2000), self-incompatibility genes in plants (Charlesworth and Awadalla 1998), and Pgi in L. stylosa (Filatov and Charlesworth 1999). In Drosophila transpecific polymorphisms have been reported in the D. simulans species complex (Kliman et al. 2000), in D. pseudoobscura and close relatives (Wang et al. 1997), and in the virilis subgroup (Hilton and Hey 1997; Vieira 2002).
One possible explanation for shared polymorphisms in closely related species is recurrent mutation. However, the probability of 12 shared polymorphisms (6 synonymous and 6 nonsynonymous) conditional on the number of polymorphic sites in D. buzzatii (126, 64 synonymous and 62 replacement) and D. koepferae (30, 15 synonymous and nonsynonymous) is quite low (P < 107; P = 8 x 10-4 and P = 106, for total, synonymous, and replacement polymorphism, respectively). Yet, this method is based on the assumption of constant mutation rates, which in the case of Est-A may have weak support, given the presence of 8 sites segregating for 3 variants in D. buzzatii. However, the presence of 8 sites with two hits in a sequence of 1,061 nucleotides conditional on a total of 137 mutations is not an unexpected event for a Poisson process (goodness of fit 2 = 0.58, df = 2, P = 0.75). A similar conclusion can be reached for the 5 synonymous (
2 = 5.41, df = 2, P = 0.07) and 3 nonsynonymous (
2 = 2.74, df = 2, P = 0.25) sites with three variants.
Still, the number of sites with three variants may be an underestimate of sites with multiple hits. Under the Jukes and Cantor model, which assumes homogeneous rates among sites, one-third of double hits would be undetected. Hence, the number of sites with undetected double hits may be 4. If we add this quantity to the number of observed sites with three variants, the distribution of mutations along the sequence does not depart from the expectations of a Poisson process (2 = 2.83, df = 2, P = 0.24). However, it may be argued that this effect would be stronger if there is a transition/transversion bias. For instance, sites with two successive transitions would show only two variants and would thus not be detected. Assuming the Kimura 2 parameter model and using the transition/transversion ratio (
= ts/tv), we can calculate the relative probability of such undetected double hits. Given that there are two hits on the site the relative probability of undetected double transitions is ((
/ 2 +
))2, and (2/(2 +
)2) for double transversions. The remaining double hits that should lead to three detectable variants occur with a probability equal to (2 + 4
) / (2 +
)2. Therefore, the expected ratio of undetected/detected double hits is (2 +
2) / (2 + 4
). If we use in the latter formula our estimate of
= 1.2, calculated from our data, we obtain an estimate of the proportion of undetected double hits of 0.51. Again, as with Jukes and Cantor's model, the distribution of mutations does not depart from the expectations of a Poisson process (
2 = 2.94, df = 2, P = 0.23). Yet, the mutational hypothesis cannot be disregarded because we cannot rule out that under more complex models (412 parameters models) the number of observed double hits would depart from the expectations of a Poisson process.
We further investigated the issue of rate heterogeneity by means of the Akaike (1974) information criterion (AIC) using the program Modeltest (Posada and Crandall 1998). The AIC is a useful measure that allows comparisons among alternative models of sequence evolution by means of goodness-of-fit tests, in which AIC is the maximum value of the likelihood function for a specific model using a number of independently adjusted parameters (Posada and Crandall 2001). On the one hand, the best model fitting our data for fourfold degenerate sites did not include the gamma distribution (a measure of rate heterogeneity among sites) but included different rates of transitions and transversions. On the other hand, the alpha parameter of the gamma distribution was included among the parameters of the best models fitting the data for 0-fold ( = 0.43) and twofold (
= 0.71) degenerate sites.
An alternative explanation for transpecific polymorphisms is the persistence of neutral ancestral variants in populations of large size. However, population genetic theory predicts that the expected time for the loss of a shared ancestral neutral polymorphism is 1.7 N generations (Clark 1997). Available information suggests a relatively old divergence between D. buzzatii and D. koepferae. Cáceres, Puig, and Ruiz (2001) reported that D. buzzatii and D. martensis shared their last common ancestor around 5.8 Myr on the basis of divergence at the breakpoints of inversion J. In addition, in a recent phylogenetic study of the D. buzzatii complex using the Xdh locus (Rodríguez-Trelles, Alarcón, and Fontdevila 2000), the number of divergent synonymous sites between D. martensis and D. buzzatii was 86.5, while 62 separated D. buzzatii and D. koepferae. This gives an estimate of 4 Myr [(62 x 5.8) / 86.5] for the time of divergence between the latter two species and suggests that sufficient time has passed for most shared neutral ancestral polymorphisms to have become fixed.
Interspecific gene flow can also account for the presence of shared polymorphisms. D. buzzatii and D. koepferae are reproductively isolated by partial ecological isolation (Fanara, Fontdevila, and Hasson 1999) and postmating barriers due to sterility of male hybrid progeny (Naveira and Fontdevila 1986). Thus, hybrid females are a potential bridge for introgression. Recently, we have detected transpecific polymorphisms in the Xdh gene (R. Piccinali, M. Aguadé, and E. Hasson, unpublished results), which is also linked to the polymorphic second chromosome. This observation suggests that ancient or recent gene flow may be a plausible explanation for the extensive sharing of variants in the pair D. buzzatiiD. koepferae. Nevertheless, the distribution of silent polymorphism and divergence along the sequence of Est-A, compared to Xdh, suggests that the pattern observed in the present study is a particular feature of Est-A.
In order to disentangle the role of ancient or recent gene flow, balancing selection, recurrent mutation, persistence of ancestral polymorphisms, and the role played by adaptive inversion polymorphisms, it is necessary to compare nucleotide variation between gene regions linked to second chromosome inversions versus genes located in inversion free chromosomes.
Patterns of Nucleotide Variation Within and Between Arrangements
In general terms, the distribution of nucleotide variation among arrangements is in good agreement with expectations of population models of gene flux in inversion heterokaryotypes (Navarro et al. 1997). Genetic exchange between arrangements in the Est-A region has led to an extensive sharing of polymorphisms, as usually observed in genes located in central positions of inversions (Bénassi et al. 1993; Popadic, Popadic, and Anderson 1995; Hasson and Eanes 1996; Rozas et al. 1999). Conversely, in the neighborhood of the breakpoints of naturally occurring inversions, monophyly is unequivocal, because genetic exchange has not erased the genealogical relationships between alleles contained in different arrangements (Wesley and Eanes 1994; Popadic and Anderson 1995; Babcock and Anderson 1996; Hasson and Eanes 1996; Munté, Aguadé, and Segarra 2000; Navarro-Sabaté, Aguadé, and Segarra 1999; Rozas et al. 1999). Unique origin is also well supported, even in cases in which transposable elements appear to have been involved in the origin of inversions such as D. melanogaster In(2L)t (Andolfatto, Wall, and Kreitman 1999) and D. buzzatii inversion J (Cáceres et al. 1999).
The comparison of patterns of nucleotide variation at Est-A and the breakpoints of inversion J clearly shows that the presence of the inversion polymorphism differentially affects gene regions located in the center and close to the breakpoints. Genetic exchange due to gene conversion and/or double crossovers tends to homogenize genetic variation among arrangements in the middle of an inversion, whereas in the breakpoints neutral variation evolves largely independently in inverted and ancestral chromosomes. In D. buzzatii available evidence suggest that the distribution of nucleotide variation within and between gene arrangements at both the breakpoints and Est-A is not in agreement with those predicted by theoretical work modeling the population genetics of inversions (Navarro, Barbadilla, and Ruiz 2000), in particular 2J, a very common inversion that appears to be 1 Myr old (Cáceres, Puig, and Ruiz 2001). The lower nucleotide diversity, the greater linkage disequilibrium, the clustering of 10 derived alleles (9J and 1 JZ3) in the NJ tree, and, especially the extremely low sequence variation at the breakpoints reported in Cáceres, Puig, and Ruiz (2001) are not compatible with the idea of inversion J being an old lineage.
Possible explanations for the low level of variation in J chromosomes, such as recent origin and low effective population size, can be ruled out. A plausible alternative may be that the contemporary global frequency of arrangement J is not representative of its historical effective population size, but a reflection of a relatively recent expansion. The footprint of such an event may be an excess of rare variants, which is consistent with the results of Fu's test. But, the significant result obtained when this test was applied to the subsample of ST chromosomes, seems to indicate that the entire species had passed through a population expansion. However, the results of these tests should be interpreted with caution given that recombination tends to produce additional haplotypes. In this sense, the estimated recombination parameters suggest, assuming equal recombination rates in both arrangements, that the historical effective population size of ST has been at least 78 times larger than J, even though present average frequencies of both arrangements are almost equal. Thus, the most likely explanation seems to be that the frequency of arrangement J has risen in association with a recent expansion of the species.
Data on nucleotide sequence variation for actual neutral markers uncovering other regions along the rearranged segment are needed to understand the complex evolutionary history of the inversion polymorphism of D. buzzatii. The recently reported physical map based on sequenced RAPD markers (Laayouni, Santos, and Fontdevila 2000) may provide the tools to clarify the issues raised in this article.
![]() |
Acknowledgements |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
Footnotes |
---|
![]() |
Literature Cited |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Akaike, H. 1974. A new look at the statistical model identification. IEEE Trans. Autom. Contr. 19:716-723.
Andolfatto, P., F. DePaulis, and A. Navarro. 2001. Inversion polymorphism and nucleotide variability in Drosophila. Genet. Res. 77:1-8.[CrossRef][ISI][Medline]
Andolfatto, P., and M. Kreitman. 2000. Molecular variation at the In(2L)t proximal break point site in natural populations of Drosophila melanogaster and D. simulans. Genetics 154:1681-1691.
Andolfatto, P., J. D. Wall, and M. Kreitman. 1999. Unusual haplotype structure at the proximal break point of In(2l)t in a natural population of Drosophila melanogaster. Genetics 153:1297-1311.
Aquadro, C. F., A. L. Weaver, S. W. Schaeffer, and W. W. Anderson. 1991. Molecular evolution of inversions in Drosophila pseudoobscura: the amylase gene region. Proc. Natl. Acad. Sci. USA 88:305-309.[Abstract]
Ayala, F. J., B. S. Chang, and D. L. Hartl. 1993. Molecular evolution of the Rh3 gene in Drosophila. Genetica 92:23-32.[CrossRef][ISI][Medline]
Ayala, F. J., and D. L. Hartl. 1993. Molecular drift of the bride-of-sevenless (boss) gene in Drosophila. Mol. Biol. Evol. 10:1030-1040.[Abstract]
Babcock, C. S., and W. W. Anderson. 1996. Molecular evolution of the sex-ratio inversion complex in Drosophila pseudoobscura: analysis of the Esterase-5 gene region. Mol. Biol. Evol. 13:297-308.[Abstract]
Bénassi, V., S. Aulard, S. Mazeau, and M. Veuille. 1993. Molecular variation of Adh and P6 genes in an African population of Drosophila melanogaster and its relation to chromosomal inversions. Genetics 134:789-799.
Bénassi, V., F. DePaulis, G. Meghlaoui, and M. Veuille. 1999. Partial sweeping at the Fbp2 locus in a West African population of Drosophila melanogaster. Mol. Biol. Evol. 16:347-353.[Abstract]
Betrán, E., J. Rozas, A. Navarro, and A. Barbadilla. 1997. The estimation of the number and the length distribution of gene conversion tracts from population DNA sequence data. Genetics 146:89-99.
Cáceres, M., M. Puig, and A. Ruiz. 2001. Molecular characterization of two natural hotspots in the Drosophila buzzatii genome induced by transposon insertions. Genome Res. 11:1353-1364.
Cáceres, M., J. M. Ranz, A. Barbadilla, M. Long, and A. Ruiz. 1999. Generation of a widespread Drosophila inversion by a transposable element. Science 285:415-418.
Charlesworth, D., and P. Awadalla. 1998. Flowering plant self-incompatibity: the molecular population genetics of Brassica S-loci. Heredity 81:1-9.[CrossRef][ISI][Medline]
Clark, A. G. 1997. Neutral behavior of shared polymorphism. Proc. Natl. Acad. Sci. USA 94:7730-7734.
DePaulis, F., L. Brazier, S. Mousset, A. Turbé, and M. Veuille. 2000. Selective sweep near in an African population of Drosophila melanogaster. Genet. Res. 76:149-158.[CrossRef][ISI][Medline]
DePaulis, F., L. Brazier, and M. Veuille. 1999. Selective sweep at the Drosophila melanogaster Supressor of Hairless locus, and its association with the In(2L)t inversion polymorphism. Genetics 152:1017-1024.
East, P., A. Graham, and G. Whitington. 1990. Molecular isolation and preliminary characterization of a duplicated esterase locus in Drosophila buzzatii. Pp. 389406. in J. S. F. Barker, W. T. Starmer and R. J. MacIntyre, eds. Ecological and Evolutionary Genetics of Drosophila. Plenum Press, New York.
Fanara, J. J., A. Fontdevila, and E. Hasson. 1999. Oviposition preference, viability, developmental time and body size in the cactophilic sibling species Drosophila buzzatii and Drosophila koepferae in association to their natural hosts. Evo. Ecol. 13:173-190.[CrossRef][ISI]
Filatov, D. A., and D. Charlesworth. 1999. DNA polymorphism, haplotype structure and balancing selection in the Leavenworthia Pgi locus. Genetics 153:1423-1434.
Fu, Y. X. 1997. Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics 147:915-925.
Fu, Y. X., and W. H. Li. 1993. Statistical tests of neutrality of mutations. Genetics 133:693-709.
Hasson, E., and W. Eanes. 1996. Contrasting histories of three gene regions associated with In(3L)Payne of Drosophila melanogaster. Genetics 144:1565-1575.
Hasson, E., C. Rodríguez, J. J. Fanara, H. Naveira, O. A. Reig, and A. Fontdevila. 1995. The evolutionary history of Drosophila buzzatii. XXVI. Macrogeographic patterns in the inversion polymorphism in New World populations. J. Evol. Biol. 8:369-384.[ISI]
Hey, J., and J. Wakeley. 1997. A coalescent estimator of the population recombination parameter. Genetics 145:833-846.
Hilton, H., and J. Hey. 1997. A multilocus view of speciation in the Drosophila virilis species group reveals complex histories and taxonomic conflicts. Genet. Res. 70:185-194.[CrossRef][ISI]
Hudson, R. J. 1987. Estimating the genetic recombination parameter of a finite population model without selection. Genet. Res. 50:245-250.[ISI][Medline]
Hudson, R. J., and N. L. Kaplan. 1985. Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111:147-164.
Hudson, R. R. 1990. Gene genealogies and the coalescent process. Oxf. Surv. Evol. Biol. 7:1-44.
Hudson, R. R., D. D. Boos, and N. L. Kaplan. 1992. A statistical test for detecting population subdivision. Mol. Biol. Evol. 9:138-151.[Abstract]
Hudson, R. R., M. Kreitman, and M. Aguadé. 1987. A test of neutral molecular evolution based on nucleotide data. Genetics 116:153-159.
Hudson, R. R., M. Slatkin, and W. P. Maddison. 1992. Estimation of levels of gene flow from DNA sequence data. Genetics 132:583-589.
Innan, H., and F. Tajima. 1997. The amounts of nucleotide variation within and between allelic classes, and the reconstruction of common ancestral sequence. Genetics 147:1431-1444.
Innan, H., and F. Tajima. 1999. The effect of selection on the amounts of nucleotide variation within and between allelic classes. Genet. Res. 73:15-28.[CrossRef][ISI]
Inomata, N., H. Shibara, E. Okuyama, and T. Yamazaki. 1995. Evolutionary relationships and sequence variation of -amylase variants encoded by duplicated genes in the Amy locus of Drosophila melanogaster. Genetics 141:237-244.
Karotam, J., T. M. Boyce, and J. G. Oakeshott. 1995. Nucleotide variation at the hypervariable esterase 6 isozyme locus of Drosophila simulans. Mol. Biol. Evol. 12:113-122.[Abstract]
Kliman, R. M., P. Andolfatto, J. A. Coyne, F. DePaulis, M. Kreitman, A. J. Berry, J. McCarter, J. Wakeley, and J. Hey. 2000. The population genetics of the origin and divergence of the Drosophila simulans complex species. Genetics 156:1913-1931.
Kreitman, M., and R. R. Hudson. 1991. Inferring the evolutionary histories of the Adh and Adh-dup loci in Drosophila melanogaster from patterns of polymorphism and divergence. Genetics 127:565-582.
Krimbas, C. B., and J. R. Powell. 1992. Introduction. Pp. 152 in C. B. Krimbas and J. R. Powell, eds. Drosophila inversion polymorphism. CRC Press, Boca Raton, FL.
Kumar, S., K. Tamura, I. B. Jakobsen, and M. Nei. 2001. MEGA2: Molecular evolutionary genetics analysis software. Arizona State University, Tempe, Arizona.
Laayouni, H., M. Santos, and A. Fontdevila. 2000. Toward a physical map of Drosophila buzzatii. Use of randomly amplified polymorphic DNA polymorphisms and sequence tagged site landmarks. Genetics 156:1797-1816.
Ladeveze, V., S. Aulard, N. Chaminade, G. Periquet, and F. Lemeunier. 1998. Hobo transposons causing chromosomal break points. Proc. R. Soc. Lond. Ser. B. Biol. Sci. 265:1157-1159.[CrossRef][ISI][Medline]
Lewontin, R. C. 1995. The detection of linkage disequilibrium in molecular sequence data. Genetics 140:377-388.
Lyttle, T. W., and D. S. Haymer. 1992. The role of transposable element hobo in the origin of endemic inversions in wild populations of Drosophila melanogaster. Genetica 86:113-126.[ISI][Medline]
McDonald, J. H. 1998. Improved tests for heterogeneity across a region of DNA sequence in the ratio of polymorphism to divergence. Mol. Biol. Evol. 15:377-384.[Abstract]
Montgomery, E. A., B. Charlesworth, and C. H. Langley. 1987. A test of the role of natural selection in the stabilization of transposable element copy number in a population of Drosophila melanogaster. Genet. Res. 49:31-41.[ISI][Medline]
Munté, A., M. Aguadé, and C. Segarra. 2000. Nucleotide variation at the yellow gene region is not reduced in Drosophila subobscura: a study in relation to chromosomal polymorphism. Mol. Biol. Evol. 17:1942-1955.
Navarro, A., A. Barbadilla, and A. Ruiz. 2000. Effect of inversion polymorphism on the neutral nucleotide variability of linked chromosomal regions in Drosophila. Genetics 155:685-698.
Navarro, A., E. Betrán, A. Barbadilla, and A. Ruiz. 1997. Recombination and gene flux caused by gene conversion and crossing over in inversion heterokaryotypes. Genetics 146:695-709.
Navarro-Sabaté, A., M. Aguadé, and C. Segarra. 1999. The relationship between allozyme and chromosomal polymorphism inferred from nucleotide variation at the Acph-1 gene region of Drosophila subobscura. Genetics 153:871-889.
Naveira, H., and A. Fontdevila. 1986. The evolutionary history of Drosophila buzzatii. XII. The genetic basis of sterility in hybrids between D. buzzatii and its sibling D. koepferae from Argentina. Genetics 114:841-857.
Nei, M. 1987. Molecular evolutionary genetics. Columbia University Press, New York.
Nei, M., and S. Kumar. 2000. Molecular evolution and phylogenetics. Oxford University Press, Oxford.
Oakeshott, J. G., C. Claudianos, R. J. Russell, and G. C. Robin. 1999. Carboxyl/cholinesterases: a case study of the evolution of a successful multigene family. Bioessays 21:1031-1042.[CrossRef][ISI][Medline]
Popadic, A., and W. W. Anderson. 1995. Evidence for gene conversion in the amylase multigene family of Drosophila pseudoobscura. Mol. Biol. Evol. 12:564-572.[Abstract]
Popadic, A., D. Popadic, and W. W. Anderson. 1995. Interchromosomal exchange of genetic information between gene arrangements of the third chromosome of Drosophila pseudoobscura. Mol. Biol. Evol. 12:938-943.[Abstract]
Posada, D., and K. A. Crandall. 1998. MODELTEST: testing the model of DNA substitution. Bioinformatics 14:817-818.[Abstract]
Posada, D., and K. A. Crandall. 2001. Selecting models of nucleotide substitution: an application to human immunodeficiency virus 1 (HIV-1). Mol. Biol. Evol. 18:897-906.
Powell, J. R. 1997. Progress and prospects in evolutionary biology: the Drosophila model. Oxford University Press, New York.
Rice, W. R. 1989. Analyzing tables of statistical tests. Evolution 43:223-225.[ISI]
Rodríguez, C., R. Piccinali, E. Levy, and E. Hasson. 2000. Contrasting population structures using allozyme variation and the inversion polymorphism in Drosophila buzzatii. J. Evol. Biol. 13:976-984.[CrossRef][ISI]
Rodríguez, C., R. Piccinali, E. Levy, and E. Hasson. 2001. Gametic associations between inversion and allozyme polymorphisms in Drosophila buzzatii. J. Hered. 92:382-391.
Rodríguez-Trelles, F., L. Alarcón, and A. Fontdevila. 2000. Molecular evolution and phylogeny of the buzzatii complex (Drosophila repleta group): a maximum-likelihood approach. Mol. Biol. Evol. 17:1112-1122.
Rossi, M., Barrio, E., A. Latorre, J. E. Quezada-Diaz, E. Hasson, A. Moya, and A. Fontdevila. 1996. The evolutionary history of Drosophila buzzatii. XXX. Mitochondrial DNA polymorphism in original and colonizing populations. Mol. Biol. Evol. 13:314-323.[Abstract]
Rozas, J., and R. Rozas. 1999. DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics 15:174-175.
Rozas, J., C. Segarra, G. Ribó, and M. Aguadé. 1999. Molecular population genetics of the rp49 gene region in different chromosomal inversions of Drosophila subobscura. Genetics 151:189-202.
Saitou, N., and M. Nei. 1987. The Neighbor-Joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4:406-425.[Abstract]
Schaeffer, S. W., C. Scott Walthour, D. M. Toleno, A. T. Olek, and E. L. Miller. 2001. Protein variation in ADH and ADH-related in Drosophila pseudoobscura: linkage disequilibrium between single nucleotide polymorphisms and protein alleles. Genetics 159:673-687.
Schmid, K. J., L. Nigro, C. H. Aquadro, and D. Tautz. 1999. Large number of replacement polymorphisms in rapidly evolving genes of Drosophila: implications for genome-wide surveys DNA polymorphism. Genetics 153:1717-1729.
Tajima, F. 1989. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585-595.
Taylor, W. R. 1986. The classification of amino acid conservation. J. Theor. Biol. 119:205-218.[ISI][Medline]
Verrelli, B. C., and W. F. Eanes. 2000. Extensive amino acid polymorphism at the Pgm locus is consistent with adaptive protein evolution in Drosophila melanogaster. Genetics 156:1737-1752.
Vieira, J. 2002. Two divergent species of the virilis group, Drosophila littoralis and D. virilis, share a replacement polymorphism at the fused locus. Mol. Biol. Evol. 19:579-581.
Wang, R. L., J. Wakeley, and J. Hey. 1997. Gene flow and natural selection in the origin of Drosophila pseudoobscura and close relatives. Genetics 147:1091-1106.
Watterson, G. A. 1975. On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7:256-276.[ISI][Medline]
Wayne, M. L., D. Contamine, and M. Kreitman. 1996. Molecular population genetics of ref(2)P, a locus which confers viral resistance in Drosophila. Mol. Biol. Evol. 13:191-199.[Abstract]
Wesley, C., and W. F. Eanes. 1994. Isolation and analysis of the breakpoint sequences of chromosome inversion In(3L)Payne in Drosophila melanogaster. Proc. Natl. Acad. Sci. USA 91:3132-3136.[Abstract]