* University of British Columbia, Botanical Garden and Centre for Plant Research, Vancouver, British Columbia, Canada
Royal Botanic Garden Edinburgh, Edinburgh, UK
Montana State University
Correspondence: E-mail: quentin.cronk{at}ubc.ca.
![]() |
Abstract |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Key Words: nonsynonymous/synonymous rate ratio flower morphology LEGCYC CYCLOIDEA Papilionoideae Lupinus
![]() |
Introduction |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Inferences of adaptive molecular evolution from nucleotide sequence data typically focus on the rate of nonsynonymous substitution (dN) relative to that of synonymous substitution (dS), with = dN/dS (Goldman and Yang 1994; Muse and Gaut 1994). The most common method of estimating
from interspecific data is to optimize stochastic models of codon substitution on phylogenetic trees representing the genealogical history of the sequences under scrutiny. Codon models of molecular evolution in a phylogenetic context are becoming increasingly fine-grained in identifying substitution patterns that represent potentially adaptive evolution, to a point where it is now possible to test statistically for individual sites with elevated dN/dS ratios along specific lineages of a phylogenetic tree (Yang and Nielsen 2002). Such analyses of genes that regulate morphological development are thus a potentially powerful means of connecting directional or diversifying selection at the molecular level to adaptive phenotypic change at the morphological level.
The focus of this study is the angiosperm group Lupinus (Leguminosae: Papilionoideae: Genisteae). Papilionoid legumes have flowers that are characteristically zygomorphic (i.e., differentiated along but symmetric about a dorsoventral axis), in which the dorsal, lateral, and ventral petals make up the standard, wings, and keel, respectively. Typically the standard is reflexed (i.e., held upright) and is large in proportion to the rest of the flower, presumably functioning to attract pollinators and effect outcrossing.
The morphology and anatomy of flowers in Leguminosae have been well-studied (e.g. see Ferguson and Tucker 1994); in contrast, the genetic control of legume flower development is only beginning to be studied and understood. Molecular phylogenetic studies of Leguminosae (Citerne et al. 2003) discovered genes with homology to CYCLOIDEA (CYC) genes in Antirrhinum majus. CYC genes are transcription factors in the TCP gene family that are known to be involved in flower development, in particular conferring adaxial floral organ identity and hence zygomorphy (Luo et al. 1996). In legumes, two major groups (I and II) of LEGCYC genes have been identified, and within Genisteae two copies of group I genes, LEGCYC1B and LEGCYC1A, are known (Citerne et al. 2003). These latter genes are paralogs resulting from a gene duplication event that predates the origin of Genisteae (Citerne et al. 2003). Both are expressed in the dorsal region of the developing perianth (H. Citerne, unpublished data), which is consistent with expression patterns of CYC in Antirrhinum (Luo et al. 1999).
In this article we describe the molecular evolution of LEGCYC group I paralogs in Lupinus in relation to flower morphology, with emphasis on the western New World species. Despite considerable interspecific variation in foliage and seed coat characters (Aïnouche and Bayer 1999; Heyn and Herrnstadt 1977), all species of Lupinus have typical papilionoid flowers. Much of the floral variation in the group occurs in the size of the dorsal petal (standard) relative to the lateral petals (wings). In this respect most lupin species are fairly uniform, having large standards relative to the wings (i.e., the flowers are standard-dominated; fig. 1A). Flowers of L. densiflorus are an exception: they are wing-dominated and smaller than those of other lupin species, characteristics that are possibly associated with self-pollination. Lupinus densiflorus is thus a particularly interesting species because its wing-dominated floral morphology is unusual within lupins and may represent a morphological adaptation corresponding to a shift in breeding system. In this article we use likelihood models of codon evolution and phylogenetic inference to investigate the hypothesis that in the L. densiflorus lineage, the change to wing-dominated flowers is associated with higher rates of nonsynonymous evolution in genes known to regulate floral development. As a point of comparison, we also test the lineage leading to the North American lupin species flock, along which we have no reason to believe LEGCYC paralogs should have experienced directional selection, on account of the plesiomorphic morphological uniformity of flowers in that group. By focusing on closely related species, the time scale of this study provides a "snapshot" of gene evolution compared to previous broader surveys within Leguminosae (Citerne et al. 2003). Inferences of LEGCYC evolution at the scale of all legumes were limited because LEGCYC homologs were too divergent to reliably align outside of the conserved TCP and R domains. By focusing more narrowly, we hope to obtain a more fine-grained view of the molecular evolution of genes that are likely to be important in flower development.
|
![]() |
Methods |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
|
LEGCYC1B and LEGCYC1A were amplified by polymerase chain reaction (PCR) using 34 cycles with denaturation at 94°C for 60 s, annealing at 50°C to 55°C for 60 s, and extension at 72°C for 90 s. We amplified each locus separately in two to three fragments each using a combination of primers designed for LEGCYC group 1, and primers designed specifically for LEGCYC1A and LEGCYC1B. Primer sequences are available from the authors on request. Amplified fragments were sequenced directly and assembled into contigs using Sequencher (Gene Codes, Ann Arbor, Mich.). In some cases locus-specific primers failed to amplify their target; in others, apparent allelic length differences in two regions of trinucleotide repeats in LEGCYC1B resulted in unreadable chromatograms downstream of the point of variation. For these samples, PCR was performed using the general primers LEGCYC1_F1 and LEGCYC1_R1 (Citerne, unpublished) and the products were cloned. Clones containing LEGCYC1B and LEGCYC1A were selected for sequencing based on diagnostic digests using EcoR1. The ITS region, including the 5.8S subunit, was amplified with forward primer ITS5 and reverse primer ITS4 (White et al. 1990) using 34 cycles with denaturation at 94°C for 60 s, annealing at 55°C to 58°C for 60 s, and extension at 72°C for 90 s. In some cases multiple bands were consistently amplified, so the appropriate fragment (ca. 800 bp in length) was isolated either by agarose gel extraction or by cloning.
The PCR products were purified using QiaQuick kits (Qiagen) and sequenced directly or cloned into the pCR 2.1-TOPO vector using TOPO TA cloning kits (Invitrogen, Carlsbad, Calif.). One to four clones were sequenced from each ligation reaction using vector primers M13F and M13R. Automated sequencing was carried out on ABI PRISM 377 DNA sequencers.
Phylogenetic Analyses
One representative sequence of each gene (LEGCYC1B, LEGCYC1A, and ITS) was selected from each sample. Sequences were aligned "by eye." Phylogenetic analyses were conducted using PAUP* version 4b10 (Swofford 1998) and MrBayes (Huelsenbeck and Ronquist 2001). Gaps were treated as missing data. The most appropriate nucleotide substitution model for phylogenetic analysis of each gene was chosen using the Akaike Information Criterion (AIC) (Akaike 1974). Hierarchical model comparisons (Posada and Crandall 2001) were facilitated by the use of MrModeltest (Nylander 2002). Estimates of substitution parameters from the model selection process were used as starting values for subsequent phylogenetic analyses.
For individual genes, maximum-likelihood (ML) trees were found using PAUP* by heuristic searches employing ten random-addition replicates with tree bisection-reconnection (TBR) branch swapping. Bayesian posterior probabilities of clades were found by Metropolis-coupled Markov chain Monte Carlo (MCMC) using MrBayes. For individual genes, we ran MCMC analyses with four chains (three "heated") for 1,300,000 generations. We discarded the first 300,000 generations as burn-in after visually confirming probable chain stationarity from plots of likelihood against generation. We also combined the LEGCYC1B, LEGCYC1A, and ITS alignments into a gene-partitioned "total evidence" data set for a Bayesian analysis in which each partition was assigned its optimal substitution model with parameters that were estimated independently for each gene. This analysis ran for 3,000,000 generations (including 1,000,000 for burn-in) with five chains (four heated). For all MCMC runs, chains were sampled every 100 generations.
Phylogenetic Congruence Tests
We used the Shimodaira-Hasegawa (SH) test (Shimodaira and Hasegawa 1999) implementation in PAUP* to assess the degree to which the different data sets are in agreement with respect to tree topology. We tested each data set against a set of 1,413 trees comprising the ML topologies from each data set and all topologies visited by the Markov chain in the total evidence analysis. This reasonably fulfills the requirement of the SH test that the set of trees contains all topologies that can reasonably be considered to be the true tree (Goldman, Anderson, and Rodrigo 2000).
Analyses of LEGCYC Coding Region Evolution
Models of codon evolution and tests for selection on LEGCYC paralogs were evaluated on phylogenies obtained in the previous analyses using the program codeml from the PAML package, version 3.13 (Yang 1997). To represent our best possible estimate of Lupinus phylogeny, we selected the Bayesian total-evidence tree with maximum posterior probability that could not be rejected statistically by any individual data set. In addition, we evaluated all models on the optimal trees from single-gene analyses of each LEGCYC gene to assess the sensitivity of our results to tree topology.
We evaluated a range of codon models for measuring selection and compared nested models by the standard likelihood ratio test (LRT) statistic, 2, against the
2 distribution with degrees of freedom equal to the difference in number of parameters. In all cases, we ran the analyses multiple times with different starting values of
and
to ensure that parameter estimates were stable. The codon models are described in more detail in the original articles, but we provide a brief overview of them here, following the model names used in Yang and Nielsen (2002). M0 is the simplest codon model with a single
parameter for all sites and branches of the phylogeny. M1, M2, and M3 are the site models in which
varies among sites but is constant across the phylogeny. M1 is the "neutral" model with two site classes that restricts
0 = 0 (implying strict purifying selection) and
1 = 1 (neutral evolution). M2, the "selection" model, is an extension of M1 and has an additional site class for which
2 can take any value. M3, the "discrete" model, is an extension of M0 and has k site classes, each with a separate
parameter. In this study we used k = 2 because additional site classes did not improve the likelihood, and having two site classes facilitates comparison with branch-site model B (see below). M7 and M8 use a discrete beta distribution to approximate among-site
variation, differing only in the latter, allowing an additional site class with
> 1 (Yang et al. 2000).
The branch models allow to vary among branches on the phylogeny but not among sites. We used two variants: the "free-ratio" model, in which
is estimated separately for each branch on the phylogeny, and a "two-ratio" model, in which the phylogeny is partitioned into "background" and "foreground" branches and
is estimated independently for each branch partition (Yang 1998). We targeted the Lupinus densiflorus branch and its sister lineage, the branch subtending the North American species, as foreground branches.
The branch-site models of Yang and Nielsen (2002) allow sites to have different nonsynonymous rates along foreground branches specified a priori and as such are similar to the two-ratio model described above. These models incorporate three parameters: two "background" ratios,
0 and
1, that are constrained to be less than or equal to one, and one ratio specific to the the foreground lineage of interest,
2, that may exceed one. Each codon site has either
0 or
1 across the entire tree (i.e., is not under selection), or just along background lineages, with
2 along foreground lineages (i.e., positively selected). Thus there are four site classes in these models. Model A of Yang and Nielsen (2002) restricts
0 = 0 and
1 = 1, and is therefore a branch-specific extension of model M1. Model B places no restrictions on
parameters and is thus a branch-specific extension of model M3.
To assess variation along the length of the LEGCYC paralogs, estimates of substitution rate and were obtained along a 60-bp sliding window of the alignments. HYPHY (Muse and Pond 2002) was used to estimate substitution rate and codeml was used to estimate
.
![]() |
Results |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Phylogenetic Analysis
The alignment of LEGCYC1B is 1,450 nucleotides in length, with coding sequence from sites 11320, followed by a short (approx. 95-bp) intron, and a short region of further coding sequence that was not used in tests of selection (see below). The alignment of LEGCYC1A is 1,292 nucleotides in length, with coding sequence from sites 11133. The aligned length of ITS, including the 5.8S region, was 681 nucleotides. For phylogenetic inference, AIC model comparisons indicated the use of the GTR + I + model for LEGCYC1B and LEGCYC1A, and the GTR +
model for ITS.
Phylogenetic analyses of each gene (LEGCYC1B, LEGCYC1A, and ITS) each yielded single ML trees that are shown in figure 2. The relationships inferred by each gene are generally consistent with one another and with previous analyses of ITS by Aïnouche and Bayer (1999). In that study, five major groups within Lupinus were identified: two comprising eastern and western New World species, respectively (clades A and E) and three comprising Old World species (clades B, C, and D), but ITS alone did not provide sufficient signal to resolve relationships among these groups. In this study the eastern New World lupins (clade A) are represented by L. texensis, but representatives of clade B were not sampled.
|
In only a single case, the placement of L. texensis, do different genes (LEGCYC1B and LEGCYC1A) each support conflicting relationships with posterior probability 0.95. If we consider the root of the phylogeny to be along the branch subtending the outgroup, Genista tenera, LEGCYC1B places L. texensis as the sister group to the rest of Lupinus, while LEGCYC1A places the clade of Old World species (L. cosentinii + L. digitatus + L. albus) as the sister group to the New World clades. It is possible that more comprehensive taxonomic sampling would resolve this conflict, which may be due to long-branch attraction of L. texensis to Genista tenera.
Combining the three genes for total-evidence phylogenetic analysis provides a more resolved and robust estimate of Lupinus phylogeny than analyses of single genes. The Markov chain sampled 1,410 distinct tree topologies from the posterior distribution. The tree with highest posterior probability (0.062) is shown in figure 3. All strongly supported clades inferred from single genes are also strongly supported by the total-evidence Bayesian analysis, with the exception of the position of L. texensis, yielding a total of 11 strongly supported nodes. This tree also cannot be rejected statistically (SH test) by any single-gene data set, and we therefore interpret it to be the best overall estimate of relationships among the species sampled, and use it in subsequent analyses of codon evolution in LEGCYC1B and LEGCYC1A. Perhaps the most notable new relationship revealed by the LEGCYC and three-gene analyses is within the western New World lineage, the North American species form a clade distinct from L. densiflorus, whose North American distribution may represent a disjunction from its origins in a primarily South American lineage.
|
Patterns of Molecular Evolution
Mean treelengths (substitutions/site ± SD) from the posterior distributions of trees visited by MCMC for each gene are as follows: LEGCYC1B, 0.415 ± 0.026; LEGCYC1A, 0.508 ± 0.030; ITS, 0.269 ± 0.029. Given that identical sets of taxa were sampled for each gene, and assuming independent gene trees, treelengths from the single-gene analyses reflect overall evolutionary rates for each gene. By this measure both LEGCYC genes have higher rates than ITS, which is consistent with their apparently greater phylogenetic signal. LEGCYC1A also has a higher substitution rate overall than LEGCYC1B, despite the fact that, being more ultrametric, its gene tree appears shorter than that of LEGCYC1B in terms of root-to-tip distance (fig. 2). The LEGCYC1A tree has longer branches, particularly toward the tips, and it has a greater length overall than LEGCYC1B.
For both LEGCYC1B and LEGCYC1A, substitution rates and values vary substantially among codon sites and across the phylogeny. Likelihood scores and parameter estimates for the various codon models for each gene are given in table 2. We estimated ancestral sequences with model M0 and mapped the synonymous and nonsynonymous changes on the phylogeny (fig. 4). On many branches, LEGCYC1B and LEGCYC1A have undergone different amounts of synonymous and nonsynonymous evolution.
|
|
In general, analyses of LEGCYC alignments using the site models with selection (M0, M3, and M8) provide no evidence for positive selection having acted on sites across the whole phylogeny. For LEGCYC1A with M8, = 1.510, but the proportion of sites in this class is exceedingly small (< 0.00001) and the support for this model over M7 is nonsignificant. In contrast, the sliding-window analysis reveals two regions of each LEGCYC paralog for which
> 1 (fig. 5). Likelihood ratio tests show that for LEGCYC1B, these peaks are not significantly greater than one, but for LEGCYC1A, the windows at positions 702 (
= 6.33), 960 (
= 4.25), and 966 (
= 7.29) do support the hypothesis of positive selection (P = 0.049, 0.038, and 0.005, respectively). Thus positive selection may have acted at some sites in these windows, but model M8 is unable to detect them. The difference in results may be due to the effect of conditioning on codon frequencies estimated from the whole alignment versus frequencies local to the sliding window.
|
Tests for Selection Along the L. densiflorus and North American Branches
Two-Ratio Model
For both LEGCYC paralogs the two-ratio model does not suggest positive selection along either the L. densiflorus branch or the North American branch. Likelihood ratio tests of M0 versus the two-ratio model are only significant for LEGCYC1B and the North American branch (table 3), and in this case the foreground 1 is close to zero and is less than the background
0, suggesting that, if anything, purifying selection was greater along the North American branch than elsewhere in the tree.
|
The positions of the positively selected sites in LEGCYC1B are scattered across the coding region, as are the seven other sites inferred to have undergone replacement substitution along the L. densiflorus branch (fig. 6). None fall within the TCP or R domains. Only one putative positively selected site is situated 5-prime of the TCP domain. In terms of secondary structure, most of the sites that change fall outside of predicted alpha helix regions (fig. 6).
|
Sensitivity of Results to Tree Topology
The results from the branch-site tests for selection appear to be somewhat sensitive to tree topology. Using the ML tree topology from the LEGCYC1A single-gene analysis, 2 < 1 and models M1 and M3 cannot be rejected in favor of models A and B, for both the L. densiflorus and North American branches (P >> 0.05 in all cases). If the LEGCYC1A tree is a more accurate estimate of that gene's history than the total-evidence tree, our inferences of positive selection along the L. densiflorus branch may be an artifact of apparent homoplasy in nonsynonymous substitutions caused by enforcing an incorrect topology on the data. However, with the SH test, the LEGCYC1A topology is rejected by both the LEGCYC1B data set and the total-evidence data set. Thus the preponderance of evidence would recommend the total-evidence tree as the preferred topology.
![]() |
Discussion |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Inferring Selection on LEGCYC from Codon Models
Codon-based evolutionary models suggest that selection pressure on LEGCYC genes in Lupinus has been varied among sites and among lineages. Estimates of raw nucleotide and synonymous and nonsynonymous rates of evolution describe a heterogeneous pattern of episodic strong purifying selection, relaxed purifying selection, and positive selection, with little congruence in the molecular signatures of these processes across genes. We term this pattern a "selectional mosaic" and suggest that it may be typical of teosinte branched 1-cycloidea-PCF (TCP) genes, most of which show high rates of evolution.
Our results indicate that sites in a coding region may experience either greater positive selection or greater purifying selection along a particular branch of a phylogeny. The branch-site models of Yang and Nielsen (2002) appear to be more sensitive in detecting variation in than models that accommodate among-site or among-branch variation in
, but not both. Positive selection is not supported at either LEGCYC locus using any site or branch model, but this may be due to a preponderance of neutral or purifying selection over most of the phylogeny, a pattern that is consistent with the sliding window analysis. This accords with what realistically might occur when natural selection acts on a protein, i.e., only a small proportion of amino acids (e.g. at active binding sites) change as its function evolves. In this light the branch-site models thus represent a substantial advance in inferential power for studies of molecular evolution, but a better understanding of the accuracy and power of LRTs (e.g., see Anisimova Bielawski, and Yang 2000) based on the branch-site models are needed. It is possible that previous studies that failed to detect positive selection with the site or branch models (e.g., Remington and Purugganan 2002; Hileman and Baum 2003) might upon reanalysis find evidence for positive selection with the branch-site models.
The idea of a selectional mosaic suggests one aspect of the current branch-site models that might be refined, namely the single foreground parameter 2 and its use as an indicator of selection pressure along the foreground lineage. Consider model B and LEGCYC1B: the large value of
2 along the L. densiflorus branch suggests positive selection, but statistical support for the model falls just on the side of nonsignificance, whereas for the North American branch, support for the model is highly significant but
2 is close to zero. The difference in support between branches is likely related to the proportion of sites in each case that have
2 along the foreground branch (
2 +
3): for the L. densiflorus branch
2 +
3 is small (0.026), while for the North American branch
2 +
3 is large (0.745). In other words, only 2.6% of sites are estimated to have higher nonsynonymous rates along the L. densiflorus branch, while 74.5% of sites are estimated to have lower nonsynonymous rates along the North American branch. These results do not preclude the possibility that positive selection did occur at some sites along the North American branch; after all, six synapomorphic amino acid substitutions are inferred along it without homoplasy. However, any signal of positive selection would be swamped out by the large proportion of sites that have lower nonsynonymous rates along the North American branch (or alternatively have higher nonsynonymous rates elsewhere in the tree). Likewise, detecting the small number of sites under positive selection along the L. densiflorus branch is contingent on the apparent absence of sites that have higher nonsynonymous rates elsewhere in the tree.
Given the objective of identifying adaptive coding region evolution along a particular branch of a gene phylogeny, one might consider alternative formulations of the branch-site model. For example, consider a model with two parameters associated with the foreground branch rather than just one. This would account for sites with lower nonsynonymous rates along the foreground branch in addition to sites with higher nonsynonymous rates, as might be expected if adaptation of the protein required amino acid fixation at some sites and replacement substitutions at others. One drawback of this proposal would be an increase in model complexity, with six site classes required instead of four, and the resulting loss of statistical power (flattening of the likelihood surface). To compensate for this, other alternatives could be explored, e.g., a model with two foreground
parameters and only a single background
parameter, which is the converse of how those parameters are distributed in model B; this would yield three site classes. The main liability of having two
ratios along the foreground branch, however, would be the interpretation of results: it might be difficult to tell whether support for the model comes from those sites under stronger positive selection along the foreground branch, or those sites under stronger purifying selection, or both. To discern this, it would be necessary to compare likelihood scores and
parameter estimates with those of the branch-site models of Yang and Nielsen (2002).
Rates of LEGCYC Evolution
What are the main factors responsible for the overall faster evolutionary rates of LEGCYC genes compared to ITS? One hypothesis is that rapid, divergent evolution of LEGCYC somehow promotes speciation; this has been shown in gamete recognition proteins of animals (e.g., Hellberg et al. 2000; Metz and Palumbi 1996). Evolution of transcriptional regulators is commonly believed to be a chief agent in the generation of novel morphological phenotypes that could drive lineage divergence (Doebley and Lukens 1998). However, it is unlikely that LEGCYC evolution has consistently driven speciation in Lupinus, which apart from L. densiflorus shows relatively little variation in floral morphology; if it had, we would expect to see a stronger signal of positive (diversifying) selection on sites over the whole phylogeny.
An alternative explanation is that the fast rate of evolution is a consequence of the selectional mosaic. The idea is that most of the time, for most sites in the gene, purifying selection is relaxed, allowing neutral substitutions to accumulate rapidly. Strong purifyng selection acts perodically and only on a small percentage of sites, and therefore has little effect on constraining gene evolution. This pattern of evolution comes from the fact that only certain sites (the DNA binding sites) are important, while the rest are relatively less constrained.
The faster substitution rate of LEGCYC1A over LEGCYC1B could be the result of functional redundancy following gene duplication (Ohno 1970), releasing LEGCYC1A from functional constraints and leading to relaxed purifying selection. Preliminary evidence suggests that like CYC genes in Antirrhinum, the role of LEGCYC lies in establishing floral organ identity: both LEGCYC genes are expressed adaxially in the developing flower, but LEGCYC1A apparently has a smaller area of expression than LEGCYC1B (H. Citerne, unpublished data).
Role of LEGCYC in Papilionoid Flower Evolution
In this study we find a high rate of nonsynonymous evolution at a handful of sites in a floral regulatory gene, LEGCYC1B, associated with a phylogenetic branch along which a morphological shift occurred from standard-dominated flowers to the wing-dominated flowers of L. densiflorus. Support for the branch-site models falls just short of statistical significance, but given the function of LEGCYC as a transcriptional regulator, we expect the number of sites under selection to be small and therefore less likely to be detected. Furthermore, there is a striking contrast in pattern with LEGCYC1A, which appears to have undergone greater purifying selection along the L. densiflorus branch. For these reasons we are inclined to favor the hypothesis that positive selection acted on LEGCYC1B in conjunction with this morphological transition. Of the two LEGCYC paralogs, LEGCYC1B appears to be the more promising candidate for further studies of the genetics of legume flower development. Determining quantitative expression profiles of each gene in L. densiflorus floral organs, and gene silencing/knockout mutant studies might provide additional evidence of functional differentiation of these loci and their role in developmental regulation.
Studies of LEGCYC in other legume taxa representing more dramatic morphological shifts away from the "typical" papilionoid flower are also needed, particularly those in which selection on standard morphology is implicated. For example, shifts to bird pollination have occurred in several legume groups such as Erythrina (Bruneau 1997), presumably involving intense selection for floral traits characteristic of that syndrome, e.g., an elongated, tubelike perianth and reduced standard size. Cadia, an enigmatic legume with unusual radially symmetrical flowers likely derived from a papilionoid ancestor, is also a prime candidate for LEGCYC analysis as this may help determine whether the flower has evolved as a result of homeosis or heterochrony (Cronk 2002).
![]() |
Acknowledgements |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
![]() |
Footnotes |
---|
Brandon Gaut, Associate Editor
![]() |
Literature Cited |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Akaike, H. 1974. A new look at the statistical model identification. IEEE Trans. Automat. Contr. 19:716-723.
Anisimova, M., J. P. Bielawski, and Z. Yang. 2000. Accuracy and power of the likelihood ratio test in detecting adaptive molecular evolution. Mol. Biol. Evol. 18:1585-1592.[ISI]
Aïnouche, A.-K., and R. J. Bayer. 1999. Phylogenetic relationships in Lupinus (Fabaceae: Papilionoideae) based on internal transcribed spacer sequences (ITS) of nuclear ribosomal DNA. Am. J. Bot. 86:590-607.
Barrier, M., R. H. Robichaux, and M. D. Purugganan. 2001. Accelerated regulatory gene evolution in an adaptive radiation. Proc. Natl. Acad. Sci. USA 98:10208-10213.
Bruneau, A. 1997. Evolution and homology of bird pollination syndromes in Erythrina (Leguminosae). Am. J. Bot. 84:54-71.[Abstract]
Citerne, H. L., D. Luo, R. T. Pennington, E. Coen, and Q. C. B. Cronk. 2003. A phylogenomic investigation of CYCLOIDEA-like TCP genes in the Leguminosae. Plant Physiol. 131:1042-1053.
Cronk, Q. C. B. 2001. Plant evolution and development in a post-genomic context. Nat. Rev. Genet. 2:607-619.[CrossRef][ISI][Medline]
Cronk, Q. C. B. 2002. Perspectives and paradigms in plant evo-devo. Pp. 114 in Q. C. B. Cronk, R. M. Bateman, and J. A. Hawkins, eds. Developmental genetics and plant evolution. Taylor and Francis, London.
Doebley, J., and L. Lukens. 1998. Transcriptional regulators and the evolution of plant form. Plant Cell 10:1075-1082.
Ferguson, I. K., and S. C. Tucker, (eds.). 1994. Advances in Legume Systematics 6: Structural Botany. Royal Botanic Gardens Kew, Richmond, Surrey, UK.
Goldman, N., J. P. Anderson, and A. G. Rodrigo. 2000. Likelihood-based tests of topologies in phylogenetics. Syst. Biol. 49:652-670.[CrossRef][ISI][Medline]
Goldman, N., and Z. Yang. 1994. A codon-based model of nucleotide substitution for protein-coding DNA sequences. Mol. Biol. Evol. 11:725-736.
Hellberg, M. E., G. W. Moy, and V. D. Vacquier. 2000. Positive selection and propeptide repeats promote rapid interspecific divergence of a gastropod sperm protein. Mol. Biol. Evol. 17:458-466.
Heyn, C. C., and I. Herrnstadt. 1977. Seed coat of Old World Lupinus species. Bot. Not. 130:427-435.[ISI][Medline]
Hileman, L. C., and D. A. Baum. 2003. Why do paralogs persist? Molecular evolution of CYCLOIDEA and related floral symmetry genes in Antirrhineae (Veronicaceae). Mol. Biol. Evol. 20:591-600.
Huelsenbeck, J. P., and F. R. Ronquist. 2001. MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics 17:745-755.
Lukens, L., and J. Doebley. 2001. Molecular evolution of the teosinte branched gene among maize and related grasses. Mol. Biol. Evol. 18:627-638.
Luo, D., R. Carpenter, L. Copsey, C. Vincent, J. Clark, and E. Coen. 1999. Control of organ asymmetry in flowers of Antirrhinum. Cell 99:367-376.[ISI][Medline]
Luo, D., R. Carpenter, C. Vincent, L. Copsey, and E. S. Coen. 1996. Origin of floral asymmetry in Antirrhinum. Nature 383:794-799.[CrossRef][ISI][Medline]
Metz, E. C., and S. R. Palumbi. 1996. Positive selection and sequence rearrangements generate extensive polymorphism in the gamete recognition protein bindin. Mol. Biol. Evol. 13:397-406.[Abstract]
Muse, S. V., and B. S. Gaut. 1994. A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome. Mol. Biol. Evol. 11:715-724.
Muse, S. V., and S. L. K. Pond. 2002. HYPHY: hypothesis testing using phylogenies. Distributed by the authors from http://www.hyphy.org.
Nylander, J. A. A. 2002. MrModeltest, version 1.1b. Distributed by the author, Department of Systematic Zoology, EBC, Uppsala University, Sweden; johan.nylander@ebc.uu.se.
Ohno, S. 1970. Evolution by gene duplication. Springer-Verlag, New York.
Ouali, M., and R. D. King. 2000. Cascaded multiple classifiers for secondary structure prediction. Prot. Sci. 9:1162-1176.[Abstract]
Posada, D., and K. A. Crandall. 2001. Selecting the best-fit model of nucleotide substitution. Syst. Biol. 50:580-601.[CrossRef][ISI][Medline]
Remington, D. L., and M. D. Purugganan. 2002. GAI homologues in the Hawaiian silversword alliance (Asteraceae-Madiinae): molecular evolution of growth regulators in a rapidly diversifying plant lineage. Mol. Biol. Evol. 19:1563-1574.
Shimodaira, H., and M. Hasegawa. 1999. Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol. Biol. Evol. 16:1114-1116.
Swofford, D. L. 1998. PAUP* 4.0. Sinauer Associates, Sunderland, Mass.
White, T. J., T. Bruns, S. Lee, and J. Taylor. 1990. Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics. Pp. 315322 in M. Innis, D. Gelfand, J. Sninsky, and T. J. White, eds. PCR Protocols: A guide to methods and applications. Academic Press, San Diego, Calif.
Yang, Z. 1997. PAML: a program package for phylogenetic analysis by maximum likelihood. Comput. Appl. Biosci. 13:555-556.[Medline]
Yang, Z. 1998. Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Mol. Biol. Evol. 15:568-573.[Abstract]
Yang, Z., and R. Nielsen. 2002. Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages. Mol. Biol. Evol. 19:908-917.
Yang, Z., R. Nielsen, N. Goldman, and A.-M. K. Pederson. 2000. Codon-substitution models for heterogeneous selection pressure at amino acid sites. Genetics 155:431-449.