Latest issue

Systematic Biology - RSS feed of current issue

URL

XML feed
http://sysbio.oxfordjournals.org

Last update

1 hour 37 min ago

April 11, 2013

00:16
00:16

Despite African rivers containing high species diversity, continental-scale studies investigating the mechanisms generating biological diversity of African riverine faunas are limited compared with lacustrine systems. To investigate the build-up of diversity in a tropical aquatic continental radiation, we test different models of lineage diversification and reconstruct the biogeographic history in a species-rich siluriform genus, Synodontis (~130 species), with a broad distribution across all major tropical African drainage basins. The resulting robust species-level phylogeny (~60% complete, based on a multigene data set) exhibits a near constant rate of lineage accumulation throughout the mid-Cenozoic to recent, irrespective of missing species and despite the changing environmental conditions that were prevalent during this time period. This pattern contrasts with the findings for species-level diversification of large clades that commonly show an early burst of cladogenesis followed by declining rates through time. The identification of distinct biogeographic clades demonstrates a correlation between river hydrology and cladogenesis, although there is evidence of recent repeat dispersal into the southern range of the focal group. We conclude that diverse freshwater fish radiations with tropical continental distributions represent important organisms to test hypotheses of diversification and investigate the effects of palaeo-landscapes and climates on present day biodiversity. [Biogeography; Cenozoic; diversification rates; molecular dating; phylogeny; Siluriformes; tropical rivers.]

00:16

Despite being the objects of numerous macroevolutionary studies, many of the best represented constituents of the fossil record—including diverse examples such as foraminifera, brachiopods, and mollusks—have mineralized skeletons with limited discrete characteristics, making morphological phylogenies difficult to construct. In contrast to their paucity of phylogenetic characters, the mineralized structures (tests and shells) of these fossil groups frequently have distinctive shapes that have long proved useful for their classification. The recent introduction of methodologies for including continuous data directly in a phylogenetic analysis has increased the number of available characters, making it possible to produce phylogenies based, in whole or part, on continuous character data collected from such taxa. Geometric morphometric methods provide tools for accurately characterizing shape variation and can produce quantitative data that can therefore now be included in a phylogenetic matrix in a nonarbitrary manner. Here, the marine gastropod genus Conus is used to evaluate the ability of continuous characters—generated from a geometric morphometric analysis of shell shape—to contribute to a total evidence phylogenetic hypothesis constructed using molecular and morphological data. Furthermore, the ability of continuous characters derived from geometric morphometric analyses to place fossil taxa with limited discrete characters into a phylogeny with their extant relatives was tested by simulating the inclusion of fossil taxa. This was done by removing the molecular partition of individual extant species to produce a "cladistic pseudofossil" with only the geometric morphometric derived characters coded. The phylogenetic position of each cladistic pseudofossil taxon was then compared with its placement in the total evidence tree and a symmetric resampling tree to evaluate the degree to which morphometric characters alone can correctly place simulated fossil species. In 33–45% of the test cases (depending upon the approach used for measuring success), it was possible to place the pseudofossil taxon into the correct regions of the phylogeny using only the morphometric characters. This suggests that the incorporation of extinct Conus taxa into phylogenetic hypotheses will be possible, permitting a wide range of macroevolutionary questions to be addressed within this genus. This methodology also has potential to contribute to phylogenetic reconstructions for other major components of the fossil record that lack numerous discrete characters. [Continuous characters; Conus; eigenshape analysis; fossil taxa; gastropoda; geometric morphometrics; phylogenetic analysis.]

00:16

In phylogenetic studies, the evolution of molecular sequences is assumed to have taken place along the phylogeny traced by the ancestors of extant species. In the presence of lateral gene transfer, however, this may not be the case, because the species lineage from which a gene was transferred may have gone extinct or not have been sampled. Because it is not feasible to specify or reconstruct the complete phylogeny of all species, we must describe the evolution of genes outside the represented phylogeny by modeling the speciation dynamics that gave rise to the complete phylogeny. We demonstrate that if the number of sampled species is small compared with the total number of existing species, the overwhelming majority of gene transfers involve speciation to and evolution along extinct or unsampled lineages. We show that the evolution of genes along extinct or unsampled lineages can to good approximation be treated as those of independently evolving lineages described by a few global parameters. Using this result, we derive an algorithm to calculate the probability of a gene tree and recover the maximum-likelihood reconciliation given the phylogeny of the sampled species. Examining 473 near-universal gene families from 36 cyanobacteria, we find that nearly a third of transfer events (28%) appear to have topological signatures of evolution along extinct species, but only approximately 6% of transfers trace their ancestry to before the common ancestor of the sampled cyanobacteria. [Gene tree reconciliation; lateral gene transfer; macroevolution; phylogeny.]

00:16

Understanding factors driving diversity across biodiversity hotspots is critical for formulating conservation priorities in the face of ongoing and escalating environmental deterioration. While biodiversity hotspots encompass a small fraction of Earth's land surface, more than half the world's plants and two-thirds of terrestrial vertebrate species are endemic to these hotspots. Tropical Southeast (SE) Asia displays extraordinary species richness, encompassing four biodiversity hotspots, though disentangling multiple potential drivers of species richness is confounded by the region's dynamic geological and climatic history. Here, we use multilocus molecular genetic data from dense multispecies sampling of freshwater fishes across three biodiversity hotspots, to test the effect of Quaternary climate change and resulting drainage rearrangements on aquatic faunal diversification. While Cenozoic geological processes have clearly shaped evolutionary history in SE Asian halfbeak fishes, we show that paleo-drainage re-arrangements resulting from Quaternary climate change played a significant role in the spatiotemporal evolution of lowland aquatic taxa, and provide priorities for conservation efforts. [Freshwater; geology; halfbeak; island radiation; Miocene; Pleistocene; river; Southeast Asia.]

00:16

Long-term specialization may limit the ability of a species to respond to new environmental conditions and lead to a higher likelihood of extinction. For permanent parasites and other symbionts, the most intriguing question is whether these organisms can return to a free-living lifestyle and, thus, escape an evolutionary "dead end." This question is directly related to Dollo's law, which stipulates that a complex trait (such as being free living vs. parasitic) cannot re-evolve again in the same form. Here, we present conclusive evidence that house dust mites, a group of medically important free-living organisms, evolved from permanent parasites of warm-blooded vertebrates. A robust, multigene topology (315 taxa, 8942 nt), ancestral character state reconstruction, and a test for irreversible evolution (Dollo's law) demonstrate that house dust mites have abandoned a parasitic lifestyle, secondarily becoming free living, and then speciated in several habitats. Hence, as exemplified by this model system, highly specialized permanent parasites may drastically de-specialize to the extent of becoming free living and, thus escape from dead-end evolution. Our phylogenetic and historical ecological framework explains the limited cross-reactivity between allergens from the house dust mites and "storage" mites and the ability of the dust mites to inhibit host immune responses. It also provides insights into how ancestral features related to parasitism (frequent ancestral shifts to unrelated hosts, tolerance to lower humidity, and pre-existing enzymes targeting skin and keratinous materials) played a major role in reversal to the free-living state. We propose that parasitic ancestors of pyroglyphids shifted to nests of vertebrates. Later the nest-inhabiting pyroglyphids expanded into human dwellings to become a major source of allergens. [Ancestral ecology; Dollo's law; evolutionary "dead end"; house dust mites; permanent parasitism; Pyroglyphidae.]

00:16

Genome-scale data offer the opportunity to clarify phylogenetic relationships that are difficult to resolve with few loci, but they can also identify genomic regions with evolutionary history distinct from that of the species history. We collected whole-genome sequence data from 29 taxa in the legume genus Medicago, then aligned these sequences to the Medicago truncatula reference genome to confidently identify 87 596 variable homologous sites. We used this data set to estimate phylogenetic relationships among Medicago species, to investigate the number of sites needed to provide robust phylogenetic estimates and to identify specific genomic regions supporting topologies in conflict with the genome-wide phylogeny. Our full genomic data set resolves relationships within the genus that were previously intractable. Subsampling the data reveals considerable variation in phylogenetic signal and power in smaller subsets of the data. Even when sampling 5000 sites, no random sample of the data supports a topology identical to that of the genome-wide phylogeny. Phylogenetic relationships estimated from 500-site sliding windows revealed genome regions supporting several alternative species relationships among recently diverged taxa, consistent with the expected effects of deep coalescence or introgression in the recent history of Medicago. [Medicago; phylogenomics; whole-genome resequencing.]

00:16

The branching structure of biological evolution confers statistical dependencies on phenotypic trait values in related organisms. For this reason, comparative macroevolutionary studies usually begin with an inferred phylogeny that describes the evolutionary relationships of the organisms of interest. The probability of the observed trait data can be computed by assuming a model for trait evolution, such as Brownian motion, over the branches of this fixed tree. However, the phylogenetic tree itself contributes statistical uncertainty to estimates of rates of phenotypic evolution, and many comparative evolutionary biologists regard the tree as a nuisance parameter. In this article, we present a framework for analytically integrating over unknown phylogenetic trees in comparative evolutionary studies by assuming that the tree arises from a continuous-time Markov branching model called the Yule process. To do this, we derive a closed-form expression for the distribution of phylogenetic diversity (PD), which is the sum of branch lengths connecting the species in a clade. We then present a generalization of PD which is equivalent to the expected trait disparity in a set of taxa whose evolutionary relationships are generated by a Yule process and whose traits evolve by Brownian motion. We find expressions for the distribution of expected trait disparity under a Yule tree. Given one or more observations of trait disparity in a clade, we perform fast likelihood-based estimation of the Brownian variance for unresolved clades. Our method does not require simulation or a fixed phylogenetic tree. We conclude with a brief example illustrating Brownian rate estimation for 12 families in the mammalian order Carnivora, in which the phylogenetic tree for each family is unresolved. [Brownian motion; comparative method; Markov reward process; phylogenetic diversity; pure-birth process; quantitative trait evolution; trait disparity; Yule process.]

00:16

Public DNA databases are becoming too large and too complex for manual methods to generate phylogenetic supermatrices from multiple gene sequences. Delineating the terminals based on taxonomic labels is no longer practical because species identifications are frequently incomplete and gene trees are incongruent with Linnaean binomials, which results in uncertainty about how to combine species units among unlinked loci. We developed a procedure that minimizes the problem of forming multilocus species units in a large phylogenetic data set using algorithms from graph theory. An initial step established sequence clusters for each locus that broadly correspond to the species level. These clusters frequently include sequences labeled with various binomials and specimen identifiers that create multiple alternatives for concatenation. To choose among these possibilities, we minimize taxonomic conflict among the species units globally in the data set using a multipartite heuristic algorithm. The procedure was applied to all available GenBank data for Coleoptera (beetles) including > 10 500 taxon labels and > 23 500 sequences of 4 loci, which were grouped into 11 241 clusters or divergent singletons by the BlastClust software. Within each cluster, unidentified sequences could be assigned to a species name through the association with fully identified sequences, resulting in 510 new identifications (13.9% of total unidentified sequences) of which nearly half were "trans-locus" identifications by clustering of sequences at a secondary locus. The limits of DNA-based clusters were inconsistent with the Linnaean binomials for 1518 clusters (13.5%) that contained more than one binomial or split a single binomial among multiple clusters. By applying a scoring scheme for full and partial name matches in pairs of clusters, a maximum weight set of 7366 global species units was produced. Varying the match weights for partial matches had little effect on the number of units, although if partial matches were disallowed, the number increased greatly. Trees from the resulting supermatrices generally produced tree topologies in good agreement with the higher taxonomy of Coleoptera, with fewer terminals compared with trees generated according to standard filtering of sequences using species labels. The study illustrates a strategy for assembling the tree-of-life from an ever more complex primary database. [BlastClust; data mining; graph theory; incongruence; multipartite matching; species delimitation; supermatrix.]

00:16

Polyploidy is an important speciation mechanism, particularly in land plants. Allopolyploid species are formed after hybridization between otherwise intersterile parental species. Recent theoretical progress has led to successful implementation of species tree models that take population genetic parameters into account. However, these models have not included allopolyploid hybridization and the special problems imposed when species trees of allopolyploids are inferred. Here, 2 new models for the statistical inference of the evolutionary history of allopolyploids are evaluated using simulations and demonstrated on 2 empirical data sets. It is assumed that there has been a single hybridization event between 2 diploid species resulting in a genomic allotetraploid. The evolutionary history can be represented as a species network or as a multilabeled species tree, in which some pairs of tips are labeled with the same species. In one of the models (AlloppMUL), the multilabeled species tree is inferred directly. This is the simplest model and the most widely applicable, since fewer assumptions are made. The second model (AlloppNET) incorporates the hybridization event explicitly which means that fewer parameters need to be estimated. Both models are implemented in the BEAST framework. Simulations show that both models are useful and that AlloppNET is more accurate if the assumptions it is based on are valid. The models are demonstrated on previously analyzed data from the genera Pachycladon (Brassicaceae) and Silene (Caryophyllaceae). [Allopolyploid; Bayesian; hybridization; network; phylogenetics.]