Science Enabled by Specimen Data

Marcussen, T., H. E. Ballard, J. Danihelka, A. R. Flores, M. V. Nicola, and J. M. Watson. 2022. A Revised Phylogenetic Classification for Viola (Violaceae). Plants 11: 2224.

The genus Viola (Violaceae) is among the 40–50 largest genera among angiosperms, yet its taxonomy has not been revised for nearly a century. In the most recent revision, by Wilhelm Becker in 1925, the then-known 400 species were distributed among 14 sections and numerous unranked groups. Here, we provide an updated, comprehensive classification of the genus, based on data from phylogeny, morphology, chromosome counts, and ploidy, and based on modern principles of monophyly. The revision is presented as an annotated global checklist of accepted species of Viola, an updated multigene phylogenetic network and an ITS phylogeny with denser taxon sampling, a brief summary of the taxonomic changes from Becker’s classification and their justification, a morphological binary key to the accepted subgenera, sections and subsections, and an account of each infrageneric subdivision with justifications for delimitation and rank including a description, a list of apomorphies, molecular phylogenies where possible or relevant, a distribution map, and a list of included species. We distribute the 664 species accepted by us into 2 subgenera, 31 sections, and 20 subsections. We erect one new subgenus of Viola (subg. Neoandinium, a replacement name for the illegitimate subg. Andinium), six new sections (sect. Abyssinium, sect. Himalayum, sect. Melvio, sect. Nematocaulon, sect. Spathulidium, sect. Xanthidium), and seven new subsections (subsect. Australasiaticae, subsect. Bulbosae, subsect. Clausenianae, subsect. Cleistogamae, subsect. Dispares, subsect. Formosanae, subsect. Pseudorupestres). Evolution within the genus is discussed in light of biogeography, the fossil record, morphology, and particular traits. Viola is among very few temperate and widespread genera that originated in South America. The biggest identified knowledge gaps for Viola concern the South American taxa, for which basic knowledge from phylogeny, chromosome counts, and fossil data is virtually absent. Viola has also never been subject to comprehensive anatomical study. Studies into seed anatomy and morphology are required to understand the fossil record of the genus.

Santos, J. M., C. Capinha, J. Rocha, and C. A. Sousa. 2022. The current and future distribution of the yellow fever mosquito (Aedes aegypti) on Madeira Island. J. T. Wu [ed.]. PLOS Neglected Tropical Diseases 16: e0010715.

The Aedes aegypti mosquito is the main vector for several diseases of global importance, such as dengue and yellow fever. This species was first identified on Madeira Island in 2005, and between 2012 and 2013 was responsible for an outbreak of dengue that affected several thousand people. However, the potential distribution of the species on the island remains poorly investigated. Here we assess the suitability of current and future climatic conditions to the species on the island and complement this assessment with estimates of the suitability of land use and human settlement conditions. We used four modelling algorithms (boosted regression trees, generalized additive models, generalized linear models and random forest) and data on the distribution of the species worldwide and across the island. For both climatic and non-climatic factors, suitability estimates predicted the current distribution of the species with good accuracy (mean area under the Receiver Operating Characteristic curve = 0.88 ±0.06, mean true skill statistic = 0.72 ±0.1). Minimum temperature of coldest month was the most influential climatic predictor, while human population density, residential housing density and public spaces were the most influential predictors describing land use and human settlement conditions. Suitable areas under current climates are predicted to occur mainly in the warmer and densely inhabited coastal areas of the southern part of the island, where the species is already established. By mid-century (2041–2060), the extent of climatically suitable areas is expected to increase, mainly towards higher altitudes and in the eastern part of the island. Our work shows that ongoing efforts to monitor and prevent the spread of Ae. aegypti on Madeira Island will have to increasingly consider the effects of climate change.
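The true skill statistic (TSS) reported above is conventionally defined as sensitivity plus specificity minus one, computed from a presence/absence confusion matrix. The following is a minimal sketch of that standard definition, not the authors' code; the function name and the counts in the usage note are hypothetical and chosen only for illustration.

```python
def true_skill_statistic(tp: int, fn: int, tn: int, fp: int) -> float:
    """Return TSS = sensitivity + specificity - 1 from confusion-matrix counts.

    tp/fn: presences predicted as suitable/unsuitable
    tn/fp: absences predicted as unsuitable/suitable
    (Illustrative helper; not taken from the paper.)
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one presence and one absence")
    sensitivity = tp / (tp + fn)  # share of presences correctly predicted
    specificity = tn / (tn + fp)  # share of absences correctly predicted
    return sensitivity + specificity - 1.0
```

With hypothetical counts of 40 correctly predicted presences, 10 missed presences, 45 correctly predicted absences, and 5 false alarms, the function returns approximately 0.7. TSS ranges from -1 to 1, with values near zero indicating performance no better than random.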

Testo, W. L., A. L. de Gasper, S. Molino, J. M. G. y Galán, A. Salino, V. A. de O. Dittrich, and E. B. Sessa. 2022. Deep vicariance and frequent transoceanic dispersal shape the evolutionary history of a globally distributed fern family. American Journal of Botany.

Premise: Historical biogeography of ferns is typically expected to be dominated by long-distance dispersal, due to their minuscule spores. However, few studies have inferred the historical biogeography of a large and widely distributed group of ferns to test this hypothesis. Our aims are to determine the extent to which long-distance dispersal vs. vicariance have shaped the history of the fern family Blechnaceae, to explore ecological correlates of dispersal and diversification, and to determine whether these patterns differ between the northern and southern hemispheres. Methods: We used sequence data for three chloroplast loci to infer a time-calibrated phylogeny for 154 out of 265 species of Blechnaceae, including representatives of all genera in the family. This tree was used to conduct ancestral range reconstruction and stochastic character mapping, estimate diversification rates, and identify ecological correlates of diversification. Key results: Blechnaceae originated in Eurasia and began diversifying in the late Cretaceous. A lineage comprising most extant diversity diversified principally in the austral Pacific region around the Paleocene-Eocene Thermal Maximum. Land connections that existed near the poles during periods of warm climates likely facilitated migration of several lineages, with subsequent climate-mediated vicariance shaping current distributions. Long-distance dispersal is frequent and asymmetrical, with New Zealand/Pacific Islands, Australia, and tropical America being major source areas. Conclusions: Ancient vicariance and extensive long-distance dispersal have shaped the history of Blechnaceae in both the northern and southern hemispheres. The exceptional diversity in austral regions appears to reflect rapid speciation in these areas; mechanisms underlying this evolutionary success remain uncertain.

Amaral, D. T., I. A. S. Bonatelli, M. Romeiro-Brito, E. M. Moraes, and F. F. Franco. 2022. Spatial patterns of evolutionary diversity in Cactaceae show low ecological representation within protected areas. Biological Conservation 273: 109677.

Mapping biodiversity patterns across taxa and environments is crucial to address the evolutionary and ecological dimensions of species distribution, suggesting areas of particular importance for conservation purposes. Within Cactaceae, spatial diversity patterns are poorly explored, as are the abiotic factors that may predict these patterns. We gathered geographic and genetic data from 921 cactus species, which are tightly associated with drylands, by exploring both occurrence and genetic databases, to evaluate diversity patterns, such as phylogenetic diversity and endemism, paleo-, neo-, and superendemism, and the environmental predictor variables of such patterns in a global analysis. Hotspot areas of cacti diversity are scattered along the Neotropical and Nearctic regions, mainly in the desertic portion of Mesoamerica, the Caribbean Islands, and the dry diagonal of South America. The geomorphological features of these regions may create a complexity of areas that work as locally buffered zones over time, which triggers local events of diversification and speciation. Desert and dryland/dry forest areas comprise paleo- and superendemism and may act as both museums and cradles of species, displaying great importance for conservation. Past climates, topography, soil features, and solar irradiance seem to be the main predictors of distinct endemism types. The hotspot areas that encompass a major part of the endemism cells are outside or poorly covered by formal protection units. The current legally protected areas are not able to conserve the evolutionary diversity of cacti. Given the rapid anthropogenic disturbance, efforts must be reinforced to monitor biodiversity and the environment and to define/plan current and new protected areas.

Quiroga, M. P., and C. P. Souto. 2022. Ecological niche modeling, niche overlap, and good old Rabinowitz’s rarities applied to the conservation of gymnosperms in a global biodiversity hotspot. Landscape Ecology.

Context: Biodiversity hotspots harbor 77% of endemic plant species. Patagonian Temperate Forest (PTF) is part of a biodiversity hotspot but, over the past centuries, has been over-exploited, fragmented, and replaced with exotic species plantations, and is lately also threatened by climate change. Objectives: Our aim is to better understand patterns of habitat suitability and niche overlap of nine endemic gymnosperm species, key elements of the PTF, complementing traditional approaches of biodiversity conservation. Methods: Using R packages and 3016 occurrence records, we deployed ecological niche models (ENM) in MaxEnt via kuenm, and classified species according to Rabinowitz’s types of rarity. We then overlapped their niches by calculating Schoener's D index, and considered types of rarity in a spatial ecological context. Finally, we overlaid areas of high species suitability with protected areas and detected conservation priorities using GapAnalysis. Results: We generated simplified ENMs for nine Patagonian gymnosperms and found that most niches overlap, and only one species displayed a unique niche. Surprisingly, we found that three species have divergent habitat suitability across the landscape that is not related to the previously published geographic structure of neutral genetic variation. We showed that the rarer a species is, the smaller its niche volume tends to be, that six out of nine studied species have high conservation priority, and that there are conservation gaps in the PTF. Conclusion: Our approach showed that there are unprotected suitable areas for native key species at high risk in PTF, suggesting that integrating habitat-suitability models of multiple species, types of rarity, and niche overlap can be a handy tool to identify potential conservation areas in global biodiversity hotspots.
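Schoener's D, the niche-overlap index named in the abstract above, is commonly computed as D = 1 - 0.5 * sum(|p_a,i - p_b,i|), where p_a,i and p_b,i are the suitability values of two species in grid cell i, each normalized to sum to one. The sketch below illustrates that standard formula in plain Python; it is not the authors' R workflow, and the function name and inputs are hypothetical.

```python
from typing import Sequence

def schoeners_d(suit_a: Sequence[float], suit_b: Sequence[float]) -> float:
    """Schoener's D between two suitability surfaces over the same grid cells.

    Each input holds non-negative suitability scores, one per cell, in the
    same cell order. Returns 0 (no overlap) to 1 (identical niches).
    (Illustrative helper; not taken from the paper.)
    """
    if len(suit_a) != len(suit_b):
        raise ValueError("both surfaces must cover the same cells")
    total_a, total_b = sum(suit_a), sum(suit_b)
    if total_a <= 0 or total_b <= 0:
        raise ValueError("each surface needs positive total suitability")
    # Normalize each surface to sum to 1, then take half the summed
    # absolute difference and subtract it from 1.
    diff = sum(abs(a / total_a - b / total_b) for a, b in zip(suit_a, suit_b))
    return 1.0 - 0.5 * diff
```

Two identical surfaces give D = 1, and two surfaces with no shared suitable cells give D = 0, which matches the interpretation of D as the fraction of shared niche space.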

Führding‐Potschkat, P., H. Kreft, and S. M. Ickert‐Bond. 2022. Influence of different data cleaning solutions of point‐occurrence records on downstream macroecological diversity models. Ecology and Evolution 12.

Digital point‐occurrence records from the Global Biodiversity Information Facility (GBIF) and other data providers enable a wide range of research in macroecology and biogeography. However, data errors may hamper immediate use. Manual data cleaning is time‐consuming and often unfeasible, given that the databases may contain thousands or millions of records. Automated data cleaning pipelines are therefore of high importance. Taking North American Ephedra as a model, we examined how different data cleaning pipelines (using, e.g., the GBIF web application, and four different R packages) affect downstream species distribution models (SDMs). We also assessed how data differed from expert data. From 13,889 North American Ephedra observations in GBIF, the pipelines removed 31.7% to 62.7% false positives, invalid coordinates, and duplicates, leading to datasets between 9484 (GBIF application) and 5196 records (manual‐guided filtering). The expert data consisted of 704 records, comparable to data from field studies. Although differences in the absolute numbers of records were relatively large, species richness models based on stacked SDMs (S‐SDM) from pipeline and expert data were strongly correlated (mean Pearson's r across the pipelines: .9986, vs. the expert data: .9173). Our results suggest that all R package‐based pipelines reliably identified invalid coordinates. In contrast, the GBIF‐filtered data still contained both spatial and taxonomic errors. Major drawbacks emerge from the fact that no pipeline fully discovered misidentified specimens without the assistance of taxonomic expert knowledge. We conclude that application‐filtered GBIF data will still need additional review to achieve higher spatial data quality. Achieving high‐quality taxonomic data will require extra effort, probably by thoroughly analyzing the data for misidentified taxa, supported by experts.
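Two of the automated checks mentioned above, removing invalid coordinates and removing duplicates, can be sketched in a few lines. This is a simplified illustration only: it does not reproduce the GBIF web application or any of the R packages compared in the study, and the record layout (dictionaries with `species`, `lat`, and `lon` keys) and the rounding precision are assumptions made for the example.

```python
def clean_occurrences(records):
    """Drop records with missing/out-of-range coordinates, (0, 0) points,
    and duplicates of the same species at the same rounded location.

    `records` is a list of dicts with hypothetical keys
    'species', 'lat', and 'lon' (decimal degrees).
    """
    seen = set()
    cleaned = []
    for rec in records:
        lat, lon = rec.get("lat"), rec.get("lon")
        if lat is None or lon is None:
            continue  # missing coordinates
        if not (-90 <= lat <= 90 and -180 <= lon <= 180):
            continue  # outside the valid latitude/longitude range
        if lat == 0 and lon == 0:
            continue  # (0, 0) is a common placeholder for unknown location
        key = (rec["species"], round(lat, 4), round(lon, 4))
        if key in seen:
            continue  # duplicate of an already accepted record
        seen.add(key)
        cleaned.append(rec)
    return cleaned
```

As the abstract notes, checks of this kind address spatial errors only; misidentified specimens pass through unchanged and still require expert review.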

Sotuyo, S., E. Pedraza-Ortega, E. Martínez-Salas, J. Linares, and L. Cabrera. 2022. Insights into phylogenetic divergence of Dalbergia (Leguminosae: Dalbergiae) from Mexico and Central America. Frontiers in Ecology and Evolution 10.

The pantropical genus Dalbergia includes more than 250 species. Phylogenetic studies of the group are scarce and have only included two or three species distributed in Mexico. We obtained herbarium samples of Mexican, Central American, and South American species (sourced from MEXU). In addition, sequences of GenBank accessions were used to complement the study. Internal transcribed spacer (ITS), matK, and rbcL sequences from 384 accessions comprising species from America, Asia, and Africa were sampled to evaluate phylogenetic relationships of Mexican species and infrageneric classifications based on morphological data. Phylogenetic analyses suggest that the genus Dalbergia is monophyletic and originated in South America. The species distributed in Mexico do not form a monophyletic clade but are divided into four clades with affinities to South American and Asian species clades. There is no correlation between geography and large-scale phylogeny. The estimated ages of the Mexican and Central American clades ranged from 11.32 Ma (Dalbergia granadillo clade) to 1.88 Ma (Dalbergia ecastaphyllum clade). Multiple long-distance dispersal events should be used to explain the current genus distribution.

Marshall, B. M., C. T. Strine, C. S. Fukushima, P. Cardoso, M. C. Orr, and A. C. Hughes. 2022. Searching the web builds fuller picture of arachnid trade. Communications Biology 5.

Wildlife trade is a major driver of biodiversity loss, yet whilst the impacts of trade in some species are relatively well-known, some taxa, such as many invertebrates, are often overlooked. Here we explore global patterns of trade in the arachnids, and detect 1,264 species from 66 families and 371 genera in trade. Trade in these groups exceeds millions of individuals, with 67% coming directly from the wild, and up to 99% of individuals in some genera. For popular taxa, such as tarantulas, up to 50% are in trade, including 25% of species described since 2000. CITES only covers 30 (2%) of the species potentially traded. We mapped the percentage and number of species native to each country in trade. To enable sustainable trade, better data on species distributions and better conservation status assessments are needed. The disparity between trade data sources highlights the need to expand monitoring if impacts on wild populations are to be accurately gauged and the impacts of trade minimised. Trade in arachnids includes millions of individuals and over 1,264 species, with over 70% of individuals coming from the wild.

Cano, Á., F. W. Stauffer, T. Andermann, I. M. Liberal, A. Zizka, C. D. Bacon, H. Lorenzi, et al. 2022. Recent and local diversification of Central American understorey palms. Global Ecology and Biogeography 31: 1513–1525.

Aim: Central America is largely covered by hyperdiverse, yet poorly understood, rain forests. Understorey palms are diverse components of these forests, but little is known about their historical assembly. It is not clear when palms in Central America reached present diversity levels and whether most species arrived from neighbouring regions or evolved locally. We addressed these questions using the most species-rich American palm clades indicative of rain forests. We reconstructed and compared their phylogenomic and biogeographical history with the diversification of 54 other plant lineages, to gain a better understanding of the processes that shaped the assembly of Central American rain forests. Location: Central America. Time period: Cretaceous to present. Major taxa studied: Arecaceae: Arecoideae: Bactridinae, Chamaedoreeae, Geonomateae. Methods: We sampled 218 species through fieldwork and living collections. We sequenced their genomic DNA using target sequence-capture procedures. Using 12 calibration points, we reconstructed dated phylogenies under three approaches (multispecies coalescent, maximum likelihood and Bayesian inference), conducted biogeographical analyses (dispersal–extinction–cladogenesis) and estimated phylogenetic diversity metrics. Results: Dated phylogenies revealed intense diversification in Central America from 12 Ma. Local diversification events were four times more frequent than dispersal events, and we found strong phylogenetic clustering in relationship to Central America. Main conclusions: Our results suggest that most understorey palm species that characterize the Central American rain forests today evolved locally after repeated dispersal events, mostly from South America. Understorey palms in Central American rain forests diversified primarily after closure of the Central American Seaway at c. 13 Ma, suggesting that the Great American Biotic Interchange was a major trigger for plant diversification in Central American rain forests. This recent diversification contrasts with the much earlier existence of rain forest palms in neighbouring South America since c. 58 Ma. We found similar timings of diversification in 54 other seed plant lineages, suggesting an unexpectedly recent assembly of the hyperdiverse Central American flora.

Williams, C. J. R., D. J. Lunt, U. Salzmann, T. Reichgelt, G. N. Inglis, D. R. Greenwood, W. Chan, et al. 2022. African Hydroclimate During the Early Eocene From the DeepMIP Simulations. Paleoceanography and Paleoclimatology 37.

The early Eocene (∼56‐48 million years ago) is characterised by high CO2 estimates (1200‐2500 ppmv) and elevated global temperatures (∼10 to 16°C higher than modern). However, the response of the hydrological cycle during the early Eocene is poorly constrained, especially in regions with sparse data coverage (e.g. Africa). Here we present a study of African hydroclimate during the early Eocene, as simulated by an ensemble of state‐of‐the‐art climate models in the Deep‐time Model Intercomparison Project (DeepMIP). A comparison between the DeepMIP pre‐industrial simulations and modern observations suggests that model biases are model‐ and geographically dependent; however, these biases are reduced in the model ensemble mean. A comparison between the Eocene simulations and the pre‐industrial suggests that there is no obvious wetting or drying trend as the CO2 increases. The results suggest that changes to the land‐sea mask (relative to modern) in the models may be responsible for the simulated increases in precipitation to the north of Eocene Africa. There is an increase in precipitation over equatorial and West Africa and associated drying over northern Africa as CO2 rises. There are also important dynamical changes, with evidence that anticyclonic low‐level circulation is replaced by increased south‐westerly flow at high CO2 levels. Lastly, a model‐data comparison using newly‐compiled quantitative climate estimates from palaeobotanical proxy data suggests a marginally better fit with the reconstructions at lower levels of CO2.