Science Enabled by Specimen Data
Streiff, S. J. R., E. O. Ravomanana, M. Rakotoarinivo, M. Pignal, E. P. Pimparé, R. H. J. Erkens, and T. L. P. Couvreur. 2024. High-quality herbarium-label transcription by citizen scientists improves taxonomic and spatial representation of the tropical plant family Annonaceae. Adansonia 46. https://doi.org/10.5252/adansonia2024v46a18
Herbarium specimens provide an important and central resource for biodiversity research. Making these records digitally available to end-users represents numerous challenges, in particular, transcribing metadata associated with specimen labels. In this study, we used the citizen science initiative ‘Les Herbonautes’ and the Récolnat network to transcribe specific data from all herbarium specimen labels stored at the Muséum national d’Histoire naturelle in Paris of the large tropical plant family Annonaceae. We compared this database with publicly available global biodiversity repository data and expert checklists. We investigated spatial and taxonomic advances in data availability at the global and country scales. A total of 20 738 specimens were transcribed over the course of more than two years contributing to and significantly extending the previously available specimen and species data for Annonaceae worldwide. We show that several regions, mainly in Africa and South East Asia not covered by online global datasets, are uniquely available in the P herbarium, probably linked to past history of the museum’s botanical exploration. While acknowledging the challenges faced during the transcription of historic specimens by citizen scientists, this study highlights the positive impact of adding records to global datasets both in space and time. This is illustrative for researchers, collection managers, policy makers as well as funders. These datasets will be valuable for numerous future studies in biodiversity research, including ecology, evolution, conservation and climate change science.
Akwaji Patrick Ishoro, Onah Dough Owojoku, Ajikah Linus Bashie, Oden Glory Nicholas, Okon Ekeng Ita, Nkang Nkoyo Emem4,, Amaraizu Mary Nneoma, Ugbogu Omokafe Alaba. 2024. Climate change and Pentaclethra macrophylla Benth: Forecasting alterations in native distributional range across West and Central Africa. Zenodo. https://doi.org/10.5281/zenodo.13835195
The tree species known as the African oil bean (Pentaclethra macrophylla Benth) retains numerous applications. For rural residents, almost all of its traded elements represent a significant source of income. Numerous terrestrial habitats have reportedly experienced negative biological, temporal, and spatial effects concerning climate change lately. Understanding the out-turn of changing climate towards the geographic distribution of species could help predict their growth or decline and, if necessary, provide appropriate conservation measures. We examined whether climate change will affect the geographical distribution of this species throughout its native distributional area across West and Central Africa in light of the strong interest that this species holds for rural African residents. Under AfriClim RCP 8.5 scenario 2070 conditions, the inquiry was carried out by applying the MaxEnt model. According to the MaxEnt results, climate change shall hold a major footprint toward species' native spread. About 5% (5889 km2) of the nations across West and Central Africa are predicted to have stable species populations. These are mostly the regions located along the southern coasts of Guinea Bissau, Sierra Leone, Liberia, Cote d'Ivoire, Nigeria, Cameroon, and Gabon. The model threshold indicated a huge 95.29% (119135.9 km2) reduction in the species' appropriate habitat. The southern coasts of Senegal, Ghana, Togo, and the Benin Republic, along with the Democratic Republic of the Congo, are predicted to be unsuitable, as are the topmost northern portions associated with the Sahel regions of West and Central African countries. Additionally, it is expected that the entire Burkina Faso, Central African Republic, Democratic Republic of the Congo, and south-eastern Angola will no longer be appropriate for the species. It is necessary to build up the preservation of the species by raising and establishing it in the anticipated suitable areas/agroforestry plan to ensure its sustainable usage and practicable conservation.
Bürger, M., and J. Chory. 2024. A potential role of heat‐moisture couplings in the range expansion of Striga asiatica. Ecology and Evolution 14. https://doi.org/10.1002/ece3.11332
Parasitic weeds in the genera Orobanche, Phelipanche (broomrapes) and Striga (witchweeds) have a devastating impact on food security across much of Africa, Asia and the Mediterranean Basin. Yet, how climatic factors might affect the range expansion of these weeds in the context of global environmental change remains unexplored. We examined satellite‐based environmental variables such as surface temperature, root zone soil moisture, and elevation, in relation to parasitic weed distribution and environmental conditions over time, in combination with observational data from the Global Biodiversity Information Facility (GBIF). Our analysis reveals contrasting environmental and altitude preferences in the genera Striga and Orobanche. Asiatic witchweed (Striga asiatica), which infests corn, rice, sorghum, and sugar cane crops, appears to be expanding its range in high elevation habitats. It also shows a significant association with heat‐moisture coupling events, the frequency of which is rising in such environments. These results point to geographical shifts in distribution and abundance in parasitic weeds due to climate change.
Serra‐Diaz, J. M., J. Borderieux, B. Maitner, C. C. F. Boonman, D. Park, W. Guo, A. Callebaut, et al. 2024. occTest: An integrated approach for quality control of species occurrence data. Global Ecology and Biogeography. https://doi.org/10.1111/geb.13847
Aim Species occurrence data are valuable information that enables one to estimate geographical distributions, characterize niches and their evolution, and guide spatial conservation planning. Rapid increases in species occurrence data stem from increasing digitization and aggregation efforts, and citizen science initiatives. However, persistent quality issues in occurrence data can impact the accuracy of scientific findings, underscoring the importance of filtering erroneous occurrence records in biodiversity analyses.InnovationWe introduce an R package, occTest, that synthesizes a growing open‐source ecosystem of biodiversity cleaning workflows to prepare occurrence data for different modelling applications. It offers a structured set of algorithms to identify potential problems with species occurrence records by employing a hierarchical organization of multiple tests. The workflow has a hierarchical structure organized in testPhases (i.e. cleaning vs. testing) that encompass different testBlocks grouping different testTypes (e.g. environmental outlier detection), which may use different testMethods (e.g. Rosner test, jacknife,etc.). Four different testBlocks characterize potential problems in geographic, environmental, human influence and temporal dimensions. Filtering and plotting functions are incorporated to facilitate the interpretation of tests. We provide examples with different data sources, with default and user‐defined parameters. Compared to other available tools and workflows, occTest offers a comprehensive suite of integrated tests, and allows multiple methods associated with each test to explore consensus among data cleaning methods. It uniquely incorporates both coordinate accuracy analysis and environmental analysis of occurrence records. Furthermore, it provides a hierarchical structure to incorporate future tests yet to be developed.Main conclusionsoccTest will help users understand the quality and quantity of data available before the start of data analysis, while also enabling users to filter data using either predefined rules or custom‐built rules. As a result, occTest can better assess each record's appropriateness for its intended application.
Anest, A., Y. Bouchenak-Khelladi, T. Charles-Dominique, F. Forest, Y. Caraglio, G. P. Hempson, O. Maurin, and K. W. Tomlinson. 2024. Blocking then stinging as a case of two-step evolution of defensive cage architectures in herbivore-driven ecosystems. Nature Plants. https://doi.org/10.1038/s41477-024-01649-4
Dense branching and spines are common features of plant species in ecosystems with high mammalian herbivory pressure. While dense branching and spines can inhibit herbivory independently, when combined, they form a powerful defensive cage architecture. However, how cage architecture evolved under mammalian pressure has remained unexplored. Here we show how dense branching and spines emerged during the age of mammalian radiation in the Combretaceae family and diversified in herbivore-driven ecosystems in the tropics. Phylogenetic comparative methods revealed that modern plant architectural strategies defending against large mammals evolved via a stepwise process. First, dense branching emerged under intermediate herbivory pressure, followed by the acquisition of spines that supported higher speciation rates under high herbivory pressure. Our study highlights the adaptive value of dense branching as part of a herbivore defence strategy and identifies large mammal herbivory as a major selective force shaping the whole plant architecture of woody plants. This study explores the evolution of two traits, branching density and spine presence, in the globally distributed plant family Combretaceae. These traits were found to have appeared in a two-step process in response to mammalian herbivory pressure, revealing the importance of large mammals in the evolution of plant architecture diversity.
Ract, C., N. D. Burgess, L. Dinesen, P. Sumbi, I. Malugu, J. Latham, L. Anderson, et al. 2024. Nature Forest Reserves in Tanzania and their importance for conservation S. S. Romanach [ed.],. PLOS ONE 19: e0281408. https://doi.org/10.1371/journal.pone.0281408
Since 1997 Tanzania has undertaken a process to identify and declare a network of Nature Forest Reserves (NFRs) with high biodiversity values, from within its existing portfolio of national Forest Reserves, with 16 new NFRs declared since 2015. The current network of 22 gazetted NFRs covered 948,871 hectares in 2023. NFRs now cover a range of Tanzanian habitat types, including all main forest types—wet, seasonal, and dry—as well as wetlands and grasslands. NFRs contain at least 178 of Tanzania’s 242 endemic vertebrate species, of which at least 50% are threatened with extinction, and 553 Tanzanian endemic plant taxa (species, subspecies, and varieties), of which at least 50% are threatened. NFRs also support 41 single-site endemic vertebrate species and 76 single-site endemic plant taxa. Time series analysis of management effectiveness tracking tool (METT) data shows that NFR management effectiveness is increasing, especially where donor funds have been available. Improved management and investment have resulted in measurable reductions of some critical threats in NFRs. Still, ongoing challenges remain to fully contain issues of illegal logging, charcoal production, firewood, pole-cutting, illegal hunting and snaring of birds and mammals, fire, wildlife trade, and the unpredictable impacts of climate change. Increased tourism, diversified revenue generation and investment schemes, involving communities in management, and stepping up control measures for remaining threats are all required to create a network of economically self-sustaining NFRs able to conserve critical biodiversity values.
Maurin, O., A. Anest, F. Forest, I. Turner, R. L. Barrett, R. C. Cowan, L. Wang, et al. 2023. Drift in the tropics: Phylogenetics and biogeographical patterns in Combretaceae. Global Ecology and Biogeography. https://doi.org/10.1111/geb.13737
Aim The aim of this study was to further advance our understanding of the species-rich, and ecologically important angiosperm family Combretaceae to provide new insights into their evolutionary history. We assessed phylogenetic relationships in the family using target capture data and produced a dated phylogenetic tree to assess fruit dispersal modes and patterns of distribution. Location Tropical and subtropical regions. Time Period Cretaceous to present. Major Taxa Studied Family Combretaceae is a member of the rosid clade and comprises 10 genera and more than 500 species, predominantly assigned to genera Combretum and Terminalia, and occurring on all continents and in a wide range of ecosystems. Methods We use a target capture approach and the Angiosperms353 universal probes to reconstruct a robust dated phylogenetic tree for the family. This phylogenetic framework, combined with seed dispersal traits, biome data and biogeographic ranges, allows the reconstruction of the biogeographical history of the group. Results Ancestral range reconstructions suggest a Gondwanan origin (Africa/South America), with several intercontinental dispersals within the family and few transitions between biomes. Relative abundance of fruit dispersal types differed by both continent and biome. However, intercontinental colonizations were only significantly enhanced by water dispersal (drift fruit), and there was no evidence that seed dispersal modes influenced biome shifts. Main Conclusions Our analysis reveals a paradox as drift fruit greatly enhanced dispersal distances at intercontinental scale but did not affect the strong biome conservatism observed.
Hill, A., M. F. T. Jiménez, N. Chazot, C. Cássia‐Silva, S. Faurby, L. Herrera‐Alsina, and C. D. Bacon. 2023. Apparent effect of range size and fruit colour on palm diversification may be spurious. Journal of Biogeography. https://doi.org/10.1111/jbi.14683
Aim Fruit selection by animal dispersers with different mobility directly impacts plant geographical range size, which, in turn, may impact plant diversification. Here, we examine the interaction between fruit colour, range size and diversification rate in palms by testing two hypotheses: (1) species with fruit colours attractive to birds have larger range sizes due to high dispersal ability and (2) disperser mobility affects whether small or large range size has higher diversification, and intermediate range size is expected to lead to the highest diversification rate regardless of disperser. Location Global. Time Period Contemporary (or present). Major Taxa Studied Palms (Arecaceae). Methods Palm species were grouped based on likely animal disperser group for given fruit colours. Range sizes were estimated by constructing alpha convex hull polygons from distribution data. We examined disperser group, range size or an interaction of both as possible drivers of change in diversification rate over time in a likelihood dynamic model (Several Examined State-dependent Speciation and Extinction [SecSSE]). Models were fitted, rate estimates were retrieved and likelihoods were compared to those of appropriate null models. Results Species with fruit colours associated with mammal dispersal had larger ranges than those with colours associated with bird dispersal. The best fitting SecSSE models indicated that the examined traits were not the primary driver of the heterogeneity in diversification rates in the model. Extinction rate complexity had a marked impact on model performance and on diversification rates. Main Conclusions Two traits related to dispersal mobility, range size and fruit colour, were not identified as the main drivers of diversification in palms. Increased model extinction rate complexity led to better performing models, which indicates that net diversification should be estimated rather than speciation alone. However, increased complexity may lead to incorrect SecSSE model conclusions without careful consideration. Finally, we find palms with more mobile dispersers do not have larger range sizes, meaning other factors are more important determinants of range size.
Richard-Bollans, A., C. Aitken, A. Antonelli, C. Bitencourt, D. Goyder, E. Lucas, I. Ondo, et al. 2023. Machine learning enhances prediction of plants as potential sources of antimalarials. Frontiers in Plant Science 14. https://doi.org/10.3389/fpls.2023.1173328
Plants are a rich source of bioactive compounds and a number of plant-derived antiplasmodial compounds have been developed into pharmaceutical drugs for the prevention and treatment of malaria, a major public health challenge. However, identifying plants with antiplasmodial potential can be time-consuming and costly. One approach for selecting plants to investigate is based on ethnobotanical knowledge which, though having provided some major successes, is restricted to a relatively small group of plant species. Machine learning, incorporating ethnobotanical and plant trait data, provides a promising approach to improve the identification of antiplasmodial plants and accelerate the search for new plant-derived antiplasmodial compounds. In this paper we present a novel dataset on antiplasmodial activity for three flowering plant families – Apocynaceae, Loganiaceae and Rubiaceae (together comprising c. 21,100 species) – and demonstrate the ability of machine learning algorithms to predict the antiplasmodial potential of plant species. We evaluate the predictive capability of a variety of algorithms – Support Vector Machines, Logistic Regression, Gradient Boosted Trees and Bayesian Neural Networks – and compare these to two ethnobotanical selection approaches – based on usage as an antimalarial and general usage as a medicine. We evaluate the approaches using the given data and when the given samples are reweighted to correct for sampling biases. In both evaluation settings each of the machine learning models have a higher precision than the ethnobotanical approaches. In the bias-corrected scenario, the Support Vector classifier performs best – attaining a mean precision of 0.67 compared to the best performing ethnobotanical approach with a mean precision of 0.46. We also use the bias correction method and the Support Vector classifier to estimate the potential of plants to provide novel antiplasmodial compounds. We estimate that 7677 species in Apocynaceae, Loganiaceae and Rubiaceae warrant further investigation and that at least 1300 active antiplasmodial species are highly unlikely to be investigated by conventional approaches. While traditional and Indigenous knowledge remains vital to our understanding of people-plant relationships and an invaluable source of information, these results indicate a vast and relatively untapped source in the search for new plant-derived antiplasmodial compounds.
Reichgelt, T., A. Baumgartner, R. Feng, and D. A. Willard. 2023. Poleward amplification, seasonal rainfall and forest heterogeneity in the Miocene of the eastern USA. Global and Planetary Change 222: 104073. https://doi.org/10.1016/j.gloplacha.2023.104073
Paleoclimate reconstructions can provide a window into the environmental conditions in Earth history when atmospheric carbon dioxide concentrations were higher than today. In the eastern USA, paleoclimate reconstructions are sparse, because terrestrial sedimentary deposits are rare. Despite this, the eastern USA has the largest population and population density in North America, and understanding the effects of current and future climate change is of vital importance. Here, we provide terrestrial paleoclimate reconstructions of the eastern USA from Miocene fossil floras. Additionally, we compare proxy paleoclimate reconstructions from the warmest period in the Miocene, the Miocene Climatic Optimum (MCO), to those of an MCO Earth System Model. Reconstructed Miocene temperatures and precipitation north of 35°N are higher than modern. In contrast, south of 35°N, temperatures and precipitation are similar to today, suggesting a poleward amplification effect in eastern North America. Reconstructed Miocene rainfall seasonality was predominantly higher than modern, regardless of latitude, indicating greater variability in intra-annual moisture transport. Reconstructed climates are almost uniformly in the temperate seasonal forest biome, but heterogeneity of specific forest types is evident. Reconstructed Miocene terrestrial temperatures from the eastern USA are lower than modeled temperatures and coeval Atlantic sea surface temperatures. However, reconstructed rainfall is consistent with modeled rainfall. Our results show that during the Miocene, climate was most different from modern in the northeastern states, and may suggest a drastic reduction in the meridional temperature gradient along the North American east coast compared to today.