Modelling the distribution of rare invertebrates by correcting class imbalance and spatial bias

Gaul, Willson; Sadykova, Dinara; White, Hannah J.; León‐Sánchez, Lupe; Caplat, Paul; Emmerson, Mark C.; Yearsley, Jon M. 2022. Modelling the distribution of rare invertebrates by correcting class imbalance and spatial bias. Diversity and Distributions, 28 (10). 2171-2186. 10.1111/ddi.13619

N533424JA.pdf - Published Version
Available under License Creative Commons Attribution 4.0.

Abstract/Summary

Aim: Soil arthropods are important decomposers and nutrient cyclers, but are poorly represented on national and international conservation Red Lists. Opportunistic biological records for soil invertebrates are sparse, and contain few observations of rare species but a relatively large number of non-detection observations (a problem known as class imbalance). Robinson et al. (Diversity and Distributions, 24, 460) proposed a method for under-sampling non-detection data using a spatial grid to improve class balance and spatial bias in bird data. For taxa that are less intensively sampled, datasets are smaller, which poses a challenge because under-sampling data removes information. We tested whether spatially stratified under-sampling improved prediction performance of species distribution models for millipedes, for which large datasets are not available. We also tested whether using environmental predictor variables provided additional information beyond what is captured by spatial position for predicting species distributions.

Location: Island of Ireland.

Methods: We tested the spatially stratified under-sampling method of Robinson et al. (Diversity and Distributions, 24, 460) by using biological records to train species distribution models of rare millipedes.

Results: Using spatially stratified under-sampled data improved species distribution model sensitivity (true positive rate) but decreased model specificity (true negative rate). The spatial pattern of under-sampling affected model performance. Training data that was under-sampled in a spatially stratified way sometimes produced worse models than did data that was under-sampled in an unstratified way. Geographic coordinates were as good as or better than environmental variables for predicting distributions of one out of six species.

Main Conclusions: Spatially stratified under-sampling improved prediction performance of species distribution models for rare millipedes. Spatially stratified under-sampling was most effective for rarer species, although unstratified under-sampling was sometimes more effective. The good prediction performance of models using geographic coordinates is promising for modelling distributions of poorly studied species for which little is known about ecological or physiological determinants of occurrence.
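
As a rough illustration of the idea of under-sampling non-detection data on a spatial grid, the following Python sketch keeps every detection and retains at most a fixed number of non-detections per grid cell. It is not the authors' or Robinson et al.'s implementation: the function name, the record layout (x, y, detected), the square grid, and the parameters cell_size and max_per_cell are all assumptions made for this example.

```python
import random
from collections import defaultdict


def spatially_stratified_undersample(records, cell_size, max_per_cell=1, seed=0):
    """Keep all detections; keep at most `max_per_cell` non-detections per grid cell.

    `records` is a list of (x, y, detected) tuples (a hypothetical layout).
    `cell_size` is the side length of the square grid cells, in coordinate units.
    """
    rng = random.Random(seed)

    # Detections are the rare class, so none of them are discarded.
    detections = [r for r in records if r[2]]

    # Group non-detections by the grid cell that contains them.
    cells = defaultdict(list)
    for x, y, detected in records:
        if not detected:
            key = (int(x // cell_size), int(y // cell_size))
            cells[key].append((x, y, detected))

    # Randomly retain a capped number of non-detections from each cell.
    kept = []
    for key in sorted(cells):
        members = cells[key]
        kept.extend(rng.sample(members, min(max_per_cell, len(members))))

    return detections + kept


# Toy data: 2 detections, 10 non-detections in one cell, 5 in a neighbouring cell.
records = [(1.0, 1.0, True), (12.0, 3.0, True)]
records += [(float(i % 10), float(i % 7), False) for i in range(10)]
records += [(10.0 + i, 5.0, False) for i in range(5)]

sample = spatially_stratified_undersample(records, cell_size=10.0, max_per_cell=2)
n_detections = sum(1 for r in sample if r[2])
n_non_detections = sum(1 for r in sample if not r[2])
```

Capping non-detections per cell thins heavily sampled areas more than sparsely sampled ones, which is how a grid can address class balance and spatial bias together. Every discarded record is lost information, which is the trade-off the abstract raises for small datasets.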

Item Type: Publication - Article
Digital Object Identifier (DOI): 10.1111/ddi.13619
UKCEH and CEH Sections/Science Areas: Biodiversity (Science Area 2017-)
ISSN: 1366-9516
Additional Information: Open Access paper - full text available via Official URL link.
Additional Keywords: class imbalance, Diplopoda, millipede, rare species, spatial bias, spatial under-sampling, species distribution model
NORA Subject Terms: Ecology and Environment
Date made live: 25 Oct 2022 15:00 +0 (UTC)
URI: https://nora.nerc.ac.uk/id/eprint/533424
