Data-derived metrics describing the behaviour of field-based citizen scientists provide insights for project design and modelling bias
August, Tom ORCID: https://orcid.org/0000-0003-1116-3385; Fox, Richard; Roy, David B. ORCID: https://orcid.org/0000-0002-5147-0331; Pocock, Michael J.O. ORCID: https://orcid.org/0000-0003-4375-0445. 2020 Data-derived metrics describing the behaviour of field-based citizen scientists provide insights for project design and modelling bias. Scientific Reports, 10, 11009. 12, pp. https://doi.org/10.1038/s41598-020-67658-3
Before downloading, please read NORA policies.
|
Text
N528242JA.pdf - Published Version Available under License Creative Commons Attribution 4.0. Download (1MB) | Preview |
Abstract/Summary
Around the world volunteers and non-professionals collect data as part of environmental citizen science projects, collecting wildlife observations, measures of water quality and much more. However, where projects allow flexibility in how, where, and when data are collected there will be variation in the behaviour of participants which results in biases in the datasets collected. We develop a method to quantify this behavioural variation, describing the key drivers and providing a tool to account for biases in models that use these data. We used a suite of metrics to describe the temporal and spatial behaviour of participants, as well as variation in the data they collected. These were applied to 5,268 users of the iRecord Butterflies mobile phone app, a multi-species environmental citizen science project. In contrast to previous studies, after removing transient participants (those active on few days and who contribute few records), we do not find evidence of clustering of participants; instead, participants fall along four continuous axes that describe variation in participants’ behaviour: recording intensity, spatial extent, recording potential and rarity recording. Our results support a move away from labelling participants as belonging to one behavioural group or another in favour of placing them along axes of participant behaviour that better represent the continuous variation between individuals. Understanding participant behaviour could support better use of the data, by accounting for biases in the data collection process.
Item Type: | Publication - Article |
---|---|
Digital Object Identifier (DOI): | https://doi.org/10.1038/s41598-020-67658-3 |
UKCEH and CEH Sections/Science Areas: | Biodiversity (Science Area 2017-) |
ISSN: | 2045-2322 |
Additional Information. Not used in RCUK Gateway to Research.: | Open Access paper - full text available via Official URL link. |
Additional Keywords: | biodiversity, data mining, macroecology, statistical methods |
NORA Subject Terms: | Ecology and Environment |
Date made live: | 29 Jul 2020 11:04 +0 (UTC) |
URI: | https://nora.nerc.ac.uk/id/eprint/528242 |
Actions (login required)
View Item |
Document Downloads
Downloads for past 30 days
Downloads per month over past year