nerc.ac.uk

Evaluating the benefits of bayesian hierarchical methods for analyzing heterogeneous environmental datasets: a case study of marine organic carbon fluxes

Britten, Gregory L.; Mohajerani, Yara; Primeau, Louis; Aydin, Murat; Garcia, Catherine; Wang, Wei-Lei; Pasquier, Benoît; Cael, B. B. ORCID: https://orcid.org/0000-0003-1317-5718; Primeau, François W.. 2021 Evaluating the benefits of bayesian hierarchical methods for analyzing heterogeneous environmental datasets: a case study of marine organic carbon fluxes. Frontiers in Environmental Science, 9. https://doi.org/10.3389/fenvs.2021.491636

Before downloading, please read NORA policies.
[img]
Preview
Text
fenvs-09-491636.pdf - Published Version
Available under License Creative Commons Attribution 4.0.

Download (2MB) | Preview

Abstract/Summary

Large compilations of heterogeneous environmental observations are increasingly available as public databases, allowing researchers to test hypotheses across datasets. Statistical complexities arise when analyzing compiled data due to unbalanced spatial sampling, variable environmental context, mixed measurement techniques, and other reasons. Hierarchical Bayesian modeling is increasingly used in environmental science to describe these complexities, however few studies explicitly compare the utility of hierarchical Bayesian models to simpler and more commonly applied methods. Here we demonstrate the utility of the hierarchical Bayesian approach with application to a large compiled environmental dataset consisting of 5,741 marine vertical organic carbon flux observations from 407 sampling locations spanning eight biomes across the global ocean. We fit a global scale Bayesian hierarchical model that describes the vertical profile of organic carbon flux with depth. Profile parameters within a particular biome are assumed to share a common deviation from the global mean profile. Individual station-level parameters are then modeled as deviations from the common biome-level profile. The hierarchical approach is shown to have several benefits over simpler and more common data aggregation methods. First, the hierarchical approach avoids statistical complexities introduced due to unbalanced sampling and allows for flexible incorporation of spatial heterogeneitites in model parameters. Second, the hierarchical approach uses the whole dataset simultaneously to fit the model parameters which shares information across datasets and reduces the uncertainty up to 95% in individual profiles. Third, the Bayesian approach incorporates prior scientific information about model parameters; for example, the non-negativity of chemical concentrations or mass-balance, which we apply here. We explicitly quantify each of these properties in turn. We emphasize the generality of the hierarchical Bayesian approach for diverse environmental applications and its increasing feasibility for large datasets due to recent developments in Markov Chain Monte Carlo algorithms and easy-to-use high-level software implementations.

Item Type: Publication - Article
Digital Object Identifier (DOI): https://doi.org/10.3389/fenvs.2021.491636
ISSN: 2296-665X
Date made live: 06 May 2021 14:52 +0 (UTC)
URI: https://nora.nerc.ac.uk/id/eprint/530269

Actions (login required)

View Item View Item

Document Downloads

Downloads for past 30 days

Downloads per month over past year

More statistics for this item...