nerc.ac.uk

Identifying the spatial pattern and driving factors of nitrate in groundwater using a novel framework of interpretable stacking ensemble learning

Li, Xuan; Liang, Guohua; Wang, Lei; Yang, Yuesuo; Li, Yuanyin; Li, Zhongguo; He, Bin; Wang, Guoli. 2024 Identifying the spatial pattern and driving factors of nitrate in groundwater using a novel framework of interpretable stacking ensemble learning. Environmental Geochemistry and Health, 46 (11), 482. 10.1007/s10653-024-02201-1

Before downloading, please read NORA policies.
[thumbnail of Open Access Paper]
Preview
Text (Open Access Paper)
s10653-024-02201-1.pdf - Published Version
Available under License Creative Commons Attribution 4.0.

Download (4MB) | Preview

Abstract/Summary

Groundwater nitrate contamination poses a potential threat to human health and environmental safety globally. This study proposes an interpretable stacking ensemble learning (SEL) framework for enhancing and interpreting groundwater nitrate spatial predictions by integrating the two-level heterogeneous SEL model and SHapley Additive exPlanations (SHAP). In the SEL model, five commonly used machine learning models were utilized as base models (gradient boosting decision tree, extreme gradient boosting, random forest, extremely randomized trees, and k-nearest neighbor), whose outputs were taken as input data for the meta-model. When applied to the agricultural intensive area, the Eden Valley in the UK, the SEL model outperformed the individual models in predictive performance and generalization ability. It reveals a mean groundwater nitrate level of 2.22 mg/L-N, with 2.46% of sandstone aquifers exceeding the drinking standard of 11.3 mg/L-N. Alarmingly, 8.74% of areas with high groundwater nitrate remain outside the designated nitrate vulnerable zones. Moreover, SHAP identified that transmissivity, baseflow index, hydraulic conductivity, the percentage of arable land, and the C:N ratio in the soil were the top five key driving factors of groundwater nitrate. With nitrate threatening groundwater globally, this study presents a high-accuracy, interpretable, and flexible modeling framework that enhances our understanding of the mechanisms behind groundwater nitrate contamination. It implies that the interpretable SEL framework has great promise for providing valuable evidence for environmental management, water resource protection, and sustainable development, particularly in the data-scarce area.

Item Type: Publication - Article
Digital Object Identifier (DOI): 10.1007/s10653-024-02201-1
ISSN: 0269-4042
Date made live: 26 Nov 2024 14:24 +0 (UTC)
URI: https://nora.nerc.ac.uk/id/eprint/538451

Actions (login required)

View Item View Item

Document Downloads

Downloads for past 30 days

Downloads per month over past year

More statistics for this item...