Identifying the spatial pattern and driving factors of nitrate in groundwater using a novel framework of interpretable stacking ensemble learning
Li, Xuan; Liang, Guohua; Wang, Lei; Yang, Yuesuo; Li, Yuanyin; Li, Zhongguo; He, Bin; Wang, Guoli. 2024 Identifying the spatial pattern and driving factors of nitrate in groundwater using a novel framework of interpretable stacking ensemble learning. Environmental Geochemistry and Health, 46 (11), 482. 10.1007/s10653-024-02201-1
Before downloading, please read NORA policies.Preview |
Text (Open Access Paper)
s10653-024-02201-1.pdf - Published Version Available under License Creative Commons Attribution 4.0. Download (4MB) | Preview |
Abstract/Summary
Groundwater nitrate contamination poses a potential threat to human health and environmental safety globally. This study proposes an interpretable stacking ensemble learning (SEL) framework for enhancing and interpreting groundwater nitrate spatial predictions by integrating the two-level heterogeneous SEL model and SHapley Additive exPlanations (SHAP). In the SEL model, five commonly used machine learning models were utilized as base models (gradient boosting decision tree, extreme gradient boosting, random forest, extremely randomized trees, and k-nearest neighbor), whose outputs were taken as input data for the meta-model. When applied to the agricultural intensive area, the Eden Valley in the UK, the SEL model outperformed the individual models in predictive performance and generalization ability. It reveals a mean groundwater nitrate level of 2.22 mg/L-N, with 2.46% of sandstone aquifers exceeding the drinking standard of 11.3 mg/L-N. Alarmingly, 8.74% of areas with high groundwater nitrate remain outside the designated nitrate vulnerable zones. Moreover, SHAP identified that transmissivity, baseflow index, hydraulic conductivity, the percentage of arable land, and the C:N ratio in the soil were the top five key driving factors of groundwater nitrate. With nitrate threatening groundwater globally, this study presents a high-accuracy, interpretable, and flexible modeling framework that enhances our understanding of the mechanisms behind groundwater nitrate contamination. It implies that the interpretable SEL framework has great promise for providing valuable evidence for environmental management, water resource protection, and sustainable development, particularly in the data-scarce area.
Item Type: | Publication - Article |
---|---|
Digital Object Identifier (DOI): | 10.1007/s10653-024-02201-1 |
ISSN: | 0269-4042 |
Date made live: | 26 Nov 2024 14:24 +0 (UTC) |
URI: | https://nora.nerc.ac.uk/id/eprint/538451 |
Actions (login required)
View Item |
Document Downloads
Downloads for past 30 days
Downloads per month over past year