nerc.ac.uk

A systematic review of machine learning models for groundwater level prediction

Jesse, Gilbert; Boateng, Cyril D. ORCID: https://orcid.org/0000-0002-1721-4158; Aryee, Jeffrey N.A.; Osei, Marian A. ORCID: https://orcid.org/0000-0003-3481-7222; Wemegah, David D.; Gidigasu, Solomon S.R.; Britwum, Akyana; Afful, Samuel K.; Touré, Haoulata; Mensah, Vera; Owusu-Afriyie, Prinsca. 2025 A systematic review of machine learning models for groundwater level prediction. Applied Computing and Geosciences, 28, 100303. 29, pp. 10.1016/j.acags.2025.100303

Before downloading, please read NORA policies.
[thumbnail of N540581JA.pdf]
Preview
Text
N540581JA.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives 4.0.

Download (4MB) | Preview

Abstract/Summary

This study presents a comprehensive synthesis of machine learning (ML) techniques applied to groundwater level (GWL) prediction, focusing on model architectures, feature selection methods, hyperparameter tuning, optimization algorithms, and clustering techniques. A total of 223 peer-reviewed articles were systematically reviewed using the PRISMA framework to guide study identification, inclusion, and exclusion. Widely used models include artificial neural networks (ANN), support vector machines (SVM), long short-term memory networks (LSTM), and random forests (RF). More recent studies increasingly employ hybrid approaches that integrate wavelet transforms, signal decomposition, and optimization techniques such as particle swarm optimization (PSO), genetic algorithms (GA), and ant colony optimization (ACO). Transformer-based models have also begun to emerge as promising tools in this domain. A central focus of this review is feature selection, which remains one of the most underdeveloped areas in GWL modeling. Most studies rely on simple filter methods like autocorrelation and mutual information. While SHapley Additive exPlanations (SHAP) has gained some traction, more advanced techniques, such as recursive feature elimination (RFE), forward feature selection (FFS), factor analysis (FA), and self-organizing maps (SOM), are rarely used. Notably, no study systematically compared multiple feature selection strategies, limiting insights into their impact on model performance. Scientometric analysis shows that Iran, China, India, and the United States contribute the most impactful research. Despite strong predictive outcomes, trial-and-error remains the dominant approach to hyperparameter tuning. The review emphasizes the need for more systematic, interpretable, and generalizable ML approaches to support robust groundwater level (GWL) forecasting.

Item Type: Publication - Article
Digital Object Identifier (DOI): 10.1016/j.acags.2025.100303
UKCEH and CEH Sections/Science Areas: UKCEH Fellows
ISSN: 2590-1974
Additional Information: Open Access paper - full text available via Official URL link.
Additional Keywords: machine learning, groundwater level prediction, PRISMA, ANN, feature selection, optimization, hyperparameter tuning
NORA Subject Terms: Ecology and Environment
Hydrology
Computer Science
Date made live: 17 Nov 2025 11:47 +0 (UTC)
URI: https://nora.nerc.ac.uk/id/eprint/540581

Actions (login required)

View Item View Item

Document Downloads

Downloads for past 30 days

Downloads per month over past year

More statistics for this item...