@Article{Aderemi_etal2023, author="Aderemi, B. A. and Olwal, T. O. and Ndambuki, J. M. and Rwanga, S. S.", title="Groundwater levels forecasting using machine learning models: A case study of the groundwater region 10 at Karst Belt, South Africa", journal="Systems and Soft Computing", year="2023", volume="5", pages="200049", optkeywords="Artificial intelligence", optkeywords="Forecasting model", optkeywords="Groundwater levels", optkeywords="Machine learning", optkeywords="Neural networks", optkeywords="Rainfall", optkeywords="Regression", optkeywords="Temperature", optkeywords="Time series", abstract="The crucial role which groundwater resource plays in our environment and the overall well-being of all living things can not be underestimated. Nonetheless, mismanagement of resources, over-exploitation, inadequate supply of surface water and pollution have led to severe drought and an overall drop in groundwater resources{\textquoteright} levels over the past decades. To address this, a groundwater flow model and several mathematical data-driven models have been developed for forecasting groundwater levels. However, there is a problem of unavailability and scarcity of the on-site input data needed by the data-driven models to forecast the groundwater level. Furthermore, as a result of the dynamics and stochastic characteristics of groundwater, there is a need for an appropriate, accurate and reliable forecasting model to solve these challenges. Over the years, the broad application of Machine Learning (ML) and Artificial Intelligence (AI) models are gaining attraction as an alternative solution for forecasting groundwater levels. Against this background, this article provides an overview of forecasting methods for predicting groundwater levels. Also, this article uses ML models such as Regressions Models, Deep Auto-Regressive models, and Nonlinear Autoregressive Neural Networks with External Input (NARX) to forecast groundwater levels using the groundwater region 10 at Karst belt in South Africa as a case study. This was done using Python Mx. Version 1.9.1., and MATLAB R2022a machine learning toolboxes. Moreover, the Coefficient of Determination (R2);, Root Mean Square Error (RMSE), Mutual Information gain, Mean Absolute Percentage Error (MAPE), Mean Squared Error (MSE), Mean Absolute Error (MAE), and the Mean Absolute Scaled Error (MASE)) models were the forecasting statistical performance metrics used to assess the predictive performance of these models. The results obtained showed that NARX and Support Vector Machine (SVM) have higher performance metrics and accuracy compared to other models used in this study.", optnote="exported from refbase (http://www.uhydro.de/base/show.php?record=219), last updated on Fri, 26 Jan 2024 19:51:25 +0100", issn="2772-9419", opturl="https://www.sciencedirect.com/science/article/pii/S2772941923000029" }