Dissanayake, H.K.H.K.Punchi-Manage, S.A.R.2025-11-052025-11-052025-11-07Proceedings of the Postgraduate Institute of Science Research Congress (RESCON)-2025, University of Peradeniya,p893051-4622https://ir.lib.pdn.ac.lk/handle/20.500.14444/5979Monitoring and forecasting river water quality is critical for sustainable water resource management, particularly in densely populated regions. This study evaluates and compares the performance of deep learning and machine learning models in forecasting the Weighted Arithmetic Water Quality Index (WAWQI) across 20 monitoring stations along the River Thames. Physicochemical parameters, including pH, temperature, total suspended solids (TSS) and nitrates, collected from 2009 to 2017, were used to compute WAWQI, with missing values approximated through linear interpolation. Long Short-Term Memory (LSTM) neural networks were trained using an eight-month input sequence to capture temporal dependencies, while Extreme Gradient Boosting (XGBoost) models utilised lagged WAWQI values (lags 1 – 3) and time-based features. Both models were independently developed for each site and evaluated using an 80:20 train-test split. For benchmarking, classical time-series models such as Seasonal Auto-Regressive Integrated Moving Average (SARIMA) and Prophet were also applied. Results show that both LSTM and XGBoost outperform classical time-series models. Model performance was evaluated using Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE). LSTM outperformed XGBoost at most sites, achieving lower RMSE at 12 sites and lower MAE and MAPE at 13 sites, highlighting its strength in capturing seasonal and temporal trends. Although the largest MAPE improvement was relatively small (1.0%), it was consistent across sites. LSTM performance declined at sites with irregular or noisy data, where XGBoost provided more robust predictions by modelling non-linear relationships and adapting to varying patterns. In addition, LSTM required longer computational training time, while XGBoost was significantly faster and more efficient for frequent retraining. These findings emphasise the importance of tailoring model selection to site-specific characteristics. Combining temporal deep learning approaches with tree-based methods can enhance the reliability and scalability of water quality forecasting, supporting informed decision-making in environmental monitoring and water resource management.enEnvironmental monitoringExtreme gradient boostingLong short-term memoryTime-series forecastingWater quality indexComparative forecasting of water quality index using long short-term memory and extreme gradient boostingArticle