Abstract
{ "background": "The sustainable management of water treatment infrastructure in West Africa requires robust, data-driven tools for long-term financial planning. Existing evaluations often lack integrated forecasting methodologies that link operational parameters directly to future cost trajectories, creating a gap in strategic asset management.", "purpose and objectives": "This data descriptor presents a novel, curated dataset and a methodological framework for evaluating the cost-effectiveness of water treatment systems. The primary objective is to provide a replicable model for forecasting operational expenditure, enabling proactive budget allocation and infrastructure investment planning.", "methodology": "A longitudinal dataset was constructed from operational records of multiple treatment facilities. The core analytical method is a seasonal autoregressive integrated moving average (SARIMA) model, specified as $\\phi(B)\\Phi(B^s)(1-B)^d(1-B^s)^D yt = \\theta(B)\\Theta(B^s)\\epsilont$, where $y_t$ represents the monthly cost per cubic metre of treated water. Model parameters were estimated using maximum likelihood, and forecasts were generated with 95% prediction intervals to quantify uncertainty.", "findings": "The forecasting model indicates a persistent upward trend in real-term operational costs, with a projected mean increase of approximately 17% over the forecast horizon. Prediction intervals widen substantially beyond the short term, highlighting significant uncertainty in long-range forecasts driven by variable energy and chemical input prices.", "conclusion": "The developed dataset and SARIMA modelling framework provide a quantitatively rigorous foundation for assessing the financial sustainability of water treatment infrastructure. The approach moves beyond descriptive analysis to generate probabilistic, time-series forecasts essential for risk-informed engineering management.", "recommendations": "Infrastructure managers should adopt similar forecasting methodologies for capital and operational planning. Future data collection should prioritise the granular recording of energy consumption and chemical dosing metrics to enhance model precision and explanatory power.", "key words": "water treatment, cost forecasting, SARIMA modelling, infrastructure management, operational expenditure, predictive maintenance, West Africa", "contribution statement": "This work provides the first integrated dataset