Streamflow forecasting under non-homogeneous and discontinuous data conditions using LSTM: Application to Nam Pan basin
Main Article Content
Abstract
Streamflow forecasting in small mountainous basins faces significant challenges due to non homogeneous and discontinuous observational data caused by sensor failures, irregular sampling, and missing records. This study presents a Long Short-Term Memory (LSTM) deep learning approach to handle discontinuous time series for water level forecasting in the Nam Pan River basin, northern Vietnam. The methodology integrates multiple data preprocessing techniques including linear and PCHIP interpolation, outlier removal, and Z-score normalization to address data irregularities. Input features comprise rainfall observations from multiple stations, lagged water levels (6–24 hours), and cyclical time encoding. The LSTM model achieves Nash–Sutcliffe Efficiency of 0.86–0.91 for 6-hour forecasts and 0.70–0.78 for 24-hour forecasts, with R² values of 0.88–0.94 and forecast assurance of 82–90%. Results demonstrate the model's robustness in handling imperfect data, confirming its applicability for operational flash flood early warning systems in data-limited mountainous catchments.
Article Details
Keywords
LSTM, Nam Pan river basin, flow forecasting, small mountainous catchment
References
Vietnam Meteorological and Hydrological Administration (2020–2024), Rainfall and water level datasets from meteorological–hydrological stations in Son La Province, Ministry of Natural Resources and Environment.
Vietnam Academy for Water Resources (2022), Overview report on the Da River basin and hydrological system of the Northwest region, Ministry of Agriculture and Rural Development.
C. Shen (2018), “A transdisciplinary review of deep learning research and its relevance for water resources scientists,” Water Resources Research, vol. 54, no. 11, pp. 8558–8593.
F. Kratzert, D. Klotz, C. Brenner, K. Schulz, and M. Herrnegger (2019), “Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks,” Hydrology and Earth System Sciences, vol. 23, no. 10, pp. 5089–5112.
C. Hu, Q. Wu, H. Li, S. Jian, L. Wang, and J. Chen (2021), “Deep learning approaches for hydrological time series prediction: A review,” Hydrological Sciences Journal, vol. 66, no. 8, pp. 1271–1290.
P. Bai, X. Liu, and S. Hu (2023), “Integrating CNN-LSTM networks for streamflow prediction in data-scarce basins,” Journal of Hydrology, vol. 617, p. 128136.
S. Hochreiter and J. Schmidhuber (1997), “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780.
D. P. Kingma and J. Ba (2015), “Adam: A method for stochastic optimization,” in Proceedings of the 3rd International Conference on Learning Representations (ICLR).
Vietnam Academy for Water Resources (2022), Overview report on the Da River basin and hydrological system of the Northwest region, Ministry of Agriculture and Rural Development.
C. Shen (2018), “A transdisciplinary review of deep learning research and its relevance for water resources scientists,” Water Resources Research, vol. 54, no. 11, pp. 8558–8593.
F. Kratzert, D. Klotz, C. Brenner, K. Schulz, and M. Herrnegger (2019), “Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks,” Hydrology and Earth System Sciences, vol. 23, no. 10, pp. 5089–5112.
C. Hu, Q. Wu, H. Li, S. Jian, L. Wang, and J. Chen (2021), “Deep learning approaches for hydrological time series prediction: A review,” Hydrological Sciences Journal, vol. 66, no. 8, pp. 1271–1290.
P. Bai, X. Liu, and S. Hu (2023), “Integrating CNN-LSTM networks for streamflow prediction in data-scarce basins,” Journal of Hydrology, vol. 617, p. 128136.
S. Hochreiter and J. Schmidhuber (1997), “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780.
D. P. Kingma and J. Ba (2015), “Adam: A method for stochastic optimization,” in Proceedings of the 3rd International Conference on Learning Representations (ICLR).