Volume 16, Issue 1 e12854
ORIGINAL ARTICLE
Open Access

Exploring the role of the long short-term memory model in improving multi-step ahead reservoir inflow forecasting

Xinran Luo

Xinran Luo

State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan, China

Hubei Provincial Key Lab of Water System Science for Sponge City Construction, Wuhan University, Wuhan, China

Search for more papers by this author
Pan Liu

Corresponding Author

Pan Liu

State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan, China

Hubei Provincial Key Lab of Water System Science for Sponge City Construction, Wuhan University, Wuhan, China

Correspondence

Pan Liu, State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan 430072, China.

Email: [email protected]

Search for more papers by this author
Qianjin Dong

Qianjin Dong

State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan, China

Hubei Provincial Key Lab of Water System Science for Sponge City Construction, Wuhan University, Wuhan, China

Search for more papers by this author
Yanjun Zhang

Yanjun Zhang

State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan, China

Hubei Provincial Key Lab of Water System Science for Sponge City Construction, Wuhan University, Wuhan, China

Search for more papers by this author
Kang Xie

Kang Xie

State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan, China

Hubei Provincial Key Lab of Water System Science for Sponge City Construction, Wuhan University, Wuhan, China

Search for more papers by this author
Dongyang Han

Dongyang Han

State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan, China

Hubei Provincial Key Lab of Water System Science for Sponge City Construction, Wuhan University, Wuhan, China

Search for more papers by this author
First published: 12 September 2022
Citations: 1

Funding information: Innovation Team in Key Field of the Ministry of Science and Technology, Grant/Award Number: 2018RA4014; National Natural Science Foundation of China, Grant/Award Number: 51861125102; the Joint Funds of the National Natural Science Foundation of China, Grant/Award Number: U1865201

Abstract

Daily inflow forecasting is of vital importance in reservoir economic operation. In the context of hydrometeorological forecasting, the effectiveness of the data-driven models has been demonstrated as bias correctors for physically-based models or direct forecasting models. However, existing studies only highlight the performance improvements provided by the data-driven model, lacking a comprehensive investigation on whether the data-driven model should be used as bias correctors or direct forecasting models. This study constructs long short-term memory (LSTM)-based preprocessing and postprocessing techniques for a hydrological model, which are tested by linear scaling preprocessing and autoregressive (AR) postprocessing models. The integrated model is compared with the LSTM-only model. The Shuibuya and Zuojiang reservoirs in China are selected as case studies. Results indicate that: (1) LSTM-based bias correctors are effective in both preprocessing and postprocessing and (2) the integrated model is comparable to the LSTM-only model when trained with four or more years of data, while it is better than the LSTM-only model when trained with less data. These findings demonstrate that data-driven methods can effectively correct the bias in physically-based model output, and integrating the physical and data-driven models is useful in improving multi-step ahead reservoir inflow forecasting if limited data can be obtained.

1 INTRODUCTION

Days-ahead reservoir inflow forecasting is an indispensable component of dam operations around the world (Ahmad & Hussain, 2019). Reliable and accurate forecasts can significantly improve reservoir efficiencies with respect to socially important needs, such as flood control and hydropower generation (Anghileri et al., 2016; Anghileri et al., 2019; Li et al., 2018; Liu et al., 2015), while forecasts with a low accuracy may increase the risk of flooding and the unnecessary loss of water resources (Gong et al., 2020; Liu et al., 2020; Miao et al., 2016; Yang et al., 2018; Zhang et al., 2019).

With recent advances in meteorology and computing science, global scale short-to-medium-range numerical weather predictions (NWPs) are increasingly accurate over catchment scale, making them more valuable in informing reservoir operations (Ahmad & Hossain, 2020; Anghileri et al., 2019; Bauer et al., 2015; Monhart et al., 2019; Nanda et al., 2019; Sharma et al., 2018). Current reservoir inflow forecasting methods can be divided into two categories: physically-based modeling chains (Ahmad & Hossain, 2020; Deng et al., 2015; Georgakakos et al., 2014; Nanda et al., 2019; Peng et al., 2019) and data-driven direct forecast models (Ahmad & Hussain, 2019; Liao et al., 2020; Liu & Coulibaly, 2011; Zhong et al., 2018).

The physically-based modeling chains generally consist of four components: a climate model for weather prediction, a preprocessing technique for bias removal and downscaling, a hydrological model for rainfall-runoff modeling, and a postprocessing technique for real-time streamflow forecast updating. One major concern with these methods is the accumulated errors caused by weather and hydrological uncertainties (Gui et al., 2019; Liu et al., 2018; McInerney et al., 2020; Verkade et al., 2013; Xu et al., 2021). Statistical preprocessors and postprocessors are commonly adopted to mitigate this problem. A substantial number of researches have demonstrated that both implementing preprocessing and postprocessing separately and jointly are effective means of improving the forecast quality (Anghileri et al., 2019; da Silva et al., 2018; Fernández Bou et al., 2015; Woldemeskel et al., 2018; Zhang, Liu, Cheng, et al., 2018; Zhang, Liu, Zhao, et al., 2018). But the contributions of preprocessing and postprocessing are strongly dependent on the forecast system (Sharma et al., 2018). For instance, Sharma et al. (2018) found that postprocessing alone is more powerful than preprocessing, while Benninga et al. (2017) found that a strategy with no preprocessing or postprocessing produces the best evaluation scores. Overall, although the benefits from preprocessing and postprocessing are influenced by complicated factors, the joint implementation of preprocessing and postprocessing is necessary. In this way, the weather and hydrological errors can be quantified, although not completely, resulting in more reliable forecasts (Bennett et al., 2014).

Data-driven methods can directly link raw meteorological forecasts to reservoir inflow, and then learn from the latent relationship between the input and output data. Some researchers have shown that there are advantages to using artificial neural networks for inflow forecasting (Ahmad & Hossain, 2020; Ahmad & Hussain, 2019; Liao et al., 2020). Some others also use them as bias correctors (Nanda et al., 2019; Xu et al., 2021). However, traditional machine learning methods are very sensitive to the selected input features and prone to becoming trapped around local optima (Ahmad & Hussain, 2019; Sudheer et al., 2002). In recent years, deep learning (DL) has received considerable attention in various disciplines for its excellent model performance, efficiency, automation, and flexibility compared with conventional machine learning models (Feng et al., 2020; Shen, 2018). Specific to the field of hydrology, the DL model known as long short-term memory (LSTM) (Hochreiter & Schmidhuber, 1997) is commonly adopted for its ability to deal with time-series-related problems. The power of the LSTM-based model for hydrological modeling (HM; Jiang et al., 2020; Kratzert et al., 2018; Kratzert et al., 2019), streamflow prediction (Feng et al., 2020; Qi et al., 2019; Zuo et al., 2020), and multi-step-ahead flood forecasting (Kao et al., 2020; Xiang et al., 2020) has been extensively tested. Moreover, several studies have investigated the ability of the LSTM model in learning from mismatch (Lu et al., 2021; Ma et al., 2019). LSTM achieves unprecedented precision in these cases. However, existing studies solely highlight the performance improvements provided by the LSTM model, lacking a comprehensive investigation on whether the data-driven model should be used as a bias corrector or a direct forecast model.

The main object of this study is to investigate the role of the LSTM model in improving multi-step ahead reservoir inflow forecast. Two questions will be explored: (1) Can the LSTM model be used to improve preprocessing and postprocessing? (2) What are the advantages of physically-based models combined with LSTM correctors and LSTM-based direct forecast models? The organization of the remainder of this paper is as follows. Section 2 describes the model and experimental settings. Section 3 introduces the study sites, data, and hydrological models. Results and discussions about inflow forecasts are presented in Section 4. Finally, the conclusions of this study are presented in Section 5.

2 METHODOLOGY

As illustrated in Figure 1, this study mainly consists of four parts:
  1. Preparing the needed data, which includes forecasted and observed forcing (e.g., rainfall, temperature, and evaporation) and observations of reservoir inflow.
  2. Developing and evaluating LSTM-based bias correctors for a hydrological model. The LSTM model is developed for rainfall preprocessing and inflow postprocessing, respectively. The effectiveness of the LSTM-based bias correctors is evaluated using conventional processors, that is, linear scaling (LS) and autoregressive (AR) model.
  3. In Part III, the better performing integrated forecast model is further compared with the LSTM-based direct forecast model.
  4. Data-driven models usually require large amounts of data for model training. However, with the continuous upgrading of climate services, consistent and long-term records of NWP data are difficult to be collected. The purpose of Part IV is to evaluate the influence of sample size on the comparison results between the integrated model and the LSTM-only model.
Details are in the caption following the image
Framework for model development and comparison. AR, autoregressive; LS, linear scaling; LSTM, long short-term memory

2.1 LSTM network and model settings

LSTM is an evolutionary version of a simple recurrent neural network, with a more elaborate design to improve the model performance under long-term dependencies (Hochreiter & Schmidhuber, 1997). All the parameters involved in LSTM cells can be learned and updated automatically according to the latent relationship between the input data and the target data. This “end-to-end” pattern endows LSTM with outstanding application value in various fields because it removes the need to manually extract the model features. In this study, LSTM-based models are used for preprocessing, postprocessing, and direct reservoir inflow forecasting. An illustration of the input and output schemes are shown in Figure 2. Blue and pink rhombuses represent raw, and bias corrected forecasts within lead time, respectively. Yellow and gray rhombuses indicate observations and raw forecasts at recent time periods, respectively.

Details are in the caption following the image
Illustration of the input and output schemes of Equations (1-3)

2.1.1 Preprocessing of rainfall forecasts

To remove the inherent biases in climate model output and provide reliable estimations of rainfall uncertainty, the LSTM model is set up for preprocessing. The error characteristics of rainfall forecasts vary with lead time; therefore, the LSTM preprocessor is developed for each lead time, respectively. The input and output scheme is as follows:
P pre , t i = f P obs , t , P obs , t - 1 , , P obs , t - l pre , P raw , t - i i , P raw , t - i - 1 i , , P raw , t - i - l pre i , i = 1 , 2 , , T P raw , t i , P raw , t i - 1 , , P raw , t 1 (1)
where lpre denotes the time lag used in preprocessing; T is the maximum lead time (set to 5 days in this study); P obs , t is the observed rainfall at time t; P raw , t i and P pre , t i are the raw and preprocessed rainfall at time t + i for the lead time i, respectively.

2.1.2 Postprocessing of inflow forecasts

Inflow postprocessing relies on the autocorrelation of error series (McInerney et al., 2020; Zhang, Liu, Zhao, et al., 2018). The LSTM model is developed to correct the inflow forecasts for each lead time based on the most recent known errors, recent observation of reservoir inflow and climate forcing, and forecasted reservoir inflow and climate forcing within lead time:
Q fore , t i = f e t i , e t - 1 i , , e t - l post i , F obs , t , F obs , t - 1 , , F obs , t - l post , Q obs , t , Q obs , t - 1 , , Q obs , t - l post , i = 1 , 2 , , T F pred , t i , F pred , t i - 1 , , F pred , t 1 , Q raw , t i , Q raw , t i - 1 , , Q raw , t 1 (2)
where lpost denotes the time lag used in postprocessing; e t i is the inflow forecast error at time t for the lead time i; F obs , t is the observed climate forcing at time t; F pred , t i is the bias corrected climate forcing at time t + i for the lead time i; Q raw , t i and Q fore , t i are the raw and final reservoir inflow forecasts at time t + i for the lead time i, respectively.

2.1.3 LSTM-based direct forecast model

The LSTM model is also developed as a direct predictor of reservoir inflow. Taking the same inputs as the integrated model, the LSTM-based direct forecast model is expected to learn from the latent relationship between input and target variables without prior knowledge about the bias correction-HM-bias correction process. For the forecast of the reservoir inflow at time t + i with a lead time of i, the inputs include the recent lfore days' observation of reservoir inflow and climate forcing, raw rainfall forecasts for the time t − lfore to t, and real-time forecasts of climate forcing within lead time. The formula is as follows:
Q fore , t i = f Q obs , t , Q obs , t - 1 , , Q obs , t - l dire , F obs , t , F obs , t - 1 , , F obs , t - l dire , P raw , t 1 i , P raw , t - 1 1 i , , P raw , t - l dire 1 i , F raw , t i , F raw , t i - 1 , , F raw , t 1 , i = 1 , 2 , , T (3)
where ldire denotes the time lag used in the direct forecast; P raw , t 1 i represents rainfall forecasts for the time t with lead times from 1 to i; F raw , t i is the raw climate forcing from the climate model at time t + i for the lead time i.

As the learning of the LSTM model has nonnegligible randomness, several duplicate runs should be conducted. The best solution is then used for further calculations.

2.2 Benchmark processors and experimental setting

To put the LSTM models for preprocessing and postprocessing into context, benchmark preprocessing and postprocessing techniques are tested.

For preprocessing, several studies have demonstrated that the LS model is effective in correcting the bias of the climate model output (da Silva et al., 2018; Goshime et al., 2020; Ji et al., 2020). The LS is a mean-based transformation method and is easy to implement. The transformation equations are established for each grid as flows:
P obs i = α P raw i (4)
where P obs i and P raw i are observed and raw rainfall for grid i, respectively.

To further improve the skill of inflow forecasts, forecast-based postprocessing techniques are implemented. In which postprocessing models are developed using simulations from an NWP-forced hydrological model so that the NWP and hydrological uncertainties can be simultaneously ameliorated. An AR model is selected in this study. The AR model is established for each lead time using the most recent known error (Zhang, Liu, Zhao, et al., 2018). The order of the AR model is set as two, and its parameters are determined by the modified recursive least square method.

Finally, nine forecast schemes are established (see Table 1). The schemes LS-HM, LSTM-HM, HM-AR, and HM-LSTM indicate a hydrological model combined with a preprocessor or postprocessor alone. The schemes LS-HM-AR, LS-HM-LSTM, LSTM-HM-AR, and LSTM-HM-LSTM indicate a hydrological model combined with both a preprocessor and postprocessor. The LSTM-only model means the LSTM-based direct forecast model.

TABLE 1. Forecast schemes
Forecasting methods Schemes Preprocessing Hydrological modeling (HM) Postprocessing
LS LSTM AR LSTM
HM with bias correctors LS-HM
LSTM-HM
HM-AR
HM-LSTM
LS-HM-AR
LS-HM-LSTM
LSTM-HM-AR
LSTM-HM-LSTM
The LSTM-only model Customized LSTM for each lead time
  • Abbreviations: AR, autoregressive; LS, linear scaling; LSTM, long short-term memory.

2.3 Evaluation metrics

Metrics for model evaluation include Nash–Sutcliffe efficiency (NSE) coefficient (Nash & Sutcliffe, 1970), Pearson's correlation coefficient (PCC), and bias. The percent bias in 2% high-segment flow (FHV) and 30% low-segment flow (FLV) are also calculated (Yilmaz et al., 2008). Different from Yilmaz et al. (2008), the term FLV is not calculated in the log space for the existing of flows less than or equal to zero. Only indices for the test period are presented.

2.3.1 Nash–Sutcliffe efficiency coefficient

NSE = 1 - t = 1 N Q sim , t - Q obs , t 2 t = 1 N Q obs , t - Q ¯ obs 2 (5)
where Q sim , t and Q obs , t are the simulated and observed reservoir inflow at time t, respectively; Q ¯ obs is observed mean of reservoir inflow.

2.3.2 Pearson's correlation coefficient

PCC = t = 1 N Q sim , t - Q ¯ sim Q obs , t - Q ¯ obs t = 1 N Q sim , t - Q ¯ sim 2 Q obs , t - Q ¯ obs 2 (6)
where Q ¯ sim , t is the simulated mean of reservoir inflow.

2.3.3 Bias

Bias = t = 1 N Q sim , t - Q obs , t t = 1 N Q obs , t (7)

2.3.4 High-segment flow

FHV = h = 1 H Q sim , h - Q obs , h h = 1 H Q obs , h × 100 (8)
where h = 1, 2, …, H are flow indices for flows within exceedance probabilities lower than 0.02.

2.3.5 Low-segment flow

FLV = - 1 l = 1 L Q sim , l - Q sim , L l = 1 L Q obs , l - Q obs , L l = 1 L Q obs , l - Q obs , L × 100 (9)
where l = 1, 2, …, L are flow indices for flows with exceedance probabilities greater than 0.7; L is the index for the minimum flow.

3 CASE STUDIES

Two case studies are conducted to illustrate the role of the LSTM model in improving the accuracy of reservoir inflow forecasting. The location of the reservoirs, used data, and rainfall-runoff models are presented in this section.

3.1 Location

Figure 3 presents the location of the Shuibuya and Zuojiang reservoirs. The Shuibuya reservoir is located in Hubei province, China. As the first cascade in the mainstream of Qingjiang River, a branch of the Yangtze River, the main operational purposes of this dam are flood control and power generation. The drainage area that contributes to the Shuibuya reservoir inflow covers 10,860 km2, accounting for 64% of the whole watershed. Qingjiang river basin has a subtropical monsoon climate. Some 80% of the annual rainfall occurs from April to September. Storm flood events are frequent because of the complex climate and topography, threatening local security and the economy. Therefore, accurate reservoir inflow forecasting is desperately needed. The Zuojiang reservoir has a drainage area of 26,173 km2 and is located in the middle reaches of the Zuojiang river. The dam is primarily used for hydropower generation and has comprehensive utilization benefits such as irrigation, shipping, tourism, and aquaculture. The Zuojiang river, with a total length of 539 km, is a transborder river whose upper reaches belong to Vietnam (about 40% of the drainage area of the Zuojiang reservoir), and the middle and lower reaches belong to China.

Details are in the caption following the image
Location, drainage area, river, and gage stations for (a) Shuibuya reservoir and (b) Zuojiang reservoir

3.2 Data and HM

3.2.1 Shuibuya reservoir

The available data for this area consists of reservoir inflows to Shuibuya reservoir, rainfall, and maximum/minimum temperatures, all of which were obtained for the period 2014–2019. These years include relatively dry, normal, and wet conditions. The reservoir inflows, which are calculated through the inversion of the water balance function based on the observed reservoir level and outflows, were obtained from the operator of the Shuibuya reservoir. The meteorological data were recorded at the gage stations shown in Figure 3. Rainfall data were collected from the operator of the Shuibuya reservoir, and temperature data were obtained from the National Meteorological Information Center, China (http://data.cma.cn). After considering the time difference between weather predictions, all the obtained data were converted to daily time scales to satisfy the forecasting objectives of this research.

Real-time NWP data were obtained from the National Centers for Environmental Prediction, America (NCEP). The NCEP operational Global Forecast System gives global 16-day forecasts on a 0.25° spatial scale (about 30 km horizontal resolution) every 6 h from January 15, 2015, to the current time (https://rda.ucar.edu/datasets/ds084.1/). The first 10 days of forecasts are given at 3-h intervals, with subsequent predictions given at 12-h intervals. The first 5 days of forecasts are used in this study as the errors of weather predictions increase with the growth of lead time.

Generally, a climate model and a distributed hydrological model are combined for modeling the runoff processes. This is because the climate model outputs fit well with the gridded form of the distributed hydrological model inputs. The macroscale variable infiltration capacity (VIC) distributed hydrological model (Liang et al., 1994) is chosen to simulate the reservoir inflow. The model runs at a 0.25° grid spatial scale, including inputs of the rainfall and the maximum/minimum temperatures. The calibration and validation of the VIC model are detailed in Appendix A.

3.2.2 Zuojiang reservoir

Hourly observed data, including reservoir inflow and rainfall, and daily rainfall forecasts are obtained from January 2013 to October 2020. These data are collected from the operator of this dam. The estimated reservoir inflow, using the water balance method, is regarded as the “observed” inflow. The rainfall forecasts are issued every 12 h by the Guangxi Zhuang Autonomous Region Meteorological Bureau, China, and have a forecast horizon of 5 days. Both the observed and forecasted rainfall is given as an average. The conceptual rainfall-runoff model Xinanjiang (Zhao, 1992), which has been widely used for humid catchments in China, is selected to simulate the reservoir inflow. Besides, running a Xinanjiang model requires both rainfall and evapotranspiration (ET) data. Daily ET from 2013 to 2019 was downloaded from the National Meteorological Information Center (http://data.cma.cn). To fill the gaps of different time spans between rainfall and ET and lacking of ET forecasts, the ET data are averaged to 1 year because that ET has relatively little influence on reservoir inflow. Then, each year shares the same ET when running the Xinanjiang model. The calibration and validation of the Xinanjiang (XAJ) model are detailed in Appendix A.

4 FORECAST RESULTS AND DISCUSSIONS

4.1 Effectiveness of LSTM-based bias correctors

To demonstrate the effectiveness of LSTM-based bias correctors, a total of eight schemes are obtained by considering the configurations of HM with only a preprocessor, postprocessor, and both Table 2 and Table 3 present the forecast results for Shuibuya and Zuojiang reservoirs, respectively.

TABLE 2. Shuibuya: Performance of variable infiltration capacity (VIC) model with different processing scheme for multi-step ahead reservoir inflow forecast
Metrics Lead time (day) Schemes
LS-VIC LSTM-VIC VIC-AR VIC-LSTM LS-VIC-AR LS-VIC-LSTM LSTM-VIC-AR LSTM-VIC-LSTM
NSE t + 1 0.65 0.66 0.50 0.82 0.71 0.84 0.74 0.84
t + 2 0.57 0.59 0.22 0.70 0.58 0.72 0.61 0.72
t + 3 0.54 0.53 0.02 0.57 0.55 0.61 0.55 0.66
t + 4 0.40 0.43 −0.31 0.52 0.40 0.55 0.44 0.58
t + 5 0.29 0.40 −0.76 0.44 0.29 0.45 0.40 0.47
PCC t + 1 0.82 0.81 0.81 0.91 0.85 0.92 0.86 0.92
t + 2 0.78 0.77 0.76 0.84 0.78 0.86 0.79 0.86
t + 3 0.76 0.73 0.74 0.76 0.77 0.80 0.75 0.82
t + 4 0.68 0.67 0.65 0.77 0.68 0.76 0.68 0.79
t + 5 0.63 0.65 0.58 0.68 0.63 0.67 0.65 0.73
Bias t + 1 0.07 −0.03 0.16 −0.07 0.05 0.02 −0.02 0.02
t + 2 0.10 −0.05 0.30 0.04 0.09 −0.04 −0.04 −0.03
t + 3 0.11 −0.08 0.36 −0.04 0.10 0.07 −0.07 0.10
t + 4 0.12 −0.12 0.43 −0.04 0.12 −0.12 −0.13 0.00
t + 5 0.12 −0.13 0.48 0.01 0.12 −0.03 −0.13 0.09
FHV t + 1 −0.21 −0.29 −0.13 −0.11 −0.19 −0.07 −0.23 −0.07
t + 2 −0.27 −0.37 −0.14 −0.14 −0.28 −0.15 −0.36 −0.12
t + 3 −0.27 −0.43 −0.25 −0.42 −0.28 −0.26 −0.43 −0.17
t + 4 −0.40 −0.56 −0.22 −0.25 −0.40 −0.33 −0.56 −0.18
t + 5 −0.41 −0.55 −0.67 −0.43 −0.41 −0.56 −0.54 −0.35
FLV t + 1 −0.20 −0.02 −0.24 −0.08 −0.22 −0.39 −0.21 −0.39
t + 2 −0.30 −0.04 −0.39 −0.27 −0.28 −0.22 −0.08 −0.21
t + 3 −0.40 −0.08 −0.26 −0.52 −0.29 −0.38 −0.15 −0.60
t + 4 −0.49 −0.05 −0.35 0.32 −0.42 0.39 0.00 −0.38
t + 5 −0.53 −0.07 −0.42 −0.29 −0.51 −0.72 −0.08 −0.47
  • Note: Bold text corresponds to the best-performing scheme.
  • Abbreviations: AR, autoregressive; FHV, high-segment flow; FLV, low-segment flow; LS, linear scaling; LSTM, long short-term memory; NSE, Nash–Sutcliffe efficiency; PCC, Pearson's correlation coefficient.
TABLE 3. Zuojiang: Performance of Xinanjiang (XAJ) model with different processing scheme for multi-step ahead reservoir inflow forecast
Metrics Lead time (day) Schemes
LS-XAJ LSTM-XAJ XAJ-AR XAJ-LSTM LS-XAJ-AR LS-XAJ-LSTM LSTM-XAJ-AR LSTM-XAJ-LSTM
NSE t + 1 0.64 0.64 0.90 0.96 0.90 0.96 0.90 0.96
t + 2 0.59 0.59 0.70 0.88 0.70 0.87 0.70 0.88
t + 3 0.49 0.49 0.53 0.80 0.53 0.80 0.51 0.79
t + 4 0.42 0.40 0.42 0.70 0.44 0.72 0.41 0.73
t + 5 0.39 0.36 0.39 0.67 0.42 0.68 0.39 0.69
PCC t + 1 0.87 0.87 0.95 0.98 0.95 0.98 0.96 0.98
t + 2 0.83 0.83 0.86 0.94 0.86 0.93 0.86 0.94
t + 3 0.78 0.76 0.80 0.89 0.78 0.90 0.77 0.89
t + 4 0.73 0.69 0.75 0.84 0.73 0.85 0.70 0.85
t + 5 0.70 0.65 0.72 0.83 0.71 0.83 0.67 0.82
Bias t + 1 0.17 0.17 0.03 −0.03 0.03 0.04 0.03 0.01
t + 2 0.15 0.14 0.05 −0.01 0.04 −0.03 0.04 −0.05
t + 3 0.12 0.10 0.06 −0.01 0.05 0.01 0.04 −0.01
t + 4 0.09 0.05 0.08 −0.09 0.05 0.06 0.02 −0.04
t + 5 0.05 0.01 0.07 −0.02 0.02 0.06 −0.02 0.04
FHV t + 1 −0.07 −0.08 −0.01 0.02 −0.01 0.02 −0.01 0.00
t + 2 −0.17 −0.18 −0.15 −0.16 −0.15 −0.14 −0.15 −0.11
t + 3 −0.31 −0.34 −0.27 −0.29 −0.31 −0.24 −0.34 −0.29
t + 4 −0.38 −0.46 −0.32 −0.28 −0.38 −0.29 −0.46 −0.27
t + 5 −0.46 −0.56 −0.40 −0.32 −0.45 −0.34 −0.54 −0.33
FLV t + 1 −5.35 −5.35 0.06 −0.65 0.06 −0.18 0.06 0.30
t + 2 −5.25 −5.27 0.24 0.01 0.28 0.18 0.30 0.02
t + 3 −5.29 −5.35 0.37 −0.33 0.44 −0.86 0.46 −0.31
t + 4 −5.39 −5.47 0.32 −0.25 0.31 −0.94 0.28 −0.14
t + 5 −5.50 −5.60 0.26 −0.28 0.29 −0.33 0.29 −0.79
  • Note: Bold text corresponds to the best-performing scheme.
  • Abbreviations: AR, autoregressive; FHV, high-segment flow; FLV, low-segment flow; LS, linear scaling; LSTM, long short-term memory; NSE, Nash–Sutcliffe efficiency; PCC, Pearson's correlation coefficient.

The scores are generally better when the preprocessing and postprocessing technique are jointly implemented, which suggests that both the rainfall and hydrological uncertainty should be considered in hydrometeorological forecasting. However, the relative importance of preprocessing and postprocessing is not consistent in the two cases, which depend strongly on the selected models and research objects (Sharma et al., 2018). Within the preprocessing methods, the LSTM-based preprocessor corrects the bias in rainfall forecasts more effectively (detailed in Appendix B). The LS is a mean-based correction technique, which removes systematic bias in rainfall forecasts, while the relationship between the observations and forecasts is not simply linear. Nonetheless, there are also some cases in which the LS preprocessor makes the inflow forecasts more accurate. For instance, the scheme LS-XAJ provides better 4–5 days-ahead forecasts than LSTM-XAJ (Table 3). This is understandable because uncertainties from the hydrological model are included in the evaluation when the observed reservoir inflow is used as the benchmark. For postprocessing, the AR model corrects inflow forecasts effectively and produces consistent improvements for both reservoirs and all lead times. The gains in NSE value from AR vary from 0 to 0.06 and 0 to 0.26 for Shuibuya and Zuojiang reservoirs, respectively. The improvements seem to be larger at the shorter lead times. Using the same input, schemes with an LSTM postprocessor give further improvements in NSE, which range from 0.06 to 0.31. Skill improvements are also observed in terms of PCC, Bias, FLV, and FHV. The schemes with both an LSTM-based preprocessor and postprocessor perform best among all the schemes, although the differences are not significant in some cases, for example, scheme XAJ-LSTM versus LSTM-XAJ-LSTM for 4 days ahead forecasts of Zuojiang reservoir. Importantly, the LSTM-based bias correctors provide consistent improvements for preprocessing and postprocessing across lead times.

4.2 Comparison of the hybrid and LSTM-only models

This subsection examines whether a process-based model with bias correctors or a pure data-driven model performs better in multi-step ahead reservoir inflow forecasts. Figure 4 (Shuibuya) and Figure 5 (Zuojiang) illustrate the skill indices of the better-performing integrated scheme, that is, HM with LSTM correctors and the customized LSTM model for each lead time.

Details are in the caption following the image
Shuibuya: Performance of the integrated and long short-term memory (LSTM)-only models for validation years (2018–2019) from 1 to 5 days ahead. FHV, high-segment flow; FLV, low-segment flow; NSE, Nash–Sutcliffe efficiency; PCC, Pearson's correlation coefficient
Details are in the caption following the image
Zuojiang: Performance of the integrated and long short-term memory (LSTM)-only models for validation years (2019–2020) from 1–5 days ahead. FHV, high-segment flow; FLV, low-segment flow; NSE, Nash–Sutcliffe efficiency; PCC, Pearson's correlation coefficient

For the Shuibuya reservoir, the integrated model provides better scores across lead times. The NSE value (Figure 4a) decreases from 0.84 to 0.47 and 0.81 to 0.44 for the integrated model and LSTM-only model, respectively. The discrepancies in NSE value between the two paradigms range from 0.03 to 0.06, indicating that slight improvements can be obtained by using the integrated model. Similar results are obtained for the metrics of PCC, FHV, and FLV. Besides, the bias of the two models is within 10% for all lead times.

For the Zuojiang reservoir, however, there are no significant differences between the integrated model and the LSTM-only model. The NSE value for the integrated model is 0.96 at 1 day ahead forecast, and it drops to 0.69 at 5 days ahead forecast. For the LSTM-only model, the NSE value ranges from 0.96 to 0.67 for forecasts from one to 5 days ahead. The discrepancies in the NSE value are lower than 0.02, which can be attributed to the randomness in the training process (Gauch et al., 2021). The results of the other four metrics also show that the integrated model produces comparable forecasts with the LSTM-only model.

Figure 6 displays two typical flood events of the Shuibuya reservoir. The 1, 3, and 5 day(s) ahead forecasts are presented separately. The integrated and LSTM-only models performed similarly for the one-day ahead forecasts. Both models underestimated the flood peak of flood event 20,180,506 while overestimated the flood peak of flood event 20,180,706. It can be observed that the rainfall was overestimated for the two flood events. The overestimation of the flood peak can be easily attributed to the overestimations in rainfall forecasts, and the underestimation of the flood peak may be caused by other factors, for example, incorrect forecast of the temporal and spatial distribution of rainfall. For the 3 days ahead forecasts, the two models give close results for the flood event 20,180,506. However, the integrated model performs better for the forecasting of the flood event 20,180,706. The NSE value of the integrated model is 0.26 higher than the LSTM-only model for its better capturing of the flood peak. When the lead time grows to 5 days (Figures 5f and 6e), both the integrated and LSTM-only models struggle to capture the flood. The two models wrongly forecasted the timing of the flood peak for both flood events, which is consistent with the bias in rainfall forecasts.

Details are in the caption following the image
Shuibuya: Observations and forecasts of flood events 20,180,506 and 20,180,706. (a, b) Combination of 1 day ahead forecasts; (c, d) combination of 3 days ahead forecasts; (e, f) combination of 5 days ahead forecasts. LSTM, long short-term memory; NSE, Nash–Sutcliffe efficiency; PCC, Pearson's correlation coefficient

Figure 7 describes the forecasts for the Zuojiang reservoir. For the flood event 20,190,805, the integrated and LSTM-only models provide similar and accurate forecasts for 1, 3, and 5 days ahead, with the NSE value greater than 0.90. For the flood event 20,201,016, both models produced accurate forecasts for 1 day ahead. The LSTM-only model underestimated the flood peak in 3 days ahead forecasts, while the integrated model underestimated the flood peak in 5 days ahead forecasts. These underestimations may be attributed to the underestimations in rainfall forecasts because the reservoir inflow with a longer lead time is less related to the most recent inflow observations but highly related to rainfall forecasts.

Details are in the caption following the image
Zuojiang: Observations and forecasts of flood events 20,190,805 and 20,201,016. (a, b) Combination of 1 day ahead forecasts; (c, d) combination of 3 days ahead forecasts; (e, f) combination of 5 days ahead forecasts. LSTM, long short-term memory; NSE, Nash–Sutcliffe efficiency; PCC, Pearson's correlation coefficient

From the analyses above, it can be found that the integrated model provides better inflow forecasts of the Shuibuya reservoir than the LSTM-only model. These improvements are significant in statistics, with the discrepancy in NSE value ranging from 0.03 to 0.06, but not translated into practical improvements in flood hydrographs. For the Zuojiang reservoir, the integrated model and LSTM-only model show similar performance in evaluation metrics and flood hydrographs.

4.3 Influence of data length

The forecast results of the Shuibuya and Zuojiang reservoirs lead to different conclusions about the comparison of the integrated model and LSTM-only model. It is generally accepted that the length of training data has a significant influence on the performance of data-driven models. However, the length of the collected data of the Shuibuya reservoir is different from that of the Zuojiang reservoir. The data length of Shuibuya reservoir is 5 years (3 years data for model training), while the data length of Zuojiang reservoir is 8 years (6 years data for model training). Therefore, the influence of the data length must be considered. Taking Zuojiang reservoir as the testbed, the integrated model and LSTM-only model are trained with varying lengths of data, for example, 1, 2, 4, and 6 years, and the model performances are evaluated with the data of the last 2 years.

The comparison results are presented in Figure 8. Figure 8a depicts the NSE value of the integrated model and LSTM-only model under different lengths of training data and lead times. It can be found that the performance of the LSTM-only model decreases rapidly due to under-fitting, while the integrated model provides more robust forecasts under varying data lengths. The differences in NSE value (Figure 8b) between the two models become larger at the long lead times. This means that explicitly considering a rainfall preprocessing, rainfall-runoff modeling, and inflow postprocessing process is helpful for multi-step ahead reservoir inflow forecast. A well-calibrated rainfall-runoff model can reasonably represent the reservoir inflow process under specific initial conditions and predicted weather conditions. The errors of inflow forecasts mainly originate from the uncertainties in rainfall forecasts and defects of the rainfall-runoff model. The forecasting quality can be effectively improved if the two types of errors are corrected. For the LSTM-only model, there is no such a process guide. With increasing uncertainties in rainfall forecasts and decreasing useful information abstracted from recent observations as lead time grows, it becomes harder for the LSTM-only model to capture the latent relationship between input and output data. However, increasing the length of training data can greatly narrow the accuracy gap between the integrated and LSTM-only models; similar conclusions were drawn by Wang et al. (2019). For 5 days ahead forecasts, the difference in NSE value decreases from 0.11 to 0.02 when the length of training data grows from 1 to 6 years.

Details are in the caption following the image
Comparison of the integrated model and long short-term memory (LSTM)-only model with the length of training data being 1, 2, 4, and 6 years, respectively. (a) Scatter diagram of the Nash–Sutcliffe efficiency (NSE) value; (b) differences in the NSE value. When the difference is positive, the integrated model performs better

5 CONCLUSIONS

In this study, the role of the LSTM model in multi-step ahead reservoir inflow forecast was investigated by considering the integrated model (using the LSTM model as a bias corrector for a hydrological model) and LSTM-only models. First, the effectiveness of LSTM-based correctors in preprocessing and postprocessing is assessed by comparing them with conventional processors. Second, the performance of the integrated model and LSTM-only models is analyzed. Two reservoirs were taken as examples. The main conclusions are as follows:
  1. LSTM models for bias correction are effective in both preprocessing and postprocessing. In preprocessing, the LSTM corrector is superior to the linear transformation method, with lower root mean square error (RMSE) and higher PCC score. For postprocessing, the gains in NSE value from AR vary from 0 to 0.06 and 0 to 0.26 for Shuibuya and Zuojiang reservoirs, respectively. Using the same input, the LSTM postprocessor gives further improvements in NSE, which range from 0.06 to 0.31.
  2. The integrated model provides better results than the LSTM-only model in the Shuibuya reservoir, while it is comparable to the LSTM-only model in the Zuojiang reservoir. For the Shuibuya reservoir, slight improvements can be obtained by using the integrated model, with the discrepancy in NSE value ranging from 0.03 to 0.06. For the Zuojiang reservoir, the discrepancies in NSE value are lower than 0.02, which can be attributed to the randomness in the training process. Furthermore, with the Zuojiang reservoir as a testbed, the two types of models are trained with varying data lengths. The integrated model is comparable to the LSTM-only when trained with more than 4 years of data, while it is better than the LSTM-only under limited data, especially for the longer lead time. The difference in NSE value for 5 days ahead forecasts increases from 0.02 to 0.11 when the length of training data decreases from 6 to 1 year. This indicates that the length of training data has a significant influence on the integrated model and LSTM-only model.

This study demonstrated the efficacy of a rainfall-runoff model with LSTM-based bias correctors for flood forecasting, which is valuable for flood risk mitigation. However, some points need to be further investigated. The conclusions of this study can be verified at more catchments. Besides, with the continuous upgrading of climate services, it is difficult to collect consistent and long-term records of NWP data, which may limit the accuracy of the flood forecasting. One potential way to mitigate this problem is to develop a global LSTM model (Kratzert et al., 2018; Kratzert et al., 2019). At last, the conclusions emphasize the understanding of each part of the modeling chain, which implies that the performance of the LSTM model and other data-driven models may be improved by including the human cognition of physical processes, especially when limited data can be obtained.

ACKNOWLEDGMENTS

This study was supported by the Joint Funds of the National Natural Science Foundation of China (Grant No. U1865201), National Natural Science Foundation of China (Grant No. 51861125102), and Innovation Team in Key Field of the Ministry of Science and Technology (Grant No. 2018RA4014).

    CONFLICT OF INTEREST

    The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

    APPENDIX A

    A.1 Hydrological modeling

    A.1.1. Shuibuya reservoir

    The VIC model was established at a daily time scale and 0.25° spatial scale. To ensure a consistent time span of data for the calibration of VIC and LSTM models, data for the year 2014 were used to warm up the VIC model, while the data for 2015–2017 were used as the calibration period, and the remainder was used as the validation period. The simulated reservoir inflow and evaluation indices are presented in Figures A1 and A2.

    Details are in the caption following the image
    Variable infiltration capacity (VIC) simulations in calibration years (2015–2017) and evaluation metrics. FHV, high-segment flow; FLV, low-segment flow; NSE, Nash–Sutcliffe efficiency; PCC, Pearson's correlation coefficient
    Details are in the caption following the image
    Variable infiltration capacity (VIC) simulations in validation years (2018–2019) and evaluation metrics. FHV, high-segment flow; FLV, low-segment flow; NSE, Nash–Sutcliffe efficiency; PCC, Pearson's correlation coefficient

    A.1.2. Zuojiang reservoir

    The daily XAJ model was calibrated in the period from 2013 to 2018 (including a 60 days warmup period) and validated in 2019–2020. Figures A3 and A4 present the observed and simulated inflow. The XAJ model performs well in both periods, with the NSE value close to 0.9. However, the rain gauges cover only the area in China, making the inflow simulations show significant error during some periods. For instance, the floods in 2015 and 2018.

    Details are in the caption following the image
    Xinanjiang (XAJ) simulations in calibration years (2013–2018) and evaluation metrics. FHV, high-segment flow; FLV, low-segment flow; NSE, Nash–Sutcliffe efficiency; PCC, Pearson's correlation coefficient
    Details are in the caption following the image
    Xinanjiang (XAJ) simulations in validation years (2019–2020) and evaluation metrics. FHV, high-segment flow; FLV, low-segment flow; NSE, Nash–Sutcliffe efficiency; PCC, Pearson's correlation coefficient

    APPENDIX B

    B.1 NWP performance evaluation

    The rainfall forecasts are corrected by using LS and LSTM methods. A distributed hydrological model is established for Shuibuya reservoir; the LS method corrects the bias in rainfall forecast grid by grid, while the LSTM method corrects the gridded rainfall forecasts synchronously. Table B1 shows the metrics of the raw and bias-corrected rainfall forecasts for the Shuibuya and Zuojiang reservoirs. The evaluation was performed at the catchment scale. For both reservoirs, preprocessing techniques can effectively correct the bias in the rainfall forecasts, giving lower RMSE values. The LSTM-based method also gives some improvements in terms of PCC by considering non-linear relationships within input and output data.

    TABLE B1. Metrics of the raw and preprocessed rainfall for 1–5 days ahead
    Reservoir Methods Metrics t + 1 t + 2 t + 3 t + 4 t + 5
    Shuibuya Raw RMSE 5.32 7.18 8.35 9.19 8.75
    PCC 0.78 0.69 0.62 0.51 0.48
    LS RMSE 4.26 5.11 5.76 6.36 6.38
    PCC 0.78 0.69 0.62 0.51 0.48
    LSTM RMSE 4.03 4.66 5.16 5.56 5.75
    PCC 0.79 0.71 0.62 0.53 0.48
    Zuojiang Raw RMSE 5.72 6.07 6.23 6.59 6.70
    PCC 0.74 0.70 0.67 0.62 0.60
    LS RMSE 5.59 5.96 6.23 6.58 6.70
    PCC 0.74 0.70 0.67 0.62 0.60
    LSTM RMSE 5.44 5.80 6.19 6.48 6.54
    PCC 0.76 0.73 0.68 0.63 0.63
    • Abbreviations: LS, linear scaling; LSTM, long short-term memory; PCC, Pearson's correlation coefficient; RMSE, root mean square error.

    DATA AVAILABILITY STATEMENT

    (1) Real-time numerical weather prediction data for the Shuibuya reservoir were obtained from NCEP (https://rda.ucar.edu/datasets/ds084.1/). Other rainfall data are collected from the operators of the dams, with agreements requirement for use, and are not accessible to the public or research community directly. (2) Daily maximum/minimum temperatures and ET data were obtained from the National Meteorological Information Center (http://data.cma.cn). (3) All models or codes that support the findings of this study are available from the corresponding author upon reasonable request.

      The full text of this article hosted at iucr.org is unavailable due to technical difficulties.