机构地区:[1]华北水利水电大学水利学院,郑州450045 [2]中国水利水电科学研究院流域水循环模拟与调控国家重点实验室,北京100048 [3]国家节水灌溉北京工程技术研究中心,北京100048 [4]安徽省淠史杭灌区管理总局,六安237005
出 处:《农业工程学报》2023年第13期113-122,共10页Transactions of the Chinese Society of Agricultural Engineering
基 金:十四五国家重点研发计划课题(2022YFD1900504);中国水利水电科学研究院技术创新团队项目(ID145B022021);河南省高等学校青年骨干教师培养计划项目(2020GGJS100)。
摘 要:渠道泄水闸能够快速排除灌区入渠洪水,避免渠道漫顶。研究以淠史杭灌区灌口集泄水闸为例,以闸门调度流量为目标变量,以不同时段过去和未来降雨量、泄水闸闸上实时水位及其变化量为特征变量,比较8种机器学习算法的预测精度,同时采用shapley additive explanations(SHAP)法分析特征变量重要性。结果表明:1)集成学习算法预测评价指标优于传统回归算法,8种机器学习算法中随机森林回归(random forest regression,RFR)算法预测精度最高(训练集均方根误差、平均绝对误差、均方误差及决定系数分别为0.146 m^(3)/s、0.094 m^(3)/s、0.021 m^(3)/s、0.976;测试集分别为0.306 m^(3)/s、0.197 m^(3)/s、0.093 m^(3)/s、0.931);2)采用SHAP法确定的特征变量重要性排序表明灌口集泄水闸闸上水位对于泄水闸调度流量的预测结果影响最大,占特征重要性值总和的34.6%;3)以过去6 h降雨量、过去9 h降雨量、未来6 h降雨量、灌口集泄水闸闸上水位作为输入变量的RFR算法预测灌口集泄水闸调度流量效果最佳,训练集均方根误差、平均绝对误差、均方误差及决定系数分别为0.126 m^(3)/s、0.080 m^(3)/s、0.016 m^(3)/s、0.982;测试集分别为0.263 m^(3)/s、0.164 m^(3)/s、0.069 m^(3)/s、0.950,研究结果对灌区防洪调度决策具有重要参考价值。The channel sluice can quickly remove the flood into the canal in the irrigation area.In order to provide a simple and efficient method for flood control scheduling decision of drainage sluice in irrigation area,this study took Pishihang Irrigation District as an example to establish a prediction model with dispatched flow as the target variable and 10 characteristic variables as independent variables.The 10 variables were the water level and rainfall of drainage sluice at irrigation mouth:rainfall in the past 1 hour,2 hour,3 hour,6 hour,and 9 hour and rainfall in the future 1 hour,3 hour and 6 hour,water level on the gates of the Guan Kouji drainage gate,difference in water level at the gate in the past half hour.The prediction accuracy of 8 machine learning algorithms was compared to pick the best algorithm.The Shapley Additive exPlanations(SHAP)method was used to analyze the importance of 10 groups of variables,and the influence weights of different variables on the prediction results were obtained.By comparing the prediction error indicators of the optimal algorithm under different variable combinations,the optimal variable combinations were selected,and the accuracy of the algorithm was further optimized to determine the final scheduling flow decision model.The results showed that:1)The integrated learning algorithm was better than the traditional regression algorithm in predicting the evaluation index.The order of prediction accuracy of ensemble learning algorithms was as follows:random forest regression(RFR)>extrme gradient boosting regression(XGR)>adapative bossting regression(ABR)>spoort vector regression(SVR),and Bagging had the highest accuracy in the three categories of ensemble learning algorithms.RFR had the highest prediction accuracy among the 8 machine learning algorithms(the root mean square error,mean absolute error,mean square error and determination coefficient of the training set were 0.146,0.094,0.021 m^(3)/s and 0.976,respectively.The root-mean-square error,mean absolute error and mean squ
关 键 词:灌溉 随机森林 机器学习 调度流量 集成学习 SHAP
分 类 号:TV122[水利工程—水文学及水资源]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...