检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:梁涛 柴露露 谭建鑫[2] 井延伟 吕梁年 LIANG Tao;CHAI Lulu;TAN Jianxin;JING Yanwei;LÜLiangnian(School of Artificial Intelligence and Data Science,Hebei University of Technology,Tianjin 300401,China;Hebei Jiantou New Energy Co.,Ltd.,Shijiazhuang 050011,China;Goldwind Science&Technology Co.,Ltd.,Beijing 102600,China)
机构地区:[1]河北工业大学人工智能与数据科学学院,天津300401 [2]河北建投新能源有限公司,河北石家庄050011 [3]金风科技股份有限公司,北京102600
出 处:《电力自动化设备》2025年第1期59-66,共8页Electric Power Automation Equipment
基 金:国家自然科学基金资助项目(2023YFB3407703);河北省科技支撑计划项目(F2021202022)。
摘 要:为了促进氢能与综合能源系统中其他能源的耦合,提高能源利用灵活性,减少系统碳排放,提出了一种氢耦合电-热综合能源系统(HCEH-IES)的运行优化方法。对HCEH-IES的各设备进行数学建模,并深入阐述深度强化学习算法的基本原理及双延迟深度确定性策略梯度(TD3)算法的流程;将HCEH-IES的不确定性优化调度问题转化为马尔可夫决策过程,并采用TD3算法将优化目标以及约束条件转换为奖励函数进行连续状态空间和动作空间下的动态调度决策,形成合理的能源分配管理方案;采用历史数据对智能体进行训练,并对比深度Q学习网络和深度确定性策略梯度算法获得的调度策略。结果表明,相较于深度Q学习网络和深度确定性策略梯度算法,基于TD3算法的调度策略具有更好的经济性,其结果更接近于CPLEX日前优化调度方法的经济成本且更适用于解决综合能源系统动态优化调度问题,有效地实现了能源灵活利用,提高了综合能源系统的经济性和低碳性。In order to promote the coupling of hydrogen energy with other energy sources in the integrated energy system,improve the flexibility of energy utilization and reduce the carbon emission of the system,an operation optimization method of hydrogen coupled electrothermal integrated energy system(HCEH-IES)is proposed.The mathematical model of each device in the HCEH-IES is established,and the basic principle of deep reinforcement learning algorithm and the process of twin delayed deep deterministic policy gradient(TD3)algorithm are described in detail.The uncertain optimal scheduling problem of HCEH-IES is transformed into Markov decision process,and the TD3 algorithm is used to convert optimization objective and constraints into reward functions for dynamic scheduling decision-making in continuous state space and action space,then a reasonable energy distribution management scheme is formed.The agents are trained with historical data,and the scheduling strategies obtained by deep Q learning network and deep deterministic policy gradient algorithm are compared.The results show that,compared with the deep Q learning network and the deep deterministic policy gradient algorithm,the scheduling strategy based on TD3 algorithm is more economic,and its results are closer to the economic cost of the CPLEX-based day-ahead optimal scheduling method,and it is more suitable to solve the dynamic optimal scheduling problem of the integrated energy system,which effectively realizes the flexible utilization of energy and improves the economy and low-carbon performance of the integrated energy system.
关 键 词:氢耦合电-热综合能源系统 可再生能源 深度强化学习 双延迟深度确定性策略梯度 能量优化管理 马尔可夫决策过程
分 类 号:TM73[电气工程—电力系统及自动化] TK01[动力工程及工程热物理] TK91
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.31