Optimal control of network loss and voltage in DC distribution networks based on the proximal policy optimization algorithm


Authors: ZHANG Feng [1]; ZHANG Peichao; YANG Hao [1]

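Affiliations: [1] Key Laboratory of Modern Power System Simulation and Control & Renewable Energy Technology, Ministry of Education (Northeast Electric Power University), Jilin 132012, China; [2] Shandong Electric Power Engineering Consulting Institute Co., Ltd., Jinan 250013, China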

Source: Electrotechnical Application (《电气应用》), 2025, No. 4, pp. 75-84 (10 pages)

Funding: General Program of the Natural Science Foundation of Jilin Province (20240101108JC).

Abstract: Under fluctuating distributed generation output, traditional control strategies for DC distribution networks suffer from frequent voltage limit violations and increased network losses. To address this, a network loss/voltage optimization control strategy for DC distribution networks is proposed, based on the reinforcement learning algorithm Proximal Policy Optimization (PPO). A Markov Decision Process (MDP) model oriented to voltage and network loss optimization is constructed, and the state space and action space of the control problem are defined. The reward function is designed in partitions according to the voltage deviation and network loss optimization objectives, so that the policy optimizes network losses while satisfying the voltage constraints. The PPO algorithm is then used for policy optimization training to obtain the optimal control strategy. A modified IEEE 16-bus DC distribution network serves as the test case, with the proposed method programmed and simulated in DIgSILENT/PowerFactory and Python. The results show that, when distributed generation fluctuates, the proposed method optimizes system network losses and improves the level of secure voltage operation.
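The abstract does not give the reward formula or the training details, so the following is only a minimal sketch of how a partitioned reward and the PPO update term might look. The voltage limits, the weights, and the function names `partitioned_reward` and `ppo_clipped_term` are all assumptions introduced for illustration, not values or names from the paper. The clipped surrogate itself is the standard PPO objective.

```python
from typing import Sequence

# Illustrative assumptions only; none of these values come from the paper.
V_MIN, V_MAX = 0.95, 1.05   # assumed per-unit voltage limits
VIOLATION_WEIGHT = 10.0     # assumed penalty weight on limit violations
LOSS_WEIGHT = 1.0           # assumed weight on network loss


def partitioned_reward(voltages_pu: Sequence[float], loss_mw: float) -> float:
    """Hypothetical two-partition reward on bus voltages and network loss."""
    # Total distance by which bus voltages fall outside [V_MIN, V_MAX].
    violation = sum(
        max(V_MIN - v, 0.0) + max(v - V_MAX, 0.0) for v in voltages_pu
    )
    loss_term = -LOSS_WEIGHT * loss_mw
    if violation > 0.0:
        # Partition 1: a limit is violated, so add a penalty on top of the
        # loss term; a violating state never beats the same loss without one.
        return loss_term - VIOLATION_WEIGHT * violation
    # Partition 2: all voltages are within limits; only the loss is penalized.
    return loss_term


def ppo_clipped_term(ratio: float, advantage: float, eps: float = 0.2) -> float:
    """Per-sample PPO clipped surrogate: min(r * A, clip(r, 1-eps, 1+eps) * A)."""
    clipped_ratio = min(max(ratio, 1.0 - eps), 1.0 + eps)
    return min(ratio * advantage, clipped_ratio * advantage)
```

For example, `partitioned_reward([1.0, 1.02], 0.5)` falls in the second partition and returns `-0.5`. In a full training loop, the mean of `ppo_clipped_term` over a batch would be maximized with respect to the policy parameters; here the probability ratio and advantage are plain floats to keep the sketch dependency-free.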

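Keywords: DC distribution network; voltage control; network loss optimization; proximal policy optimization; deep reinforcement learning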

Classification: TM721 [Electrical Engineering: Power Systems and Automation]

 
