Authors: Shengren Hou, Aihui Fu, Edgar Mauricio Salazar Duque, Peter Palensky, Qixin Chen, Pedro P. Vergara
Affiliations: [1] Intelligent Electrical Power Grids (IEPG) Group, Delft University of Technology, Delft 2628CD, The Netherlands; [2] Electrical Energy Systems (EES) Group, Eindhoven University of Technology, Eindhoven, The Netherlands; [3] State Key Laboratory of Power Systems, Department of Electrical Engineering, Tsinghua University, Beijing 100084, China
Source: Journal of Modern Power Systems and Clean Energy (现代电力系统与清洁能源学报(英文)), 2025, No. 1, pp. 300-311 (12 pages)
Funding: Part of the DATALESs project (project number 482.20.602), jointly financed by the Netherlands Organization for Scientific Research (NWO) and the National Natural Science Foundation of China (NSFC).
Abstract: The integration of distributed energy resources (DERs) has escalated the challenge of voltage magnitude regulation in distribution networks. Model-based approaches, which rely on complex sequential mathematical formulations, cannot meet the real-time demand. Deep reinforcement learning (DRL) offers an alternative by utilizing offline training with distribution network simulators and then executing online without computation. However, DRL algorithms fail to enforce voltage magnitude constraints during training and testing, potentially leading to serious operational violations. To tackle these challenges, we introduce a novel safe-guaranteed reinforcement learning algorithm, the DistFlow safe reinforcement learning (DF-SRL) algorithm, designed specifically for real-time voltage magnitude regulation in distribution networks. The DF-SRL algorithm incorporates a DistFlow linearization to construct an expert-knowledge-based safety layer. It then overlays this safety layer on top of the agent policy, recalibrating unsafe actions to safe domains through a quadratic programming formulation. Simulation results show that the DF-SRL algorithm consistently enforces voltage magnitude constraints during the training and real-time operation (test) phases while achieving faster convergence and higher performance, which sets it apart from (safe) DRL benchmark algorithms.
Keywords: Voltage regulation; distribution network; safe reinforcement learning; energy management
Classification: TM73 [Electrical Engineering: Power Systems and Automation]
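The safety-layer idea in the abstract (a linearized DistFlow model plus a quadratic program that moves an unsafe action to the nearest safe one) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and not the authors' implementation: the feeder is a simple chain, only active power is modelled, voltages are squared magnitudes in per unit under the standard LinDistFlow relation u = u0 + 2Rp, and the projection QP is solved with Dykstra's alternating-projection method rather than whatever solver the paper uses. The names `chain_sensitivity`, `lin_voltage`, and `safety_layer` are hypothetical.

```python
def chain_sensitivity(r):
    """R[j][k] = resistance shared by the substation paths to nodes j and k
    (chain feeder assumed: node j is fed through branches 0..j)."""
    prefix, total = [], 0.0
    for branch_r in r:
        total += branch_r
        prefix.append(total)
    n = len(r)
    return [[prefix[min(j, k)] for k in range(n)] for j in range(n)]


def lin_voltage(u0, R, p):
    """Squared voltage magnitudes under the linearized model u = u0 + 2 R p,
    where p holds net active-power injections (reactive power ignored here)."""
    return [u0 + 2.0 * sum(Rjk * pk for Rjk, pk in zip(row, p)) for row in R]


def project_halfspace(x, g, h):
    """Euclidean projection of x onto the half-space {y : g . y <= h}."""
    violation = sum(gi * xi for gi, xi in zip(g, x)) - h
    if violation <= 0.0:
        return list(x)
    gg = sum(gi * gi for gi in g)
    return [xi - violation * gi / gg for xi, gi in zip(x, g)]


def safety_layer(a_rl, p_base, r, u0=1.0, u_min=0.95 ** 2, u_max=1.05 ** 2,
                 iters=500):
    """Return the action closest to a_rl (in the Euclidean norm) whose
    linearized voltages stay inside [u_min, u_max].

    Solves  min ||a - a_rl||^2  s.t.  u_min <= u_base + 2 R a <= u_max
    with Dykstra's method; assumes the feasible set is non-empty.
    """
    R = chain_sensitivity(r)
    u_base = lin_voltage(u0, R, p_base)
    # Each voltage bound becomes one linear constraint g . a <= h.
    constraints = []
    for row, ub in zip(R, u_base):
        g = [2.0 * v for v in row]
        constraints.append((g, u_max - ub))                # upper bound
        constraints.append(([-v for v in g], ub - u_min))  # lower bound
    x = list(a_rl)
    corrections = [[0.0] * len(x) for _ in constraints]
    for _ in range(iters):
        for i, (g, h) in enumerate(constraints):
            y = [xi + ci for xi, ci in zip(x, corrections[i])]
            x = project_halfspace(y, g, h)
            corrections[i] = [yi - xi for yi, xi in zip(y, x)]
    return x


# Hypothetical 3-node feeder: loads of 0.5 p.u. per node, and an agent
# action that injects far too much power and would cause overvoltage.
r = [0.01, 0.01, 0.01]
p_base = [-0.5, -0.5, -0.5]
a_rl = [3.0, 3.0, 3.0]
a_safe = safety_layer(a_rl, p_base, r)
u_safe = lin_voltage(1.0, chain_sensitivity(r),
                     [b + a for b, a in zip(p_base, a_safe)])
```

In this sketch an action that already satisfies the linearized limits passes through unchanged, so the layer only intervenes when the agent's proposal would violate a bound. The guarantee is only as good as the linear model: voltages are checked against the LinDistFlow approximation, not a full power-flow solution.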