检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:郦芳菲 王海龙[2] 陆子雄 王忠 LI Fangfei;WANG Hailong;LU Zixiong;WANG Zhong(Yangzhong Power Supply Company,Yangzhong 212200,Jiangsu Province,China;NARI Technology Co.Ltd.,Nanjing 211106,Jiangsu Province,China)
机构地区:[1]国网江苏省电力有限公司扬中市供电分公司,江苏省扬中市212200 [2]国电南瑞科技股份有限公司,江苏省南京市211106
出 处:《现代电力》2023年第4期577-586,共10页Modern Electric Power
基 金:国家重点研发计划资助项目(2018YFB0905000)。
摘 要:针对交直流混合微电网优化调度中的不确定性建模难和复杂系统难以高效求解等问题,提出了一种通过人工策略引导提高智能体学习效率的人工辅助深度强化学习算法。首先,结合并网状态下混合微电网的需求侧响应特征,构建了最小化成本的优化调度模型。基于马尔科夫决策流程对优化调度过程进行建模,并根据优化调度模型设计奖励函数。然后,采用人工辅助的深度确定性策略梯度算法求解模型,通过智能体和环境的持续交互,不断更新神经网络参数进而得到最优决策。最后通过算例仿真验证了所提算法能有效提高智能体的学习效率,在减少模型训练时间的同时,有效降子系统的运行成本。In allusion to such troubles as difficulty of uncertainty modeling and difficult to solve complex system efficiently in optimal dispatching of AC/DC hybrid microgrid,an artificial assisted deep reinforcement learning algorithm,which could improve the learning efficiency of intelligent agent through artificial strategy guidance,was proposed.Firstly,combining with the characteristic of demand side response of hybrid microgrid under grid-connected state a cost-minimized optimal dispatching model was constructed.Based on Markov decision process the modeling of optimal dispatching process was conducted and based on optimal dispatching model the reward function was designed.Secondly,the designed model was solved by artificially assisted deep deterministic policy gradient algorithm,and by means of continuous interaction between intelligent agent and environment the parameter of neural network was continually updated and then the optimal decision was obtained.Finally,it was verified by computing example that using the proposed algorithm the learning efficiency of intelligent agent could be effectively improved and while the training time of the model was decreased the operating cost of the subsystem could be effectively reduced.
关 键 词:交直流混合微电网 分布式电源 深度确定性策略梯度法 优化调度 人工辅助训练
分 类 号:TM73[电气工程—电力系统及自动化]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.224.96.135