Authors: SONG Yuhang (宋宇航); CHEN Yufan (陈宇帆); WEI Yanling (魏延岭); GAO Shan (高山) [3]
Affiliations: [1] School of Automation, Southeast University, Nanjing 210096, Jiangsu, China; [2] Key Laboratory of Measurement and Control of Complex System of Engineering (Ministry of Education), Southeast University, Nanjing 210096, Jiangsu, China; [3] School of Electrical Engineering, Southeast University, Nanjing 210096, Jiangsu, China
Source: Automation of Electric Power Systems (《电力系统自动化》), 2024, No. 11, pp. 184-196 (13 pages)
Funding: National Key R&D Program of China (2021YFB2501600).
Abstract: An environment modeling method suited to reinforcement learning is proposed for the electric vehicle charging path planning problem. Based on real-world conditions such as the urban road grid and the geographical distribution of charging stations, the method represents the basic driving path of an electric vehicle in three segments. Building on this three-segment representation, design schemes for the state space, action space, state transition, and reward function are proposed; charging path planning is modeled as a Markov decision process and solved with the Q-learning method and the deep Q-network (DQN) method. Experimental results show that the reinforcement learning environment design based on the three-segment representation is solvable and transferable. It accounts for realistic scenarios such as an electric vehicle decelerating and turning as it leaves the road for a charging station, and it simplifies the charging action into a choice of driving direction, which improves the efficiency of the reinforcement learning algorithms based on Q-learning and DQN.
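The abstract names Q-learning on a Markov decision process but this record gives no implementation details. The following is therefore only a generic tabular Q-learning sketch on a hypothetical 4x4 road grid with one charging station. The grid size, reward values, hyperparameters, and all function names are assumptions made for illustration; it does not reproduce the paper's three-segment environment or its DQN variant.

```python
import random

# Hypothetical toy setup (not from the paper): a 4x4 road grid, one charger.
GRID = 4
START, CHARGER = (0, 0), (3, 3)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # north, south, west, east


def step(state, action):
    """Deterministic transition: move one cell, staying put at the grid edge."""
    row = min(max(state[0] + action[0], 0), GRID - 1)
    col = min(max(state[1] + action[1], 0), GRID - 1)
    nxt = (row, col)
    # Assumed reward: -1 per move, +10 for arriving at the charging station.
    reward = 10.0 if nxt == CHARGER else -1.0
    return nxt, reward, nxt == CHARGER


def train(episodes=2000, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(GRID) for c in range(GRID)}
    for _ in range(episodes):
        state = START
        for _ in range(100):  # cap the episode length
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))  # explore
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[state][i])  # exploit
            nxt, reward, done = step(state, ACTIONS[a])
            # Q-learning target: bootstrap from the best action in the next state.
            target = reward + (0.0 if done else gamma * max(q[nxt]))
            q[state][a] += alpha * (target - q[state][a])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=50):
    """Follow the learned greedy policy from START and record visited cells."""
    state, path = START, [START]
    for _ in range(max_steps):
        a = max(range(len(ACTIONS)), key=lambda i: q[state][i])
        state, _, done = step(state, ACTIONS[a])
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    print(greedy_path(train()))
```

In this toy version, reaching the charger is simply the terminal state of a movement action, which loosely mirrors the abstract's idea of treating charging as a driving direction choice rather than a separate action type.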