基于强化学习环境设计策略的电动汽车充电路径规划被引量：2

Charging Path Planning for Electric Vehicles Based on Reinforcement Learning Environment Design Strategy

作　　者：宋宇航陈宇帆魏延岭高山[3] SONG Yuhang;CHEN Yufan;WEI Yanling;GAO Shan(School of Automation,Southeast University,Nanjing 210096,China;Key Laboratory of Measurement and Control of Complex System of Engineering,Southeast University,Nanjing 210096,China;School of Electrical Engineering,Southeast University,Nanjing 210096,China)

机构地区：[1]东南大学自动化学院,江苏省南京市210096 [2]东南大学复杂工程系统测量与控制教育部重点实验室,江苏省南京市210096 [3]东南大学电气工程学院,江苏省南京市210096

出　　处：《电力系统自动化》2024年第11期184-196,共13页Automation of Electric Power Systems

基　　金：国家重点研发计划资助项目(2021YFB2501600)。

摘　　要：针对电动汽车充电路径规划问题,提出了一种适用于强化学习的环境建模方法。该方法基于城市道路网格与充电站地理位置分布等现实情况,将电动汽车的基本行驶路径分为三段进行表达。在三段式表达方法的基础上,提出了状态空间、动作空间、状态转移与奖励函数的设计方案,将充电路径规划建模为马尔可夫决策过程,并利用Q学习方法与深度Q网络(DQN)方法求解。实验结果表明,基于三段式表达法的强化学习环境设计方案具有可解性与可迁移性,考虑了电动汽车从道路驶向充电站过程中的降速转弯等现实场景,同时将充电动作简化为一种行驶方向选择,提升了基于Q学习与DQN的强化学习算法效率。An environmental modeling method suitable for reinforcement learning is proposed for the charging path planning problem of electric vehicles.Based on the actual situation of urban road network and geographical distribution of charging stations,this method divides the basic driving path of electric vehicles into three segments for representation.Based on the three-segment expression method,the design scheme of state space,action space,state transition,and reward function is proposed.The charging path planning is modeled as a Markov decision process,and solved by the Q learning method and the deep Q network(DQN)method.The experimental results show that the design scheme of the reinforcement learning environment based on the threesegment expression method is solvable and portable.It takes into account the realistic scenarios such as the deceleration and turning of electric vehicles in the process of driving from the road to the charging station,and simplifies the charging action into a driving direction choice,which improves the efficiency of the reinforcement learning algorithm based on Q learning and DQN.

关键词：电动汽车充电路径规划强化学习深度Q网络环境建模三段式表达法

分类号：U46[机械工程—车辆工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于强化学习环境设计策略的电动汽车充电路径规划被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于强化学习环境设计策略的电动汽车充电路径规划 被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于强化学习环境设计策略的电动汽车充电路径规划被引量：2