Authors: TANG Xiaolin [1]; GAN Jiongpeng; ZHANG Zhenguo
Affiliations: [1] College of Mechanical and Vehicle Engineering, Chongqing University, Chongqing 400044; [2] School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200024
Source: Journal of Mechanical Engineering (《机械工程学报》), 2025, No. 2, pp. 236-246 (11 pages)
Funding: Supported by the National Natural Science Foundation of China (52222215, 52072051) and the Chongqing Outstanding Youth Fund (2023NSCQJQX0009).
Abstract: To explore the application of multi-agent deep reinforcement learning (DRL) to multi-objective cooperative control of hybrid electric vehicles, a cooperative energy management strategy for hybrid electric vehicle platoons is proposed, based on the multi-agent deep deterministic policy gradient (MADDPG) algorithm. First, traffic simulation software is used to build a coupled lateral and longitudinal car-following scenario that simulates an Internet of Vehicles environment and provides accurate vehicle information. Second, a coupled lateral and longitudinal car-following strategy based on rules and grid search, covering lateral lane changing and longitudinal car following, is designed to achieve higher traffic efficiency. Finally, the MADDPG algorithm is used to design an adaptive cooperative energy management strategy that maximizes the overall benefit of the platoon; random initial positions of the traffic flow are used to generate random platoon power-demand sequences, which increases the randomness of training and improves the strategy's adaptability to different driving conditions. The results show that the multi-agent platoon cooperative energy management strategy achieves a better overall optimization effect than the single-agent strategy, and that its adaptability to driving conditions improves to a certain extent after training under random driving conditions.
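Keywords: coupled lateral and longitudinal car following; multi-agent deep reinforcement learning; hybrid electric vehicle platoon; cooperative energy management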