Traffic Merging Model for Intelligent Vehicles Based on Deep Deterministic Policy Gradient

Cited by: 4

Authors: WU Sifan; DU Yu; XU Shijie; YANG Shuo; DU Chen

Affiliations: [1] Smart City College, Beijing Union University, Beijing 100101, China [2] College of Robotics, Beijing Union University, Beijing 100101, China [3] Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101, China

Source: Computer Engineering (《计算机工程》), 2020, Issue 1, pp. 87-92 (6 pages)

Funding: National Natural Science Foundation of China (91420202)

Abstract: Traffic merging models for intelligent vehicles that describe speed changes with a discrete action space cannot meet the requirements of real merging scenarios. Deep Deterministic Policy Gradient (DDPG) combines policy gradient with function approximation, adopts the same network structure as Deep Q-Network (DQN), and describes the problem with a continuous action space, which makes it better suited to describing the speed changes of an intelligent vehicle. This paper therefore proposes a DDPG-based traffic merging model for intelligent vehicles that recasts merging as a sequential decision problem. Experimental results show that, compared with a DQN-based model, the proposed model converges faster and achieves higher stability and a higher success rate, making it better suited to scenarios in which an intelligent vehicle merges into traffic.
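The contrast between a discrete and a continuous action space can be illustrated with a minimal sketch. It is not the paper's model: the acceleration limit, the menu of discrete accelerations, the two-element state, and the linear actor are all hypothetical and chosen only for illustration. A DQN-style agent picks one acceleration from a fixed menu, while a DDPG-style deterministic actor maps the state directly to any acceleration within the limit.

```python
import math

# Hypothetical acceleration limit in m/s^2 (an assumption, not from the paper).
MAX_ACCEL = 3.0

# DQN-style: the agent can only choose from a fixed menu of accelerations.
DISCRETE_ACCELS = [-3.0, -1.5, 0.0, 1.5, 3.0]


def discrete_action(q_values):
    """Return the menu acceleration whose estimated Q-value is highest."""
    best = max(range(len(q_values)), key=lambda i: q_values[i])
    return DISCRETE_ACCELS[best]


def continuous_action(state, weights, bias=0.0):
    """DDPG-style deterministic actor (toy version).

    A single linear layer squashed by tanh and scaled so the output
    lies in [-MAX_ACCEL, MAX_ACCEL].
    """
    pre_activation = sum(w * s for w, s in zip(weights, state)) + bias
    return MAX_ACCEL * math.tanh(pre_activation)


# Illustrative state: normalised gap and relative speed (hypothetical features).
state = [0.2, -0.1]
a_discrete = discrete_action([0.1, 0.4, 0.3, 0.2, 0.0])
a_continuous = continuous_action(state, weights=[1.0, 1.0])
print(a_discrete, a_continuous)
```

With the discrete menu, a small correction such as 0.3 m/s^2 is not available, so the agent has to round to a neighbouring menu entry. The continuous actor can output such a value directly, which is the property the abstract cites as the reason for preferring DDPG.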

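Keywords: intelligent vehicle; traffic merging; Deep Deterministic Policy Gradient (DDPG); Deep Q-Network (DQN); continuous action space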

Classification code: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]
