基于强化学习的汇流瓶颈区可变限速策略研究被引量：16

Variable Speed Limit Control at Freeway Merge Bottlenecks Based on Reinforcement Learning

机构地区：[1]嘉兴学院,浙江嘉兴314211 [2]东南大学,南京210096 [3]加州大学,伯克利947201714

出　　处：《交通运输系统工程与信息》2015年第1期55-61,共7页Journal of Transportation Systems Engineering and Information Technology

基　　金：国家自然科学基金资助项目(51322810)

摘　　要：为提高高速公路汇流瓶颈区的通行效率,本文结合强化学习无需建立模型,具有智能学习的特点,对瓶颈区的可变限速策略进行了优化,首次提出了基于Q学习算法的可变限速控制策略.策略以最大化系统总流出车辆数为目标,通过遍历交通流状态集合,尝试不同限速值序列进行自适应学习.以真实路段交通流数据搭建了元胞传输模型仿真平台,通过将其与无控制和基于反馈控制的可变限速策略进行对比,对Q学习策略的控制效果进行评价.通行时间的降低和交通参数的变化表明,强化学习控制策略在提高汇流瓶颈区通行效率和改善交通流运行状况方面具有优越性.To improve the efficiency of freeway merge bottleneck, this paper optimizes the bottleneck variable speed limit strategy. Considering the characteristics of reinforcement learning that it is modelingfree and intelligent learning, a QL-VSL control strategy that integrates the Q-learning（QL） algorithm in the VSL control is proposed for the first time. The goal of the strategy is to maximize the outflow vehicle, it is adaptive learning through traversing traffic flow states and taking different speed limits. The cell transmission model（CTM） calibrated with the real traffic data is used for the simulation. The effectiveness of the proposed QL-VSL control strategy is evaluated with no VSL control and the feedback VSL control in the simulation. The travel time reduction and traffic parameter changes show that the proposed QL-VSL control strategy outperforms in improving the traffic efficiency and traffic operations at freeway merge bottlenecks.

关键词：智能交通可变限速强化学习高速公路汇流瓶颈区 Q学习算法

分类号：U491[交通运输工程—交通运输规划与管理]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于强化学习的汇流瓶颈区可变限速策略研究被引量：16

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于强化学习的汇流瓶颈区可变限速策略研究 被引量：16

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于强化学习的汇流瓶颈区可变限速策略研究被引量：16