基于强化学习的沥青路面长期性能养护决策方法被引量：2

Long-term performance maintenance decisions for asphalt pavements based on reinforcement learning

作　　者：侯明业王晓阳徐青杰杨博王笑风 HOU Mingye;WANG Xiaoyang;XU Qingjie;YANG Bo;WANG Xiaofeng(Henan Communications Planning&Design Institute Co.,Ltd.,Zhengzhou 450000 China)

机构地区：[1]河南省交通规划设计研究院股份有限公司,河南郑州450000

出　　处：《山东科学》2023年第3期108-114,共7页Shandong Science

基　　金：河南省交通运输厅科技项目(2021T2;2021T8;2021G3)。

摘　　要：针对道路长期性能养护决策中庞大的数据分析问题,将深度确定性策略梯度(deep deterministic policy gradient,DDPG)强化学习模型引入到了养护决策分析中,将道路性能的提升及养护资金的有效利用作为机器学习的奖励目标,建立了一套科学有效的沥青路面长期性能养护决策方法,经过与DQN(deep Q-learning network)算法和Q-Learning算法进行对比,DDPG算法所需要的采样数据更少、收敛速度更快,表现更为优异,可有效提升道路服役性能的评估效率,对沥青路面多目标长期养护决策方案的制定起着重要的推动作用。To address the huge data analysis problem in the decision-making for long-term road performance maintenance,this paper introduces the deep deterministic policy gradient(DDPG)reinforcement learning model in the maintenance decision analysis.A set of scientific and effective decision-making methods for long-term performance maintenance of asphalt pavements has been established through machine learning.These methods can improve road performance and make effective use of maintenance funds.Compared with the deep Q-learning network and Q-Learning algorithms,the DDPG algorithm requires less sampling data,converges faster,performs better,and can effectively improve the evaluation efficiency of the road service performance.Therefore,the proposed model plays an important role in the development of multi-objective maintenance decision-making for asphalt pavements.

关键词：交通工程沥青路面养护决策强化学习深度确定性策略梯度模型

分类号：U411[交通运输工程—道路与铁道工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于强化学习的沥青路面长期性能养护决策方法被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于强化学习的沥青路面长期性能养护决策方法 被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于强化学习的沥青路面长期性能养护决策方法被引量：2