基于马尔可夫决策过程的出租车寻客路径优化  被引量:2

Route Optimization of Taxicab Based on Markov Decision Process

在线阅读下载全文

作  者:程琳[1] 张晨皓 于新莲 杜明洋 任姿蓉 CHENG Lin;ZHANG Chen-hao;YU Xin-lian;DU Ming-yang;REN Zi-rong(School of Transportation,Southeast University,Nanjing 211189,China)

机构地区:[1]东南大学交通学院,南京211189

出  处:《武汉理工大学学报》2022年第5期40-46,共7页Journal of Wuhan University of Technology

基  金:国家自然科学基金(52172318,52131203)。

摘  要:为提高出租车的长期收益,作者在路网网格化的基础上建立了基于马尔可夫决策过程的路径优化模型。将车辆当前所位于的网格位置定义为状态,将从当前网格选择某一相邻网格出发定义为动作,使用策略迭代法对问题进行求解,并采用高斯-赛德尔迭代进行加速。以深圳市典型工作日全天797辆出租车的GPS数据进行试算和仿真,计算速度是雅克比迭代的1.85倍。将该算法与随机游走和全局热点算法进行比较,结果表明,所提出模型的平均单位距离收益分别提高了22.1%和12.9%,载客里程占比分别提高了18.8%和10.4%,具有较好的优化效果。A dynamic route optimization model based on Markov decision process is formulated after gridding of network.The state is defined by the grid location of vehicle while the action is defined by the selection of an adjacent grid from the current location.The problem is solved by the policy iteration,Gauss Seidel iteration is used for acceleration.The model is tested by taking the GPS data of 797 taxicabs in a typical working day in Shenzhen,and the speed is 1.85 times that of Jacobi iteration.The model is compared with two heuristic algorithms by simulation,showing that the average profit per unit distance of the optimal route strategy is 22.1% and 12.9% higher than that of random walk and global hotspot algorithm,while the proportion of passenger carrying distance is 18.8% and 10.4% respectively, which has a good effect on route optimization.

关 键 词:城市交通 动态路径优化 马尔可夫决策过程 出租车 策略迭代 数据挖掘 

分 类 号:U121[交通运输工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象