检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:李万林 王超[1,2] 许国良 雒江涛 张轩 LI Wanlin;WANG Chao;XU Guoliang;LUO Jiangtao;ZHANG Xuan(Institute of Communication and Information Engineering,Chongqing University of Posts and Telecommunications,Chongqing 400065,China;Electronic Information and Networking Research Institute,Chongqing University of Posts and Telecommunications,Chongqing 400065,China)
机构地区:[1]重庆邮电大学通信与信息工程学院,重庆400065 [2]重庆邮电大学电子信息与网络工程研究院,重庆400065
出 处:《电子与信息学报》2020年第12期3013-3020,共8页Journal of Electronics & Information Technology
基 金:重庆市自然科学基金(cstc2018jcyjAX0587);新型感知技术、信息融合处理及其应用(A2017-10)。
摘 要:针对密度聚类算法只能识别密度相近的簇类且计算复杂度高等问题,该文提出一种基于信令数据中时空轨迹信息的密度峰值快速聚类(ST-CFSFDP)算法。首先对低采样密度的信令数据进行预处理,消除轨迹震荡现象;然后基于密度峰值快速聚类(CFSFDP)算法显式地增加时间维度限制,将局部密度由2维扩展到3维,并提出高密度时间间隔以表征簇中心在时间维度上的数据特征;接着设计筛选策略以选取聚类中心;最后识别用户出行轨迹中的驻留点,完成出行链的划分。实验结果表明,所提算法适用于采样密度低且定位精度差的信令数据,相比CFSFDP算法更适用于时空数据,相比基于密度的时空聚类算法(ST-DBSCAN)召回率提升14%,准确率提升8%,同时降低计算复杂度。For the problem that the density-based clustering algorithm can only identify clusters with similar density and high computational complexity,a Clustering by Fast Search and Find of Density Peaks based on Spatio-Temporal trajectory information in mobile phone signaling data,namely ST-CFSFDP,is proposed.Firstly,the low sampling density signaling data are pre-processed to eliminate the trajectory oscillation phenomenon in the data.Then,based on the Clustering by Fast Search and Find of Density Peaks(CFSFDP)algorithm,the time dimension limitation is explicitly increased,and the local density is extended from twodimension to three-dimension.Moreover,in order to characterize the cluster center point in the time dimension,the concept of high-density time interval is defined.Secondly,the suitable cluster center screening strategy is developed to select automatically the appropriate cluster center.Finally,the resident points are identified in the travel trajectory of individual users over a period of time and the division of the travel chains is completed.The experimental results show that the algorithm is suitable for signaling data with low sampling density and poor positioning accuracy.It is more suitable for spatio-temporal data than CFSFDP algorithm.Compared with Density-Based Spatial Clustering of Applications with Noise based on Spatio-Temporal data(STDBSCAN)algorithm,the recall rate is improved by 14%,the accuracy rate is increased by 8%,and the computational complexity is also reduced.
关 键 词:信令数据 时空聚类 密度峰值快速聚类算法 驻留点识别 出行链
分 类 号:TN929.5[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.170