检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:郭奕杉 刘漫丹[1] GUO Yi-shan;LIU Man-dan(School of Information Science and Engineering,East China University of Science and Technology,Shanghai 200237,China)
机构地区:[1]华东理工大学信息科学与工程学院,上海200237
出 处:《计算机科学》2021年第S01期213-219,共7页Computer Science
摘 要:伴随着智能设备的普及和无线通信技术的发展,用户在使用无线网络满足各种需求时,无线网络也记录下了用户上网留下的大量时空轨迹数据。针对时空轨迹数据的异常检测已经成为数据挖掘领域一个新的研究热点。为了更好地关注学生健康发展,促进校园信息化建设,以真实校园上网数据为例,提出了一种基于多尺度阈值和密度相结合的谱聚类算法(Spectral Clustering Algorithm Based on The Combination of Multi-Scale Threshold And Density,MSTD-SC),使用基于最短时间距离子序列(Shortest Time Distance-Shortest Time Distance Subsequences,STD-STDSS)的亲和距离函数来构造初始相似度矩阵,进一步引入协方差尺度阈值和空间尺度阈值对相似度矩阵进行0-1化处理,以此得到更精确的样本相似度,接着对相似度矩阵进行特征值分解,得到新的特征向量空间,最后采用DBSCAN聚类避免了K-means算法需要人工确定聚类数目的缺陷。利用轮廓系数评估多种算法得到的实验结果,MSTD-SC算法体现出了更好的聚类性能。将其应用于用户个体的异常检测中,异常用户名单被验证是有效可信的。With the popularization of smart devices and the development of wireless communication technology,when users use wireless networks to meet various needs,wireless networks also record a large number of users’spatial-temporal trajectory data.Anomaly detection for spatial-temporal trajectory data becomes a new research hotspot in the field of data mining.In order to better pay attention to the healthy development of students and promote the informatization construction of campus,a spectral clustering algorithm based on the combination of multi-scale threshold and density(MSTD-SC)is proposed,taking the real internet usage data of campus as an example.Firstly,it uses the affinity distance function based on the shortest time distance-shortest time distance subsequences(STD-STDSS)to construct the initial adjacency matrix.Then it introduces the covariance scale eigenvector space by threshold and spatial scale eigenvector space by threshold to perform 0-1 processing on the adjacency matrix to obtain more accurate sample similarity.Next,comstructing a eigenvalue decomposition of the adjacency matrix.Finally,it uses DBSCAN clustering algorithm to avoid to manually determine the number of clusters.Using Silhouette Index to evaluate the experimental results obtained by multiple algorithms,MSTD-SC algorithm reflects better clustering performance.Applying it to individual user anomaly detection,the abnormal user list is verified to be effective and credible.
关 键 词:时空轨迹数据 校园无线网络 相似度 异常检测 谱聚类
分 类 号:TP393[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.171