iBelt: An interpretable cluster analysis method for event logs  Cited by: 2

Authors: LIU Wen (刘雯); WANG Guiling (王桂玲) [1,2]

Affiliations: [1] School of Information, North China University of Technology, Beijing 100144, China; [2] Beijing Municipal Key Laboratory on Integration and Analysis of Large-Scale Stream Data, North China University of Technology, Beijing 100144, China

Source: Computer Integrated Manufacturing Systems (《计算机集成制造系统》), 2022, No. 10, pp. 3175-3186 (12 pages)

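Funding: National Key Research and Development Program of China (2018YFB1402500); Key Program of the National Natural Science Foundation of China (61832004); International (Regional) Cooperation and Exchange Program of the National Natural Science Foundation of China (62061136006).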

Abstract: When process mining is performed on complex event logs, the event traces often need to be clustered to simplify the structure of the process. However, most current trace clustering methods produce results that lack interpretability, which greatly limits their practical application. To address this, an interpretable cluster analysis method for event logs, called iBelt, is proposed. A “process connection belt” is defined to describe the analysis results of an event log. Building on the idea of clustering trees, a Clustering through Boosting Decision Tree (CLBDT) model is designed, and an unsupervised feature selection method based on variance and discriminant feature analysis is adopted to improve the clustering quality and fitness of existing methods and to overcome the loss of interpretability of the process connection belt caused by high-dimensional data. Experimental results on public data show that the resulting process connection belts have concise, easy-to-understand interpretable rules and that the quality of the associated process models is improved.

Keywords: process mining; trace clustering; interpretable clustering; decision tree

Classification number: T391 [General Industrial Technology]

 
