基于模糊邻域熵的多粒度离群点检测方法  被引量:1

Fuzzy Neighborhood Entropy-Based Multi-granularity Outlier Detection

在线阅读下载全文

作  者:汪贝琪 周杰 高灿[1,2] WANG Bei-qi;ZHOU Jie;GAO Can(College of Computer Science and Software Engineering,Shenzhen University,Shenzhen 518060,China;Key Laboratory of Intelligent Information Processing(Shenzhen University),Guangdong Province,Shenzhen 518060,China)

机构地区:[1]深圳大学计算机与软件学院,广东深圳518060 [2]广东省智能信息处理重点实验室(深圳大学),广东深圳518060

出  处:《模糊系统与数学》2022年第6期102-113,共12页Fuzzy Systems and Mathematics

基  金:国家自然科学基金资助项目(61806127,62076164)。

摘  要:离群点检测是数据挖掘和机器学习领域重要的研究方向之一,其目的是识别与其他样本表现显著不同的样本。本文提出了一种基于模糊邻域熵的多粒度离群点检测方法。首先,将模糊相似性引入邻域熵和相对熵,提出模糊邻域熵和相对模糊邻域熵的不确定性度量。其次,分析了模糊邻域熵和相对模糊邻域熵在逻辑和几何上的差异特性。最后,结合理想解法(TOPSIS)和多粒度序列提出了新的样本离群程度评判标准TFMME-OF(TOPSIS and Fuzzy Multigranulation Mixed Entropy-based Outlier Factor)。实验结果表明,该方法相较于其它同类方法有更好的离群点检测效果。Outlier detection is one of the important research directions in the field of data mining and machine learning. It aims to identify samples that are significantly different from other samples. In this paper, a multigranularity outlier detection method based on fuzzy neighborhood entropy is proposed. Firstly, fuzzy similarity is introduced into neighborhood entropy and relative entropy, and the uncertainty measures of fuzzy neighborhood entropy and fuzzy relative entropy are proposed. Secondly, the characteristics of fuzzy neighborhood entropy and fuzzy relative entropy in logical and geometric representation are analyzed. Finally, a new criterion TFMME-OF(TOPSIS and fuzzy multigranulation mixed entropy-based outlier factor) is proposed by combining TOPSIS and multigranularity sequence. Experimental results show that the proposed method achieves better outlier detection results in comparison with other representative methods.

关 键 词:离群点检测 邻域熵 模糊邻域熵 理想解法 多粒度序列 

分 类 号:O159[理学—数学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象