A Multi-Label Classifier Chain Algorithm Based on PageRank and Mutual Information


Authors: DING Jiaman [1,2]; LI Xinyu; JIA Lianyin; HU Shuang [1]; WANG Hongbin

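Affiliations: [1] Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, Yunnan, China; [2] Key Laboratory of Artificial Intelligence in Yunnan Province, Kunming 650500, Yunnan, China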

Source: Journal of Kunming University of Science and Technology (Natural Science), 2024, No. 3, pp. 103-115 (13 pages)

Funding: National Natural Science Foundation of China (62262034, 62262035); Yunnan Province Science and Technology Open-Competition Project (202204BW050001).

Abstract: Classifier chains are an effective approach to multi-label classification, and finding a suitable label order for the chain is the key to this family of algorithms. With a single chain, an inappropriate label order seriously degrades classification performance, while using multiple randomly ordered chains sharply increases algorithmic complexity. To address these issues, a multi-label classifier chain algorithm based on PageRank and mutual information is proposed. First, the commonality between labels and web pages is explored, and the similarity relations between labels are treated as analogous to the links between web pages. Second, to account for global correlation, mutual information is used to measure the correlation between labels. Finally, based on this correlation information, the PageRank idea of measuring web-page importance is used to rank the labels and form the classifier chain. Experiments on 10 public multi-label datasets from different domains show that the algorithm finds a suitable label order for the classifier chain, improving classification accuracy while reducing computational cost.
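The three steps in the abstract (treat labels as nodes, weight the links by mutual information, rank with PageRank) can be illustrated with a minimal Python sketch. The abstract does not give the paper's exact graph construction, normalization, damping factor, or whether high-ranked labels go first in the chain, so all of those are assumptions here, and the function names `mutual_information` and `label_order` are hypothetical.

```python
import numpy as np


def mutual_information(a, b):
    """Empirical mutual information (in nats) between two binary label vectors."""
    mi = 0.0
    for va in (0, 1):
        for vb in (0, 1):
            p_ab = np.mean((a == va) & (b == vb))
            p_a = np.mean(a == va)
            p_b = np.mean(b == vb)
            if p_ab > 0:  # p_ab > 0 implies p_a > 0 and p_b > 0
                mi += p_ab * np.log(p_ab / (p_a * p_b))
    return max(mi, 0.0)  # guard against tiny negative rounding errors


def label_order(Y, damping=0.85, iters=100, tol=1e-10):
    """Rank labels by PageRank on a mutual-information-weighted label graph.

    Y is an (n_samples, n_labels) binary matrix. Assumption: labels with a
    higher PageRank score are placed earlier in the chain.
    """
    n_labels = Y.shape[1]

    # Symmetric edge weights: pairwise mutual information between labels.
    W = np.zeros((n_labels, n_labels))
    for i in range(n_labels):
        for j in range(i + 1, n_labels):
            W[i, j] = W[j, i] = mutual_information(Y[:, i], Y[:, j])

    # Column-normalize into a transition matrix; a label with no links
    # spreads its score uniformly (the usual dangling-node treatment).
    col_sums = W.sum(axis=0)
    safe_sums = np.where(col_sums > 0, col_sums, 1.0)
    P = np.where(col_sums > 0, W / safe_sums, 1.0 / n_labels)

    # Power iteration for the PageRank vector.
    r = np.full(n_labels, 1.0 / n_labels)
    for _ in range(iters):
        r_new = (1.0 - damping) / n_labels + damping * (P @ r)
        converged = np.abs(r_new - r).sum() < tol
        r = r_new
        if converged:
            break

    order = np.argsort(-r, kind="stable")  # highest score first
    return order, r


# Toy data: labels 0 and 1 always co-occur; label 2 is independent of both.
Y_demo = np.array([[0, 0, 0],
                   [0, 0, 1],
                   [1, 1, 0],
                   [1, 1, 1]])
demo_order, demo_scores = label_order(Y_demo)
print(demo_order, demo_scores)
```

An order computed this way could then be handed to a chain implementation that accepts an explicit label order, for example the `order` parameter of scikit-learn's `ClassifierChain`. Because a single chain is trained, the cost stays below that of an ensemble of randomly ordered chains, which is the trade-off the abstract describes.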

Keywords: multi-label classification; classifier chains; PageRank; label correlation; mutual information

Classification code: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]

 
