A Novel Framework for Learning and Classifying the Imbalanced Multi-Label Data  

Authors: P.K.A. Chitra, S. Appavu alias Balamurugan, S. Geetha, Seifedine Kadry, Jungeun Kim, Keejun Han

Affiliations: [1] Department of Computer Science and Engineering, SRM Institute of Science and Technology, Tiruchirappalli, Tamil Nadu, 603203, India; [2] Department of Computer Science and Engineering, Periyar Maniammai Institute of Science & Technology (Deemed to be University), Thanjavur, Tamil Nadu, 613403, India; [3] School of Computer Science and Engineering, Chennai, Tamil Nadu, 600048, India; [4] Department of Applied Data Science, Noroff University College, Kristiansand, 4612, Norway; [5] Artificial Intelligence Research Center (AIRC), College of Engineering and Information Technology, Ajman University, P.O. Box 346, Ajman, United Arab Emirates; [6] Department of Electrical and Computer Engineering, Lebanese American University, Byblos, 10150, Lebanon; [7] Department of Software, Kongju National University, Cheonan, 31080, Republic of Korea; [8] Division of Computer Engineering, Hansung University, Seoul, 02876, Republic of Korea

Source: Computer Systems Science & Engineering (English-language journal), 2024, Issue 5, pp. 1367-1385, 19 pages

Funding: Partly supported by the Technology Development Program of MSS (No. S3033853) and by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2021R1A4A1031509).

Abstract: Multi-label learning is a generalization of supervised single-label learning based on the assumption that each sample in a dataset may belong to more than one class simultaneously. The main objective of this work is to create a novel framework for learning and classifying imbalanced multi-label data. The proposed framework has two phases. In phase 1, the imbalanced distribution of the multi-label dataset is addressed through the proposed Borderline MLSMOTE resampling method. In phase 2, an adaptive weighted l21 norm regularized (Elastic-net) multi-label logistic regression is used to predict unseen samples. In contrast to conventional MLSMOTE, the proposed Borderline MLSMOTE resampling method focuses on samples with concurrent high labels. The minority labels in these samples are called difficult minority labels and are more prone to penalize classification performance. The concurrent measure is considered borderline, and labels associated with samples are regarded as borderline labels in the decision boundary. In phase 2, a novel adaptive l21 norm regularized weighted multi-label logistic regression is used to handle balanced data with different weighted synthetic samples. Experiments on various benchmark datasets show that the proposed method outperforms existing conventional state-of-the-art multi-label methods in predictive performance.
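The building blocks named in the abstract can be illustrated with a minimal sketch. It does not reproduce the proposed Borderline MLSMOTE or the adaptive weighting scheme, whose details are not given here. It assumes the per-label imbalance ratio (IRLbl) and its mean (MeanIR) from conventional MLSMOTE for picking minority labels, plain SMOTE-style interpolation for synthetic features, and reads "l21 norm" as the usual l2,1 norm (the sum of row-wise Euclidean norms of a weight matrix). All function names are hypothetical.

```python
import math
import random


def imbalance_ratios(Y):
    """Per-label imbalance ratio: count of the most frequent label
    divided by the count of each label (inf for labels never seen)."""
    n_labels = len(Y[0])
    counts = [sum(row[j] for row in Y) for j in range(n_labels)]
    top = max(counts)
    return [top / c if c else math.inf for c in counts]


def minority_labels(Y):
    """Labels whose imbalance ratio exceeds the mean ratio.
    Labels with no positive sample are ignored (an assumption of this sketch)."""
    ratios = imbalance_ratios(Y)
    finite = [r for r in ratios if math.isfinite(r)]
    mean_ir = sum(finite) / len(finite)
    return [j for j, r in enumerate(ratios) if math.isfinite(r) and r > mean_ir]


def interpolate(x, neighbor, rng):
    """SMOTE-style synthetic feature vector on the segment from x to neighbor."""
    gap = rng.random()  # uniform in [0, 1)
    return [a + gap * (b - a) for a, b in zip(x, neighbor)]


def l21_norm(W):
    """l2,1 norm: sum over rows (features) of the Euclidean norm of each row."""
    return sum(math.sqrt(sum(w * w for w in row)) for row in W)


# Toy label matrix: 4 samples, 3 labels with counts 4, 2 and 1.
Y = [[1, 0, 0], [1, 1, 0], [1, 0, 1], [1, 1, 0]]
print(imbalance_ratios(Y))   # ratios relative to the most frequent label
print(minority_labels(Y))    # indices of labels rarer than average
print(interpolate([0.0, 0.0], [1.0, 2.0], random.Random(0)))
print(l21_norm([[3.0, 4.0], [0.0, 0.0]]))
```

Because the l2,1 penalty sums whole-row norms, it tends to shrink all of a feature's coefficients across labels together, which is why it is commonly used for shared feature selection in multi-label models.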

Keywords: Multi-label imbalanced data; multi-label learning; Borderline MLSMOTE; concurrent multi-label; adaptive weighted multi-label elastic net; difficult minority label

Classification code: TP311.13 [Automation and Computer Technology: Computer Software and Theory]

 
