A Novel Metadata Based Multi-Label Document Classification Technique  

在线阅读下载全文

作  者:Naseer Ahmed Sajid Munir Ahmad Atta-ur Rahman Gohar Zaman Mohammed Salih Ahmed Nehad Ibrahim Mohammed Imran BAhmed Gomathi Krishnasamy Reem Alzaher Mariam Alkharraa Dania AlKhulaifi Maryam AlQahtani Asiya A.Salam Linah Saraireh Mohammed Gollapalli Rashad Ahmed 

机构地区:[1]Barani Institute of Information Technology(BIIT),PMAS Arid Agriculture University,Rawalpindi,46000,Punjab,Pakistan [2]Department of Computer Science(CS),College of Computer Science and Information Technology(CCSIT),Imam Abdulrahman Bin Faisal University,P.O.Box 1982,Dammam,31441,Saudi Arabia [3]Faculty Computer Science and Information Technology,Universiti Tun Hussein Onn Malaysia,Batu Pahat,Malaysia [4]Department of Computer Engineering(CE),College of Computer Science and Information Technology(CCSIT),Imam Abdulrahman bin Faisal University,P.O.Box 1982,Dammam,31441,Saudi Arabia [5]College of Business Administration,Imam Abdulrahman bin Faisal University,P.O.Box 1982,Dammam,31441,Saudi Arabia [6]Department of Computer Information System(CIS),College of Computer Science and Information Technology(CCSIT),Imam Abdulrahman bin Faisal University,P.O.Box 1982,Dammam,31441,Saudi Arabia [7]ICS Department,King Fahd University of Petroleum and Minerals,Dhahran,31261,Saudi Arabia

出  处:《Computer Systems Science & Engineering》2023年第8期2195-2214,共20页计算机系统科学与工程(英文)

摘  要:From the beginning,the process of research and its publication is an ever-growing phenomenon and with the emergence of web technologies,its growth rate is overwhelming.On a rough estimate,more than thirty thousand research journals have been issuing around four million papers annually on average.Search engines,indexing services,and digital libraries have been searching for such publications over the web.Nevertheless,getting the most relevant articles against the user requests is yet a fantasy.It is mainly because the articles are not appropriately indexed based on the hierarchies of granular subject classification.To overcome this issue,researchers are striving to investigate new techniques for the classification of the research articles especially,when the complete article text is not available(a case of nonopen access articles).The proposed study aims to investigate the multilabel classification over the available metadata in the best possible way and to assess,“to what extent metadata-based features can perform in contrast to content-based approaches.”In this regard,novel techniques for investigating multilabel classification have been proposed,developed,and evaluated on metadata such as the Title and Keywords of the articles.The proposed technique has been assessed for two diverse datasets,namely,from the Journal of universal computer science(J.UCS)and the benchmark dataset comprises of the articles published by the Association for computing machinery(ACM).The proposed technique yields encouraging results in contrast to the state-ofthe-art techniques in the literature.

关 键 词:Multilabel classification INDEXING METADATA content/data mining 

分 类 号:TP311[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象