Efficient Feature Selection and ML Algorithm for Accurate Diagnostics  被引量:1

在线阅读下载全文

作  者:Vincent Omollo Nyangaresi Nidhal Kamel Taha El-Omari Judith Nyakanga Nyakina 

机构地区:[1]Faculty of Biological&Physical Sciences,Tom Mboya University College,Homabay,Kenya [2]The World Islamic Science and Education University,Amman,Jordan [3]School of Nursing,Uzima University,Kisumu,Kenya

出  处:《Journal of Computer Science Research》2022年第1期10-19,共10页计算机科学研究(英文)

摘  要:Machine learning algorithms have been deployed in numerous optimization,prediction and classification problems.This has endeared them for application in fields such as computer networks and medical diagnosis.Although these machine learning algorithms achieve convincing results in these fields,they face numerous challenges when deployed on imbalanced dataset.Consequently,these algorithms are often biased towards majority class,hence unable to generalize the learning process.In addition,they are unable to effectively deal with high-dimensional datasets.Moreover,the utilization of conventional feature selection techniques from a dataset based on attribute significance render them ineffective for majority of the diagnosis applications.In this paper,feature selection is executed using the more effective Neighbour Components Analysis(NCA).During the classification process,an ensemble classifier comprising of K-Nearest Neighbours(KNN),Naive Bayes(NB),Decision Tree(DT)and Support Vector Machine(SVM)is built,trained and tested.Finally,cross validation is carried out to evaluate the developed ensemble model.The results shows that the proposed classifier has the best performance in terms of precision,recall,F-measure and classification accuracy.

关 键 词:Accuracy CLASSIFIER ENSEMBLE F-MEASURE Machine learning Precision RECALL 

分 类 号:TP1[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象