A New Hybrid Feature Selection Sequence for Predicting Breast Cancer Survivability Using Clinical Datasets  

Authors: E. Jenifer Sweetlin, S. Saudia

Affiliation: [1] Centre for Information Technology and Engineering, Manonmaniam Sundaranar University, Tirunelveli, India

Source: Intelligent Automation & Soft Computing (English-language journal), 2023, No. 7, pp. 343-367 (25 pages)

Abstract: This paper proposes a hybrid feature selection sequence complemented with filter and wrapper concepts to improve the accuracy of Machine Learning (ML) based supervised classifiers in classifying the survivability of breast cancer patients into two classes, living and deceased, using the METABRIC and Surveillance, Epidemiology and End Results (SEER) datasets. The ML-based classifiers used in the analysis are Multiple Logistic Regression, K-Nearest Neighbors, Decision Tree, Random Forest, Support Vector Machine and Multilayer Perceptron. The workflow of the proposed ML algorithm sequence comprises the following stages: data cleaning, data balancing, feature selection via a filter and wrapper sequence, cross-validation-based training, testing and performance evaluation. The results are compared in terms of the following classification metrics: Accuracy, Precision, F1 score, True Positive Rate, True Negative Rate, False Positive Rate, False Negative Rate, Area under the Receiver Operating Characteristic curve, Area under the Precision-Recall curve and Matthews Correlation Coefficient. The comparison shows that the proposed feature selection sequence produces better results from all supervised classifiers than all other feature selection sequences considered in the analysis.
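
Note: The threshold-based metrics listed in the abstract all follow from the four counts of a binary confusion matrix. The sketch below uses the standard textbook definitions and is not taken from the paper. The function name `confusion_metrics`, the choice of "deceased" as the positive class, the convention of returning 0.0 for an undefined ratio, and the example counts are all assumptions made for illustration. The two area-under-curve metrics are omitted because they require classifier scores rather than counts.

```python
import math


def confusion_metrics(tp, fp, tn, fn):
    """Compute count-based binary classification metrics.

    Assumption: the positive class is "deceased"; the abstract does not
    state which class the authors treat as positive.
    """
    def ratio(num, den):
        # Assumed convention: an undefined ratio (zero denominator) is 0.0.
        return num / den if den else 0.0

    tpr = ratio(tp, tp + fn)        # True Positive Rate (recall)
    tnr = ratio(tn, tn + fp)        # True Negative Rate (specificity)
    precision = ratio(tp, tp + fp)
    mcc_den = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "accuracy": ratio(tp + tn, tp + fp + tn + fn),
        "precision": precision,
        "f1": ratio(2 * precision * tpr, precision + tpr),
        "tpr": tpr,
        "tnr": tnr,
        "fpr": ratio(fp, fp + tn),  # False Positive Rate = 1 - TNR
        "fnr": ratio(fn, fn + tp),  # False Negative Rate = 1 - TPR
        "mcc": ratio(tp * tn - fp * fn, mcc_den),
    }


# Hypothetical counts, not results reported in the paper.
metrics = confusion_metrics(tp=40, fp=10, tn=45, fn=5)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Because the Matthews Correlation Coefficient uses all four counts, it stays informative when the classes are imbalanced, which may be one reason it is reported alongside accuracy here.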

Keywords: accuracy; feature selection; filter methods; ML-based classifiers; wrapper methods

Classification code: TP311.13 [Automation and Computer Technology: Computer Software and Theory]

 
