可减轻腰椎间盘样本集类重叠的采样算法  

Sampling Algorithm for Reducing Class Overlap in Lumbar Disc Samples

作  者:赵鑫鑫 吴晓锋 ZHAO Xinxin;WU Xiaofeng(School of Mathematics and Statistics,Minnan Normal University,Zhangzhou 363000,China;School of Mathematics and Computer Science,Quanzhou Normal University,Quanzhou 362000,China)

机构地区:[1]闽南师范大学数学与统计学院,福建漳州363000 [2]泉州师范学院数学与计算机科学学院,福建泉州362000

出  处:《软件工程》2025年第1期40-45,共6页Software Engineering

基  金:福建省区域发展项目(2019Y3007)。

摘  要:医学数据的类重叠问题会严重影响疾病的智能诊断效果。为了减轻腰椎间盘样本的类重叠对分类器产生的不良影响,提出了一种可减轻类重叠的混合采样算法——CO_HS算法。该算法将训练样本划分为核心样本、边界样本和噪声样本,对重叠区域的样本进行采样,以减轻样本集的类重叠程度。采用CO_HS算法产生的新训练样本集训练RF等分类模型,并建立了6种新的腰椎间盘退变分类器。实验结果显示,建立的新分类器在多项性能指标上均实现了显著提升,其中准确度提升了7.8百分点~12.7百分点,kappa系数提升了11.6百分点~20.2百分点,敏感性提升了7.9百分点~16.8百分点,特异性提升了9.0百分点~18.2百分点,F指标提升了9.4百分点~18.4百分点。因此,CO_HS算法被证明是一种能有效解决样本类重叠问题、改善分类性能的高效方法。The class overlap problem in medical data can severely affect the performance of intelligent disease diagnosis.To mitigate the negative impact of class overlap in lumbar disc samples on classifiers,this paper proposes a CO_HS algorithm,a hybrid sampling algorithm to reduce class overlap.This algorithm divides the training samples into core samples,boundary samples,and noise samples,sampling from the overlapping region to reduce the degree of class overlap in the dataset.New training samples generated by the CO_HS algorithm are used to train classification models such as Random Forest(RF),resulting in the establishment of six new classifiers for lumbar disc degeneration.Experimental results indicate that the newly established classifiers show significant improvement across multiple performance metrics.Specifically,the accuracy has increased by 7.8 percentage points to 12.7 percentage points,the kappa coefficient has increased by 11.6 percentage points to 20.2 percentage points,sensitivity has been improved by 7.9 percentage points to 16.8 percentage points,specificity has been elevated by 9.0 percentage points to 18.2 percentage points,and the F-measure has been boosted by 9.4 percentage points to 18.4 percentage points.Therefore,the CO_HS algorithm is proven to be an effective method for addressing the class overlap issue and improving classification performance.

关 键 词:智能医学 类重叠 混合采样 腰椎间盘退变 

分 类 号:TP181[自动化与计算机技术—控制理论与控制工程] R604[自动化与计算机技术—控制科学与工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象