基于混合采样和SE_ResNet_SVM的不平衡多分类研究  

Unbalanced Multiclassification Study Based on Mixed Sampling and SE_ResNet_SVM

在线阅读下载全文

作  者:矫桂娥 翁铜铜[3] 张文俊 JIAO Guie;WENG Tongtong;ZHANG Wenjun(Shanghai Film Academy,Shanghai University,Shanghai 200072,China;College of Information Technology,Shanghai Jian Qiao University,Shanghai 201306,China;College of Information Technology,Shanghai Ocean University,Shanghai 201306,China)

机构地区:[1]上海大学上海电影学院,上海200072 [2]上海建桥学院信息技术学院,上海201306 [3]上海海洋大学信息学院,上海201306

出  处:《应用科学学报》2024年第6期1000-1015,共16页Journal of Applied Sciences

基  金:国家自然科学基金(No.61572434);上海科学技术委员会科普项目(No.19DZ22048)资助。

摘  要:针对结构化多分类算法中不平衡数据集类别分布不均导致分类难度增加的问题,本文提出了一种基于混合采样、压缩与激励(squeeze and excitation, SE)模块、改进深度残差网络和支持向量机(support vector machines, SVM)的网络模型SNSMRS (SMOTEENNmixed residual networks-SVM network)。首先,通过合成少数过采样和编辑最近邻技术来改善数据分布;然后,构建融合SE模块与通过融合批次归一化和群组归一化的深度残差网络来提取特征;最后,通过SVM进行输出网络模型。其中,SE模块增强了模型对特征的区分能力,提升了模型的鲁棒性;基于融合归一化的残差网络受批次大小的影响较小,并且避免了传统神经网络梯度消失和精度退化等问题,增强了网络的稳定性与准确度;SVM可以根据特征向量在空间上的分布进行全部特征的分割,特征利用率高,提高了模型的分类精度。在7个不同规模和领域的非平衡公开数据集上进行了对比和消融实验,结果表明,本文所提的网络模型SNSMRS不仅优于其他深度学习模型,而且相对于未改良的ResNet,Macro-F1和G-mean值分别提升了约3%和4%,同时在4个数据集上的Macro-F1和G-mean值均超过了95%。A network model SNSMRS(SMOTEENN-mixed residual networks-SVM network)based on hybrid sampling,squeeze and excitation(SE)module,improved deep residual network and support vector machines(SVM)is proposed to address the problem of uneven class distribution of unbalanced data sets in traditional structured multiclassification algorithms, which leads to increased classification difficulty. Firstly, the data distribution isimproved by synthesizing minority oversampling and editing nearest neighbors technique.Then the features are extracted by combining SE module and a deep residual network,improved with batch normalization and group normalization. Finally, the network modeluses support vector machine (SVM) to output the classification results. The SE moduleenhances the model’s feature differentiation ability and robustness. The improvements tothe ResNet, through fusion normalization, mitigate issues such as gradient vanishing andaccuracy degradation, and ensure stability and accuracy regardless of batch_size. Additionally,SVM enhances the classification accuracy by effectively utilizing feature vectors inspace to classify and extract features. Comparison and ablation experiments are conductedon seven unbalanced public datasets of various sizes and domains. The experimental resultsshow that the proposed model, SNSMRS, not only outperforms other deep learningmodels, but also increases the values of Macro-F1 and G-mean by approximately 3% and4%, respectively, compared with the original ResNet. Macro-F1 and G-mean values ofSNSMRS exceed 95% on four of the datasets, demonstrating its superior performance.

关 键 词:不平衡多分类 混合采样 压缩与激励模块 群组归一化 ResNet 支持向量机 

分 类 号:TP183[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象