SE-CMT语义信息补充的特征提取模型

Feature Extraction Model of SE-CMT Semantic Information Supplement

作　　者：杜睿山[1,2] 周长坤解红涛[1] 李宏杰 DU Ruishan;ZHOU Changkun;XIE Hongtao;LI Hongjie(School of Computer and Information Technology,Northeast Petroleum School,Daqing 163318,China;Key Laboratory of Oil and Gas Reservoir and Underground Gas Storage Integrity Evaluations,Northeast Petroleum University,Daqing 163318,China)

机构地区：[1]东北石油大学计算机与信息技术学院,黑龙江大庆163318 [2]东北石油大学油气藏及地下储库完整性评价黑龙江省重点实验室,黑龙江大庆163318

出　　处：《哈尔滨理工大学学报》2024年第6期74-81,共8页Journal of Harbin University of Science and Technology

基　　金：国家重点研发计划(2022YFE0206800);黑龙江省自然科学基金(LH2021F004).

摘　　要：在图像分类中,有益的语义信息补充可以高效捕捉关键区域,提高分类性能。为了获得有益的图像语义信息,提出了一种SE-CMT(SE-Networks CNN Meet Transformer)模型。该模型依据简单的CNN特征提取理论,输入图像通过SE-CMT Stem重标定前面提取到的特征,再通过SE-CMT Block中的深度卷积层来增强特征。利用SE-CNN(Squeeze-and-Excitation Networks-CNN)提取低级特征、加强局部性,并结合Transformer建立长程依赖关系,通过融合SE-CNN和Transformer结构,提高特征提取性能。在ImageNet和CIFAR-10数据集上的实验结果表明:SE-CMT模型的分类准确率分别达到了85.47%和87.16%top-1精度,性能优于基线模型CMT和Vision Transformer。因此,本文提出的SE-CMT模型是一种有效的图像特征提取方法。In image classification,beneficial semantic information supplementation can efficiently capture key regions and improve classification performance.To obtain beneficial image semantic information,an SE-CMT(SE-Networks CNN Meet Transformer)model is proposed.The model is based on the simple CNN feature extraction theory,where the input image is rescaled by the SE-CMT Stem to the previously extracted features,and then the features are enhanced by the deep convolutional layer in the SE-CMT Block.The model uses SE-CNN(Squeeze-and-Excitation Networks-CNN)to extract low-level features,enhance localization,and combine with Transformer to establish long-range dependencies to improve feature extraction performance by fusing SE-CNN and Transformer structures.The experimental results on ImageNet and CIFAR-10 datasets show that the classification accuracy of the SE-CMT model reaches 85.47%and 87.16%top-1 accuracy,respectively,and the experiments show that the method outperforms the baseline models CMT and Vision Transformer.Therefore,the proposed SE-CMT model in this study is an effective method for image feature extraction.

关键词：图像分类权重矩阵卷积神经网络 TRANSFORMER 通道域

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

SE-CMT语义信息补充的特征提取模型

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

SE-CMT语义信息补充的特征提取模型

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索