The Swin Transformer-based Manchu character recognition model with multi-scale feature fusion


Authors: TAN Zhen-jiang [1], LI Ming-yan, WANG Da-dong [1]

Affiliation: [1] College of Mathematics and Computer Science, Jilin Normal University, Siping 136000, Jilin, China

Source: Journal of Jilin Normal University: Natural Science Edition, 2025, No. 1, pp. 103-110 (8 pages)

Funding: Scientific Research Project of the Education Department of Jilin Province (JJKH20240573KJ).

Abstract: To address inherent challenges in Manchu character recognition, such as non-standard morphological variants and multiple written forms for a single sound, this paper proposes MR-SwinT (Multi-scale feature fusion based Swin Transformer), a multi-scale feature fusion model built on the Swin Transformer architecture. A multi-resolution parallel input mechanism lets the model capture fine-grained local character features together with macro-level contextual information. The model's core advantage is that it makes full use of the Swin Transformer's hierarchical, window-based self-attention mechanism, which provides strong representational capacity for large-scale feature modeling. In addition, the SMTBlocks module designed in this work dynamically fuses multi-resolution features through an adaptive weighting adjustment strategy, which strengthens both the model's ability to discriminate complex characters and its generalization performance. Experiments show that MR-SwinT reaches 96.59% accuracy for whole-word recognition and 99.46% accuracy for single-character recognition.
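The abstract does not say how the adaptive weighting in SMTBlocks is computed. As a rough illustration of the general idea only, the sketch below fuses feature vectors from several resolution branches with softmax-normalized weights. The function names, the per-branch scalar scores, and the toy vectors are all assumptions made for illustration and are not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_branches(features, branch_scores):
    """Weighted sum of same-length feature vectors, one per resolution branch.

    features:      list of feature vectors (lists of floats), one per branch
    branch_scores: one unnormalized score per branch; hypothetical stand-ins
                   for parameters that a trained model would learn
    """
    if len(features) != len(branch_scores):
        raise ValueError("need exactly one score per branch")
    dim = len(features[0])
    if any(len(f) != dim for f in features):
        raise ValueError("all branches must share one feature dimension")
    # Normalize the scores so the branch weights are positive and sum to 1.
    weights = softmax(branch_scores)
    fused = [
        sum(w * f[i] for w, f in zip(weights, features))
        for i in range(dim)
    ]
    return fused, weights


# Toy vectors standing in for pooled features of two input resolutions.
low_res = [1.0, 0.0, 2.0]
high_res = [3.0, 4.0, 0.0]

# Equal scores weight both branches equally.
fused, weights = fuse_branches([low_res, high_res], [0.0, 0.0])
print(weights, fused)

# A larger score shifts the fused vector toward that branch.
fused_hi, weights_hi = fuse_branches([low_res, high_res], [0.0, 2.0])
print(weights_hi, fused_hi)
```

In a real network the scores would typically be learned, possibly per channel or per spatial position, and the branches would first be resized to a common shape. This sketch leaves both steps out.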

Keywords: Manchu recognition; Swin Transformer; deep learning; multi-scale feature fusion

Classification code: TP391.41 [Automation and Computer Technology: Computer Application Technology]

 
