Authors: GAO Dingguo (高定国); HOU Yan (侯闫); GAO Hongmei (高红梅); Suolang-Quzhen (索朗曲珍)
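Affiliations: [1] School of Information Science and Technology, Tibet University, Lhasa 850000, Tibet, China [2] Tibetan Information Technology Innovative Talent Cultivation Demonstration Base, Lhasa 850000, Tibet, China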
Source: Plateau Science Research (《高原科学研究》), 2023, No. 1, pp. 92-100 (9 pages)
Funding: National Natural Science Foundation of China (62166038); Tibet University Postgraduate High-Level Talent Cultivation Program (2020-GSP-S177).
Abstract: With the development of Tibetan information processing technology, recognition of the Tibetan Wujin font has achieved good results. The Tibetan Wumei font, however, varies widely in writing style and is difficult to detect and recognize, so existing Wumei recognition work is largely limited to recognizing individual characters in a single font. In recent years, as computer fonts have become richer, printed Wumei text in multiple fonts has appeared. To recognize such text accurately, this paper performs Tibetan text detection with DBNet, using a model pre-trained on Chinese and English, and performs end-to-end recognition of multi-font printed Wumei text with CRNN and SRN, two models with different encoding-decoding schemes, both using ResNet-50 as the backbone network. The recognition results of the two models are then compared experimentally. When the fonts used for training and testing are the same, the two models perform comparably. When tested on eight other Wumei fonts not included in the training set, SRN improves on CRNN by 0.5363%, 1.7681%, and 3.4875% on the three evaluation metrics TCR, TDR, and LRA, respectively, showing stronger generalization ability.
Classification code: TP391.41 [Automation and Computer Technology: Computer Application Technology]