检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:赵凡 张琳 闻治泉 杨林林 蔺广逢 ZHAO Fan;ZHANG Lin;WEN Zhiquan;YANG Linlin;LIN Guangfeng(Department of Information Science,School of Printing,Packaging and Digital Media,Xi’an University of Technology,Xi’an 710048,China)
机构地区:[1]西安理工大学印刷包装与数字媒体学院信息科学系,西安710048
出 处:《计算机工程与应用》2021年第6期159-167,共9页Computer Engineering and Applications
基 金:国家自然科学基金(61671376,61771386);陕西省重点研发计划(2020SF-359)。
摘 要:为了提高经典目标检测算法对自然场景文本定位的准确性,以及克服传统字符检测模型由于笔画间存在非连通性引起的汉字错误分割问题,提出了一种直接高效的自然场景汉字逼近定位方法。采用经典的EAST算法对场景图像中的文字进行检测。对初检的文字框进行调整使其更紧凑和更完整地包含文字,主要由提取各连通笔画成分、汉字分割和文字形状逼近三部分组成。矫正文字区域和识别文字内容。实验结果表明,提出的算法在保持平均帧率为3.1帧/s的同时,对ICDAR2015、ICDAR2017-MLT和MSRA-TD500三个多方向数据集上文本定位任务中的F-score分别达到83.5%、72.8%和81.1%;消融实验验证了算法中各模块的有效性。在ICDAR2015数据集上的检测和识别综合评估任务中的性能也验证了该方法相比一些最新方法取得了更好的性能。In order to improve the accuracy of the classic target detection algorithms for text localization in natural scenes,and to overcome the problem of incorrect segmentation of Chinese characters by traditional character detection models due to the non-connectivity between strokes,a direct and efficient Chinese text spotting method is proposed in this paper.Text box is detected by EAST algorithm.The detected text box is adjusted to make it more compact and contain text more comprehensively,which comprises the connected component extraction,Chinese character segmentation and text shape approximation.The extracted text regions are corrected and transcribed.Experimental results show that while maintaining 3.2 frame per second,the proposed algorithm has F-score of 83.5%,72.8%and 81.1%in text positioning task of three multi-oriented text datasets,ICDAR2015,ICDAR2017-MLT and MSRA-TD500,respectively.The ablation experiment verifies the effectiveness of each module in the proposed algorithm.The performance of the comprehensive evaluation task of detection and recognition on the ICDAR2015 data set also proves that the proposed method has achieved better performance than some of the latest methods.
关 键 词:文字检测 文字定位 文字识别 卷积神经网络 多方向文字 谱聚类
分 类 号:TP399[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.223.135.69