Authors: FU Pengbin (付鹏斌); LIU Penghui (刘鹏辉); YANG Huirong (杨惠荣); DONG Aojing (董澳静) (Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China)
Source: Computer Engineering (《计算机工程》), 2022, No. 3, pp. 253-262 (10 pages)
Funding: National Natural Science Foundation of China (61772048); Beijing Natural Science Foundation (4153058).
Abstract: Handwritten text recognition is mainly used in text input technology and plays a key role in the development of human-computer interaction. Because most online input methods cannot recognize mixed Chinese and English handwriting, an online recognition method for mixed Chinese-English handwritten text is proposed. Text strokes are first merged using rules based on horizontal relative position, vertical overlap rate, and area overlap rate, and connected (cursive) strokes are split, which yields a series of character segments. These segments are then classified as Chinese or English using geometric features (stroke count, aspect ratio, center deviation, and smoothness) together with recognition confidence. On this basis, according to the classification results and combined with path evaluation by a natural-language model and a dynamic programming search algorithm, the candidate Chinese and English character segments are merged separately to obtain the Chinese and English character sequences to be recognized. These sequences are fed to the Chinese and English recognition models of a Convolutional Neural Network (CNN), respectively, to obtain the handwritten text recognition result. Experimental results show that the recognition accuracy for online handwritten mixed Chinese-English text reaches 93.67%. The method segments online handwritten Chinese text lines, and it also segments well those online handwritten Chinese-English text lines that contain connected strokes between characters.
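Illustrative sketch (not from the paper): the abstract names the stroke-merging rules but gives neither their definitions nor their thresholds. The Python below shows only the general idea of grouping online strokes into character segments by how much their bounding boxes overlap along the writing direction. The overlap measure `x_overlap_ratio`, the threshold of 0.5, and all function names are assumptions made for illustration; the vertical and area overlap rules, connected-stroke splitting, and the Chinese/English classification step are omitted.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

Point = Tuple[float, float]  # (x, y) pen coordinate


@dataclass
class Box:
    x0: float
    y0: float
    x1: float
    y1: float


def bbox(stroke: List[Point]) -> Box:
    """Axis-aligned bounding box of one stroke."""
    xs = [p[0] for p in stroke]
    ys = [p[1] for p in stroke]
    return Box(min(xs), min(ys), max(xs), max(ys))


def union(a: Box, b: Box) -> Box:
    """Smallest box containing both a and b."""
    return Box(min(a.x0, b.x0), min(a.y0, b.y0),
               max(a.x1, b.x1), max(a.y1, b.y1))


def x_overlap_ratio(a: Box, b: Box) -> float:
    """Overlap of the two x-ranges, relative to the narrower box.

    Hypothetical measure; the paper's own rule definitions are not
    given in the abstract.
    """
    inter = min(a.x1, b.x1) - max(a.x0, b.x0)
    if inter < 0:
        return 0.0  # the x-ranges are disjoint
    narrower = min(a.x1 - a.x0, b.x1 - b.x0)
    if narrower == 0:
        return 1.0  # a zero-width stroke lying within the other x-range
    return inter / narrower


def merge_strokes(strokes: List[List[Point]],
                  threshold: float = 0.5) -> List[List[int]]:
    """Greedily group strokes, in writing order, into segments.

    A stroke joins the current segment when its x-overlap with the
    segment's bounding box reaches the (assumed) threshold.
    """
    segments: List[List[int]] = []
    seg_box: Optional[Box] = None
    for idx, stroke in enumerate(strokes):
        box = bbox(stroke)
        if seg_box is not None and x_overlap_ratio(seg_box, box) >= threshold:
            segments[-1].append(idx)
            seg_box = union(seg_box, box)
        else:
            segments.append([idx])
            seg_box = box
    return segments
```

For example, two strokes that share the same horizontal extent fall into one segment, while a stroke written further to the right starts a new one. In the method summarized above, such over-segmented pieces would subsequently be classified and recombined by the path search.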
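Keywords: online handwriting recognition; Chinese-English mixed handwriting; Chinese-English classification; text line segmentation; path evaluation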
Classification No.: TP391.43 [Automation and Computer Technology: Computer Application Technology]