检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:齐艳媚 田学东 左丽娜 QI Yan-mei;TIAN Xue-dong;ZUO Li-na(School of Cyber Security and Computer,Hebei University,Baoding 071002,China)
机构地区:[1]河北大学网络空间安全与计算机学院
出 处:《科学技术与工程》2019年第30期232-240,共9页Science Technology and Engineering
基 金:河北省教育厅河北省高等学校科学技术研究重点项目(ZD2017208)资助
摘 要:针对古籍文献版面图像切分中存在的过切分和粘连等问题,提出基于犹豫模糊集的古籍汉字图像切分方法。首先,对古籍汉字版面图像进行连通区域搜索,获取版面中的笔画部件,实现古籍汉字的初切分;然后,对过切分汉字进行特征分析,提取过切分区域的特征,利用犹豫模糊集在处理多属性决策问题方面的优势,通过建立犹豫模糊集来判断过切分区域归属同一古籍汉字的隶属度,并据此进行过切分区域的合并;最后,对存在粘连和重叠的汉字采用分段像素跳跃数突变分析方法进行分割。在28886个古籍汉字上的实验结果显示,可以达到92.3%的切分准确率和85.7%的过切分合并准确率。In order to solve the problems of over-segmentation and cohesion in the layout segmentation of ancient books and documents,a segmentation method of Chinese characters in ancient books based on hesitant fuzzy sets was established.Firstly,to realize the originally segmenting of Chinese characters in ancient books,the connected area search was carried out to obtain stokes in layout.Secondly,the feature analysis of over-segmentation Chinese characters was performed to extract the feature information of over-segmentation regions.Based on the advantages of the hesitant fuzzy sets in dealing with multi-attribute decision-making problems,a hesitant fuzzy sets was established to judge the membership degree of over-segmentation regions belonging to the same ancient Chinese characters,then merged the over-segmentation regions.Finally,the Chinese characters with adhesions and overlaps are segmented by using segmented pixel jump number mutation analysis method.In conclusion,the experimental results on 28886 ancient Chinese characters show that the accuracy of segmentation can reach 92.3%and the accuracy of the over-segmentation is 85.7%.
分 类 号:TP391.43[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.38