检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:芷香香 高定国 ZHI Xiangxiang;GAO Dingguo(College of Information Science and Technology,Tibet University,Lhasa 850000,China)
机构地区:[1]西藏大学信息科学技术学院,西藏拉萨850000
出 处:《高原科学研究》2022年第2期89-101,共13页Plateau Science Research
基 金:国家自然科学基金项目(62166038);西藏大学研究生高水平人才培养计划项目(00060701).
摘 要:为更好利用和挖掘藏文古籍文献内容,文章首先研究了手写藏文古籍文本的特点,按照其字形大小构建了3种数据集;其次采用PSENet、PixelLink、PANNet 3种基于分割的深度学习文本检测算法对多种字体的手写藏文古籍文本进行了检测;再评估了3种算法对手写藏文古籍文本的检测性能,分析了3种算法检测多种手写藏文古籍字体和字形大小的效果,指出了在同库实验中PSENet和PANNet性能优于Pixel⁃Link,跨库实验中PixelLink性能优于PSENet和PANNet。To sufficiently use and fully explore the content of Tibetan ancient handwritten books,Tibetan ancient handwritten books must be digitized.For digitaization of Tibetan ancient handwritten books,the first key step is to detect Tibetan text from the books correctly.And hence,in this paper firstly the characteristics of Tibetan an⁃cient handwritten books is studied,and three datasets is constructed according to the font size of Tibetan ancient handwritten books.Secondly,three algorithms i.e.PSENet,PixelLink and PANNet,which are based on deep learning text detection algorithms,are applied to detect the text of Tibetan ancient handwritten books with multi⁃ple fonts,and the evaluation of performance of the three algorithms is carried out.Moreover,the performance of the three algorithms in detecting various fonts and font size of Tibetan ancient handwritten books are compared.Our results show that the performance of PSENet and PANNet are better than that of PixelLink in detecting Tibet⁃an ancient handwritten books with three font sizes,while the performance of PixelLink is better than PSENet and PANNet in the cross-database experiment.
关 键 词:藏文古籍 多字体 文本检测 PSENet PixelLink PANNet
分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.26