基于三维特征和Transformer的数字化古籍文档图像矫正  

Digital Ancient Book Document Image Correction Based on 3D Features and Transformer

在线阅读下载全文

作  者:赵微 牟大中 李夏童 屈千林 曹鹏[1] ZHAO Wei;MU Dazhong;LI Xiatong;QU Qianlin;CAO Peng(Beijing Key Laboratory of Signal and lnformation Processing for High-end Printing Equipment,Beijing Institute of Graphic Communication,Beijing 102600,China)

机构地区:[1]北京印刷学院高端印刷装备信号与信息处理北京市重点实验室,北京102600

出  处:《北京印刷学院学报》2024年第8期66-72,共7页Journal of Beijing Institute of Graphic Communication

基  金:北京印刷学院校级学科建设与研究生教育项目(21090324012)研究成果。

摘  要:古籍文档图像矫正是古籍文档数字化中的一个关键环节,对提高古籍数字化质量具有重要的现实意义。针对古籍中普遍存在的氧化弯曲、粘连折叠、装订方式特殊等原因导致的形变复杂、矫正难度大的问题,本文提出了一种基于深度学习和三维特征信息提取的古籍文档图像矫正方法。首先使用U-Net形式的编码器-解码器提取古籍文档图像的三维特征,然后基于Transformer模型对得到的三维特征图进行后向映射,最后使用双线性插值得到矫正后的图像。为了验证所提出方法的有效性,在两个自制测试集上分别进行实验。实验结果表明,该方法在局部失真(Local Distortion,LD)概率上,相较于DewarpNet模型降低了2.61%~6.58%。实验证明所提出的方法能有效完成古籍文档图像的矫正任务,提升古籍数字化质量。The image correction of ancient literature documents is a key link in the digitization of ancient literature documents,which has important practical significance for improving the quality of ancient literature digitization.This paper proposes a method for correcting ancient book document images based on deep learning and 3D feature information extraction,in response to the problems of complex deformation and difficult correction caused by oxidation bending,adhesive folding and special binding methods commonly found in ancient books.Firstly,use a U-Net encoder decoder to extract the three-dimensional features of ancient document images.Then,based on the Transformer model,perform backward mapping on the obtained 3D feature map.Finally,use bilinear interpolation to obtain the corrected image.To verify the effectiveness of the proposed method,experiments were conducted on two self-made test sets.The experimental results show that this method reduces Local Distortion(LD)by 2.61%~6.58%compared to the DewarpNet model.The experimental results show that the proposed method can effectively complete the task of correcting ancient book document images and improve the digital quality of ancient books.

关 键 词:古籍图像 文档图像矫正 三维信息提取 TRANSFORMER 编码器-解码器 

分 类 号:TP3[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象