Table structure recognition model integrating edge features and attention  Cited by: 2


Authors: LYU Xueqiang[1]; ZHANG Yunan; HAN Jing; CUI Yunpeng[2]; LI Huan

Affiliations: [1] Beijing Key Laboratory of Internet Culture and Digital Dissemination Research (Beijing Information Science and Technology University), Beijing 100101, China; [2] Key Laboratory of Agricultural Big Data, Ministry of Agriculture (Agricultural Information Institute of Chinese Academy of Agricultural Science), Beijing 100081, China

Source: Journal of Computer Applications (《计算机应用》), 2023, No. 3, pp. 752-758 (7 pages)

Funding: National Natural Science Foundation of China (62171043).

Abstract: To address the dependence on prior knowledge, insufficient robustness, and limited expressive power of existing table structure recognition methods, a new table structure recognition model integrating edge features and attention, the Graph Edge-Attention Network based Table Structure Recognition model (GEAN-TSR), is proposed. First, a Graph Edge-Attention Network (GEAN) is proposed as the backbone network: building on the edge convolution structure, a graph attention mechanism is introduced and improved to aggregate graph node features, which addresses the information loss during graph-network feature extraction and improves the network's expressive power. Then, an edge feature fusion module fuses shallow graph node information with the graph network output to strengthen the network's local information extraction and expressive power. Finally, graph node text features extracted by a Gated Recurrent Unit (GRU) are fed into a text feature fusion module to classify the edges. In comparative experiments on the Scientific paper Table Structure Recognition-COMPlicated (SciTSR-COMP) dataset, GEAN-TSR improves recall and F1 score by 2.5 and 1.4 percentage points, respectively, over Split, Embed and Merge (SEM), the best existing model. In ablation experiments, GEAN-TSR achieves the best values on all metrics once the feature fusion module is used, which confirms the module's effectiveness. The experimental results show that GEAN-TSR effectively improves network performance and better completes the table structure recognition task.
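
The abstract gives no formulas for GEAN, so the following is only a rough illustration of the general idea of combining edge convolution with attention-based aggregation, not the authors' implementation. It assumes an EdgeConv-style edge feature [x_i, x_j - x_i], a single learnable attention vector, and softmax weights over each node's neighbors; the function name `edge_attention_layer` and all parameters are hypothetical.

```python
import math


def leaky_relu(x, slope=0.2):
    """LeakyReLU, used here to score edges (an assumed choice)."""
    return x if x > 0 else slope * x


def softmax(scores):
    """Numerically stable softmax over a non-empty list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def edge_attention_layer(features, neighbors, attn_vector):
    """Attention-weighted aggregation of EdgeConv-style edge features.

    features:    list of node feature vectors (lists of floats, length d)
    neighbors:   dict mapping node index -> non-empty list of neighbor indices
    attn_vector: hypothetical attention weights of length 2 * d
    Returns one aggregated vector of length 2 * d per node.
    """
    outputs = []
    for i, x_i in enumerate(features):
        # Edge feature for (i, j): x_i concatenated with the offset x_j - x_i.
        edge_feats = [
            x_i + [b - a for a, b in zip(x_i, features[j])]
            for j in neighbors[i]
        ]
        # One scalar attention score per edge.
        scores = [
            leaky_relu(sum(w * v for w, v in zip(attn_vector, e)))
            for e in edge_feats
        ]
        alphas = softmax(scores)
        # Weighted sum of edge features replaces EdgeConv's max/mean pooling.
        aggregated = [
            sum(alpha * e[k] for alpha, e in zip(alphas, edge_feats))
            for k in range(len(attn_vector))
        ]
        outputs.append(aggregated)
    return outputs


if __name__ == "__main__":
    feats = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
    nbrs = {0: [1, 2], 1: [0], 2: [0]}
    print(edge_attention_layer(feats, nbrs, [0.1, 0.1, 0.5, -0.5]))
```

In this sketch the attention weights decide how much each neighboring cell contributes to a node's representation, whereas plain edge convolution would pool all neighbors with a fixed max or mean. A real model would learn `attn_vector`, apply a trainable transform to the edge features, and add the fusion and GRU text modules that the abstract describes.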

Keywords: graph neural network; graph attention network; feature fusion; table structure recognition; table parsing

Classification number: TP391.4 [Automation and Computer Technology: Computer Application Technology]

 
