基于深度学习的子句级文本摘要模型

Clause-level Text Summarization Model Based on Deep Learning

作　　者：房一泉沈斌[1,2] 程华杜嘻嘻[2] FANG Yiquan;SHEN Bin;CHENG Hua;DU Xixi(Informatization Office,East China University of Science and Technology,Shanghai 200237;School of Information Science and Engineering,East China University of Science and Technology,Shanghai 200237)

机构地区：[1]华东理工大学信息化办公室,上海200237 [2]华东理工大学信息科学与工程学院,上海200237

出　　处：《计算机与数字工程》2024年第7期2135-2140,共6页Computer & Digital Engineering

基　　金：赛尔网络下一代互联网技术创新项目(编号:NGII20170520)资助。

摘　　要：针对文本自动摘要任务中整句级抽取式模型存在摘要过于冗余,以及训练目标与评价目标不匹配的问题,论文提出了一种基于深度学习的子句级文本摘要模型(CS-ASum)。首先,基于依存句法从原文中抽取子句级单元;然后,利用基于BERT预训练模型和基于Transformer模型的编码器获得子句的向量表示,得到初步候选摘要;最后,通过摘要匹配器计算候选摘要和原文的语义得分,得到最佳摘要。在CNN/Daily Mail数据集上的实验结果表明,CS-ASum在自动评测和人工评测中均优于对比模型,相较于表现最好的生成式摘要模型和抽取式摘要模型,CS-ASum的平均ROUGE指标值分别提高了0.76%和1.07%,由此可见,CS-ASum模型在自动文本摘要任务中比基础模型获得了更简洁、更忠于原文的摘要。The sentence-level extractive summarization model is relatively redundant and the training objectives do not match the evaluation goals in text automatic summarization task.In order to solve these problems,a clause-level text summarization model based on deep learning(CS-ASum)is proposed.Firstly,the clause-level units of the original text are extracted based on dependen⁃cy parsing method.Then,the vector representation of the clause are obtained by using the encoders based on BERT pre-trained model and Transformer model,and the preliminary summary candidates are received.Finally,the summary matcher is used to cal⁃culate the semantic similarity between the summary candidates and the original text,and the best summary is obtained.The experi⁃mental results on CNN/Daily Mail dataset show that,CS-ASum model is better than the compared models in both automatic evalua⁃tion and manual evaluation.Compared with the best-performing abstractive summarization model and extractive summarization mode,the average ROUGE value of CS-ASum model is increased by 0.76%and 1.07%respectively.Therefore,CS-ASum model acquires more concise and comprehensive summaries than the basic models in automatic text summarization task.

关键词：文本摘要深度学习子句级依存句法 ROUGE评测

分类号：TP391.1[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于深度学习的子句级文本摘要模型

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于深度学习的子句级文本摘要模型

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索