基于强化正则的小样本自动摘要方法

Automatic Summarization of Small Samples Based on Enhanced Regularization

作　　者：李清万卫兵 LI Qing;WAN Weibing(School of Electronic and Electrical Engineering,Shanghai University of Engineering Science,Shanghai 201620,China)

机构地区：[1]上海工程技术大学电子电气工程学院,上海201620

出　　处：《电子科技》2024年第7期16-24,共9页Electronic Science and Technology

基　　金：科技创新2030“新一代人工智能”重大项目(2020AAA0109300)。

摘　　要：文本自动摘要旨在从文本信息中提取主要语句以压缩信息。现有生成式自动摘要方法无法充分利用预训练模型对原文语义进行学习,导致生成内容易丢失重要信息,当面对样本数量较少的数据集时容易发生过拟合。为了解决此类问题并获得更好的微调性能,文中使用预训练模型mT5(multilingual T5)作为基线,通过结合R-drop(Regularized dropout)对模型微调进行强化正则来提高模型学习能力,同时利用Sparse softmax减少预测生成的模糊性来确保输出准确度。模型在中文数据集LCSTS和CSL上通过计算BLEU(Bilingual Evaluation Understudy)进行优化方法超参数测试,并采用Rouge作为评测指标分别对数据集进行了不同数量级的评测。实验结果表明,经过优化的预训练模型能够更好地学习原文语义表征,在小样本情况下模型能够保持较好的拟合效果,并且能够生成实用性较高的结果。Automatic text summarization aims to extract the main statements from text information for the purpose of compressing information.Existing generative automatic summarization methods do not take full advantage of the pre-trained model to learn the semantics of the original text,resulting in the loss of important information in the generated content,when the data set with a small number of samples is often prone to overfitting.In order to solve such problems and obtain better fine-tuning performance,the pre-trained model mT5(multilingual T5)is used as a baseline to improve the learning ability of the model by combining R-drop(Regularized dropout)with reinforced regularity for model fine-tuning,and Sparse softmax is used to reduce the ambiguity of prediction generation to ensure the accuracy of the output.The model calculates BLEU(Bilingual Evaluation Understudy)for hyperparameter test on Chinese data sets LCSTS and CSL,and uses Rouge as evaluation index to evaluate data sets of different orders of magnitude.The experimental results show that the optimized pre-trained model can better learn the semantic representation of the original text,and the model can maintain a good fit in the small samples and generate more practical results.

关键词：文本自动摘要文本生成预训练模型小样本数据强化正则稀疏化输出语义表征学习 mT5

分类号：TP391.1[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于强化正则的小样本自动摘要方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于强化正则的小样本自动摘要方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索