Multi-Task Few-Shot Text Steganalysis Based on Capsule Network (cited by: 2)

Authors: YANG Yu (杨雨); ZHANG Zi-Wei (张梓葳); WEN Juan (文娟) (College of Information and Electrical Engineering, China Agricultural University, Beijing 100091)

Affiliation: [1] College of Information and Electrical Engineering, China Agricultural University, Beijing 100091

Source: Chinese Journal of Computers (《计算机学报》), 2022, No. 12, pp. 2592-2604 (13 pages)

Funding: Supported by the National Natural Science Foundation of China (61802410).

Abstract: Text steganalysis is a technique for distinguishing steganographic text from normal text using statistical features. Currently, most state-of-the-art text steganalysis models use deep neural networks that are trained and tested on a single task. Such models perform well when detecting stego text generated by a specific steganographic method at a given embedding capacity in one domain. However, when the target task changes (the text domain, the steganographic algorithm used to generate the text, or the embedding capacity), their steganalysis performance degrades to some extent. To strengthen fast adaptation across different detection tasks and to handle steganalysis in few-shot scenarios, this paper proposes a capsule network-based approach to text steganalysis. Specifically, a Bi-LSTM (Bidirectional Long Short-Term Memory) with self-attention serves as a generic feature extractor that obtains sentence representations from the support set and the query set. The task mapper acts as the meta-learner that guides meta-training: after receiving the sentence representations of the support set, it learns a non-linear mapping between individual texts and the task. The mapping result and the sentence representations of the query set are then fed to the classifier, which measures how well each text matches the task. Finally, the total prediction loss is computed from a mean square error (MSE) loss and a Kullback-Leibler (KL) divergence loss. Extensive experiments show that the model adapts quickly to a variety of tasks, reaching average detection accuracies over the three domains of 85.11%, 88.63%, and 91.91% in the 1-shot, 5-shot, and 10-shot settings, respectively.
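The abstract states only that the total prediction loss combines an MSE term and a KL divergence term; it does not say how the two are weighted or which distributions the KL term compares. The following is a minimal sketch under assumptions that are not from the paper: the classifier output is a normalized score over the two classes (cover, stego), the target is a one-hot label, the KL term is taken as KL(target || prediction), and the two terms are summed with a hypothetical weight `kl_weight`. All function names are illustrative.

```python
import math


def mse(pred, target):
    """Mean squared error between two equal-length score vectors."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def kl_divergence(target, pred, eps=1e-12):
    """KL(target || pred) for discrete distributions.

    Terms where target == 0 contribute 0 (the usual 0 * log 0 = 0 convention);
    pred is clamped at eps to avoid division by zero.
    """
    return sum(
        t * math.log(t / max(p, eps))
        for t, p in zip(target, pred)
        if t > 0
    )


def total_loss(pred, target, kl_weight=1.0):
    """Hypothetical combination: MSE plus a weighted KL term.

    kl_weight is an assumption; the abstract does not give the weighting.
    """
    return mse(pred, target) + kl_weight * kl_divergence(target, pred)


# Example: one query text scored against the (cover, stego) classes.
pred = [0.2, 0.8]    # assumed classifier output, already summing to 1
target = [0.0, 1.0]  # one-hot label: the text is stego
loss = total_loss(pred, target)
```

With these assumptions the loss is zero when the prediction equals the one-hot target and grows as the score assigned to the correct class shrinks.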

Keywords: text steganalysis; fast adaptation; few-shot learning; meta-learning; capsule network

Classification number: TP309 [Automation and Computer Technology: Computer System Architecture]
