Authors: AN Junxiu [1]; YANG Linwang; LIU Yuan (School of Software Engineering, Chengdu University of Information Technology, Chengdu Sichuan 610225, China)
Affiliation: [1] School of Software Engineering, Chengdu University of Information Technology, Chengdu 610225
Source: Journal of Computer Applications (《计算机应用》), 2025, No. 4, pp. 1139-1147 (9 pages)
Funding: National Social Science Fund of China (22BXW048); Chengdu Key Science and Technology R&D Support Program (2022-YF05-00454-SN).
Abstract: Discrete word perturbation and embedding perturbation methods do not fully consider the distance boundaries between word vectors in the latent space. To address this problem, a Semantic Proximity-aware Adversarial Auto-Encoders (SPAAE) method was proposed. Firstly, adversarial auto-encoders were used as the underlying model. Secondly, the standard deviation of the probability distribution of the noise vectors was obtained from the proximity distance of the word vectors. Finally, by randomly sampling this probability distribution, the perturbation parameters were adjusted dynamically so as to blur each word vector's own semantics as much as possible without affecting the semantics of other word vectors. Experimental results show that, compared with the DAAE (Denoising Adversarial Auto-Encoders) and EPAAE (Embedding Perturbed Adversarial Auto-Encoders) methods, the proposed method improves natural fluency by 14.88% and 15.65%, respectively, on the Yelp dataset; improves Text Style Transfer (TST) accuracy by 11.68% and 6.45%, respectively, on the Scitail dataset; and improves the BLEU (BiLingual Evaluation Understudy) score by 28.16% and 26.17%, respectively, on the Tenses dataset. The SPAAE method thus provides a more precise way of perturbing word vectors in theory and demonstrates significant advantages in different style transfer tasks on 7 public datasets. In particular, the proposed method can be used for style transfer of emotional text in the guidance of online public opinion.
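The abstract does not give the exact formula linking proximity distance to the noise standard deviation, so the following is only a rough sketch of the general idea, not the authors' implementation. It assumes Euclidean distance to the nearest neighboring vector as the "proximity distance", zero-mean Gaussian noise, and a hypothetical `scale` factor that maps that distance to a standard deviation; all function and parameter names here are illustrative.

```python
import math
import random


def nearest_neighbor_distances(vectors):
    """Return, for each vector, the Euclidean distance to its closest other vector."""
    distances = []
    for i, v in enumerate(vectors):
        closest = min(math.dist(v, w) for j, w in enumerate(vectors) if j != i)
        distances.append(closest)
    return distances


def perturb(vectors, scale=0.5, rng=None):
    """Add per-vector Gaussian noise whose std depends on local proximity.

    Assumption (not from the paper): sigma_i = scale * d_i, where d_i is the
    nearest-neighbor distance of vector i. Vectors in crowded regions get
    small noise; isolated vectors get larger noise.
    """
    rng = rng or random.Random()
    sigmas = [scale * d for d in nearest_neighbor_distances(vectors)]
    noisy = [
        [x + rng.gauss(0.0, sigma) for x in v]
        for v, sigma in zip(vectors, sigmas)
    ]
    return noisy, sigmas


if __name__ == "__main__":
    # Toy 2-D "word vectors": two close together, one far away.
    embeddings = [[0.0, 0.0], [1.0, 0.0], [5.0, 0.0]]
    noisy, sigmas = perturb(embeddings, scale=0.5, rng=random.Random(0))
    print("per-vector sigma:", sigmas)
    print("perturbed vectors:", noisy)
```

Tying the standard deviation to the nearest-neighbor distance is one simple way to keep a perturbed vector statistically closer to its original position than to a neighbor's, which matches the stated goal of blurring a word's own semantics without disturbing those of other words.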