Affiliation: [1] School of Computer and Information Engineering, Jiangxi Normal University, Nanchang 330022
Source: Journal of Tsinghua University (Science and Technology), 2024, No. 5, pp. 789-800 (12 pages)
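Funding: National Natural Science Foundation of China, Regional Science Fund Project (62266021); Science and Technology Research Project of the Jiangxi Provincial Department of Education (GJJ2200330).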
Abstract: Emotion distribution learning (EDL) uses an emotion distribution to record the degree to which a given sample expresses each emotion, and it has clear advantages in multi-label emotion analysis tasks that involve ambiguity. Emotion distribution label enhancement converts annotated single emotion labels into emotion distributions, which addresses the lack of experimental datasets with annotated emotion distributions for EDL. However, existing emotion distribution label enhancement methods represent emotions with a discrete-space emotion model, so correlation information between emotions is lost and emotion expression is discontinuous. To address these problems, this paper introduces the continuous-dimensional valence-arousal-dominance (VAD) psychological emotion model and proposes a VAD emotion knowledge-based text emotion distribution label enhancement method (VADLE). Based on emotion distances in the prior VAD emotion model, VADLE first generates prior emotion distributions separately for the true emotion label of an English sentence and for the emotion labels of the emotion words in that sentence, and then unifies the two prior emotion distributions through distribution superposition. Comparative experiments on English single-label text emotion datasets show that VADLE outperforms existing emotion distribution label enhancement methods on emotion prediction tasks.
[Objective] Existing emotion distribution label enhancement (EDLE) methods construct the emotion distribution based on a discrete spatial emotion model; hence, expressing the correlation between emotions in a granular manner with continuous values is challenging. Therefore, a valence-arousal-dominance (VAD) emotion knowledge-based text emotion distribution label enhancement (VADLE) method is proposed based on the VAD continuous-dimensional psychological emotion model. Unlike existing EDLE methods, VADLE uses VAD emotion knowledge in a three-dimensional continuous space to model emotion correlations and generate a more nuanced emotion distribution. The VADLE method comprises several steps: (1) Extraction of emotion word information by referencing a lexicon and extracting emotion words from a given sentence. (2) Generation of prior emotion distributions for emotion labels using a local linear-weighting algorithm. The algorithm measures the effect of secondary emotions on the primary emotion based on the distance in VAD emotion space and assigns weights to nearby emotions using a Gaussian kernel. (3) Construction of the sentence-level emotion distribution by combining the prior emotion distributions of the sentence and of its emotion words. Furthermore, this study uses a joint loss to train a multitask emotion distribution learning model based on the robustly optimized bidirectional encoder representations from transformers pretraining approach (RoBERTa) pretrained language model. This approach simultaneously optimizes the prediction of emotion distribution and classification. The sentence text features extracted using the RoBERTa pretrained model are passed through a fully connected layer to generate a probability distribution over all emotion labels. Based on this probability distribution, the model uses the Kullback-Leibler (KL) loss to measure the distance between the predicted and actual distributions, optimizing the emotion distribution prediction task. Simultaneously, cross-entropy loss is employed for optimizing the emotion recognitio
Keywords: emotion distribution label enhancement; emotion distribution learning; VAD emotion space; emotion lexicon
Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]
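
The label enhancement steps summarized in the abstract (Gaussian-kernel weights over VAD distance, superposition of the sentence-label prior with the emotion-word priors, and a joint KL plus cross-entropy loss) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the VAD coordinates, the kernel width `sigma`, the mixing weight `word_weight`, the loss weight `alpha`, and all function names are hypothetical choices made only for illustration.

```python
import math

# Hypothetical VAD coordinates in [0, 1]; illustrative values only,
# not taken from the paper or from any published lexicon.
VAD = {
    "anger":   (0.17, 0.87, 0.66),
    "fear":    (0.07, 0.84, 0.29),
    "joy":     (0.98, 0.82, 0.79),
    "sadness": (0.05, 0.29, 0.16),
}
EMOTIONS = list(VAD)


def prior_distribution(label, sigma=0.5):
    """Gaussian-kernel weights over squared VAD distance to `label`, normalized to sum to 1."""
    center = VAD[label]
    weights = []
    for emotion in EMOTIONS:
        d2 = sum((a - b) ** 2 for a, b in zip(center, VAD[emotion]))
        weights.append(math.exp(-d2 / (2 * sigma ** 2)))
    total = sum(weights)
    return [w / total for w in weights]


def enhance(sentence_label, word_labels, word_weight=0.3, sigma=0.5):
    """Superimpose the sentence-label prior and the averaged emotion-word priors.

    The convex mixing with `word_weight` is an assumption; the abstract only
    says the two priors are unified by distribution superposition.
    """
    dist = prior_distribution(sentence_label, sigma)
    if word_labels:
        word_priors = [prior_distribution(w, sigma) for w in word_labels]
        avg = [sum(col) / len(word_priors) for col in zip(*word_priors)]
        dist = [(1 - word_weight) * s + word_weight * w for s, w in zip(dist, avg)]
    total = sum(dist)
    return [p / total for p in dist]


def kl_divergence(target, predicted, eps=1e-12):
    """KL(target || predicted) for two discrete distributions."""
    return sum(t * math.log((t + eps) / (p + eps)) for t, p in zip(target, predicted))


def joint_loss(target_dist, predicted, true_index, alpha=0.5, eps=1e-12):
    """Weighted sum of KL loss (distribution task) and cross-entropy (classification task)."""
    kl = kl_divergence(target_dist, predicted, eps)
    ce = -math.log(predicted[true_index] + eps)
    return alpha * kl + (1 - alpha) * ce


if __name__ == "__main__":
    target = enhance("anger", ["fear"])
    uniform = [1 / len(EMOTIONS)] * len(EMOTIONS)
    print(dict(zip(EMOTIONS, target)))
    print(joint_loss(target, uniform, EMOTIONS.index("anger")))
```

In this sketch, emotions that lie closer to the annotated label in VAD space receive more probability mass, which is the property the abstract attributes to the continuous-space model. In a real training setup the `predicted` distribution would come from the classifier head on top of the pretrained encoder rather than a fixed list.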