检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:王兰馨 王卫亚[1] 程鑫[1] WANG Lanxin;WANG Weiya;CHENG Xin(School of Information Engineering,Chang’an University,Xi’an 710064,China)
出 处:《计算机工程与应用》2022年第4期192-197,共6页Computer Engineering and Applications
基 金:国家重点研发计划(2018YFB1600800)。
摘 要:针对单一模态情感识别精度低的问题,提出了基于Bi-LSTM-CNN的语音文本双模态情感识别模型算法。该算法采用带有词嵌入的双向长短时记忆网络(bi-directional long short-term memory network,Bi-LSTM)和卷积神经网络(convolutional neural network,CNN)构成Bi-LSTM-CNN模型,实现文本特征的提取,将其与声学特征融合结果作为联合CNN模型的输入,进行语音情感计算。基于IEMOCAP多模态情感检测数据集的测试结果表明,情感识别准确率达到了69.51%,比单一模态模型提高了至少6个百分点。To address the problem of low accuracy of single-modal emotion recognition,a speech-text bimodal emotion recognition model algorithm based on Bi-LSTM-CNN is proposed.The algorithm uses a Bi-LSTM(bi-directional long short-term memory network)with word embedding and a CNN(convolutional neural network)to form a Bi-LSTM-CNN model for text feature extraction,and the fusion results with acoustic features are used as the input of the joint CNN model for speech emotion computation.The test results based on the IEMOCAP multimodal emotion detection dataset show that the emotion recognition accuracy reaches 69.51%,which is at least 6 percentage points better than the single text modality model.
关 键 词:语音情感识别 卷积神经网络(CNN) 长短时记忆网络(LSTM) 特征融合
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222