结合Bi-LSTM-CNN的语音文本双模态情感识别模型被引量：18

Bimodal Emotion Recognition Model for Speech-Text Based on Bi-LSTM-CNN

作　　者：王兰馨王卫亚[1] 程鑫[1] WANG Lanxin;WANG Weiya;CHENG Xin(School of Information Engineering,Chang’an University,Xi’an 710064,China)

机构地区：[1]长安大学信息工程学院,西安710064

出　　处：《计算机工程与应用》2022年第4期192-197,共6页Computer Engineering and Applications

基　　金：国家重点研发计划(2018YFB1600800)。

摘　　要：针对单一模态情感识别精度低的问题,提出了基于Bi-LSTM-CNN的语音文本双模态情感识别模型算法。该算法采用带有词嵌入的双向长短时记忆网络(bi-directional long short-term memory network,Bi-LSTM)和卷积神经网络(convolutional neural network,CNN)构成Bi-LSTM-CNN模型,实现文本特征的提取,将其与声学特征融合结果作为联合CNN模型的输入,进行语音情感计算。基于IEMOCAP多模态情感检测数据集的测试结果表明,情感识别准确率达到了69.51%,比单一模态模型提高了至少6个百分点。To address the problem of low accuracy of single-modal emotion recognition,a speech-text bimodal emotion recognition model algorithm based on Bi-LSTM-CNN is proposed.The algorithm uses a Bi-LSTM(bi-directional long short-term memory network)with word embedding and a CNN(convolutional neural network)to form a Bi-LSTM-CNN model for text feature extraction,and the fusion results with acoustic features are used as the input of the joint CNN model for speech emotion computation.The test results based on the IEMOCAP multimodal emotion detection dataset show that the emotion recognition accuracy reaches 69.51%,which is at least 6 percentage points better than the single text modality model.

关键词：语音情感识别卷积神经网络(CNN) 长短时记忆网络(LSTM) 特征融合

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

结合Bi-LSTM-CNN的语音文本双模态情感识别模型被引量：18

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

结合Bi-LSTM-CNN的语音文本双模态情感识别模型 被引量：18

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

结合Bi-LSTM-CNN的语音文本双模态情感识别模型被引量：18