基于ASGRU-CNN时空双通道的语音情感识别  被引量:3

Speech Emotion Recognition Based on ASGRU-CNN Spatiotemporal Dual Channel

在线阅读下载全文

作  者:高鹏淇 黄鹤鸣[1,2] GAO Peng-qi;HUANG He-ming(School of Computer Science,Qinghai Normal University,Xining Qinghai 810008,China;State Key Laboratory of Tibetan Intelligent Information Processing and Application,Xining Qinghai 810008,China)

机构地区:[1]青海师范大学计算机学院,青海西宁810008 [2]藏语智能信息处理及应用国家重点实验室,青海西宁810008

出  处:《计算机仿真》2024年第4期180-186,共7页Computer Simulation

基  金:国家自然科学基金(620660039);青海省自然科学基金(2022-ZJ-925)。

摘  要:语音情感识别是实现人机交互的关键,如何提升语音情感识别的准确率以及更有效地提取具有情感代表性的特征是语音情感识别所面临的问题之一。针对以上问题,构建了一种包含空间特征提取模块和时序特征提取模块的双通道时空语音情感识别模型ASGRU-CNN。模型总体框架由两条并行分支组成:第一分支为空间特征提取模块,由三维卷积、二维卷积及池化操作共同构成级联结构;第二分支为时序特征提取模块,由切片循环神经网络内嵌门控循环单元及注意力机制构成。模型以韵律特征及谱特征的融合特征作为输入特征,经过双分支处理后,进入全连接层进行语音情感分类。在CASIA与EMO-DB数据库上进行相关实验,并通过数据扩充增加训练样本,与其它语音情感识别模型实验结果相比,所提出的模型具有较好的鲁棒性和泛化性。Speech emotion recognition is the key to achieving human-computer interaction,and how to improve the accuracy of speech emotion recognition is a major problem for speech emotion recognition.To realize this,a novel speech recognition model called ASGRU-CNN is proposed.The overall framework of the proposed model consists of two parallel branches:the first branch is the spatial feature extraction module consisted of 3D convolution,2D convolution,and pooling operations together to form a cascade structure;The second branch is the temporal feature extraction module consisted of a slicing cycle and an attention mechanism.The model takes the fused features of rhythmic features and spectral features as the input,and enters the fully connected layer for speech emotion classification after the double branching process.The relevant experiments has been conducted on CASIA and EMO-DB databases and on their expanded version.Compared with the experimental results of other speech emotion recognition models,the proposed model has better robustness and generalizability.

关 键 词:语音情感识别 融合特征 切片循环神经网络 注意力机制 数据扩充 

分 类 号:TP183[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象