基于语谱图和卷积神经网络的语音情感识别  被引量:8

Speech emotion recognition based on spectrogram and CNNs

在线阅读下载全文

作  者:田熙燕[1] 徐君鹏[1] 杜留锋[2] 

机构地区:[1]河南科技学院信息工程学院,河南新乡453002 [2]南京邮电大学通信与信息工程学院,江苏南京210003

出  处:《河南科技学院学报(自然科学版)》2017年第2期62-68,共7页Journal of Henan Institute of Science and Technology(Natural Science Edition)

基  金:国家青年科学基金资助项目(61501260);河南省教育厅重点项目(5201029140111)

摘  要:针对语音情感识别的特征提取和分类模型构建问题,首先提出了一种基于语谱图的特征提取方法,将语谱图进行归一灰度化后,利用Gabor滤波器进行纹理特征提取,并采用主成分分析(principal component analysis,PCA)对特征矩阵进行降维;然后分析了卷积神经网络(convolutional neural networks,CNNs)并把其作为情感识别分类器;最后在Emo DB和CASIA库进行了不同的比对实验.实验结果取得了较高情感识别率,表明了所提特征提取方法的有效性以及CNNs用作情感分类的可行性.To solve the problem of feature extraction and classification in speech emotion recognition,first a feature extraction method based on spectrogram was proposed,the method uses Gabor filter to extract the texture feature from the normalized spectrum gray image,and reduce these feature matrix dimension using the PCA.Then the convolutional neural networks was used as an emotion recognition classifier.Finally the performance of this system was assessed by computer simulations and a higher recognition rates were achieved respectively on the Emo DB and CASIA database through comparative experiment in different conditions,the results showed that the method proposed in this paper is effective and the CNNs can be used successfully for emotion recognition as a classifier.

关 键 词:语音情感识别 语谱图 GABOR滤波器 PCA CNNS 

分 类 号:TN912.34[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象