Speech Emotion Recognition Combined with the Spectrum Feature of Glottal Waveform

Original title: 融合声门波信号频谱特征的语音情感识别

Authors: LI Haoxuan (李昊璇)[1], SHI Honghui (师宏慧), QIAO Xiaoyan (乔晓艳)[1]

Affiliation: [1] College of Physics and Electronics Engineering, Shanxi University, Taiyuan 030006, Shanxi, China

Source: Journal of Test and Measurement Technology (《测试技术学报》), 2017, No. 1, pp. 8-16 (9 pages)

Funding: Shanxi Province Research Funding Program for Returned Overseas Scholars (2014-010); Natural Science Foundation of Shanxi Province (2013011016-2)

Abstract: To improve the accuracy of speech emotion recognition, this paper studies two new spectral features of the glottal waveform, the parabolic spectral parameter (PSP) and the harmonic richness factor (HRF), and applies them to speech emotion recognition. First, for speech signals covering six emotions (anger, fear, happiness, neutral, sadness, and surprise), commonly used features are extracted: the speaking rate, together with the maximum, minimum, range, and mean of the short-time energy, the pitch frequency, the first three formants, and the 12th-order Mel-frequency cepstral coefficients (MFCC). These form a single feature vector, whose dimensionality is reduced by principal component analysis (PCA). Next, PSP and HRF are extracted from the glottal waveform, and their ability to express emotion is analyzed. Finally, a stacked autoencoder, a deep learning algorithm, is used to classify both the commonly used features alone and the feature set fused with the glottal waveform spectral features. The results show that the recognition rate is higher after fusing the glottal waveform spectral features.

Keywords: glottal waveform; parabolic spectral parameter; harmonic richness factor; stacked autoencoder; speech emotion recognition

Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]
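
The feature-construction step summarized in the abstract (per-utterance maximum, minimum, range, and mean of frame-level contours, followed by PCA) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `utterance_functionals` and `pca_reduce`, the synthetic random data, the choice of 17 contours (1 energy + 1 pitch + 3 formants + 12 MFCC), and the 10 retained components are all assumptions made for illustration. The speaking rate, the glottal features PSP and HRF, and the stacked autoencoder are omitted.

```python
import numpy as np


def utterance_functionals(frames: np.ndarray) -> np.ndarray:
    """Return max, min, range and mean of each frame-level contour.

    `frames` has shape (n_frames, n_contours); each column is one contour
    (for example short-time energy or pitch) sampled once per frame.
    """
    mx = frames.max(axis=0)
    mn = frames.min(axis=0)
    return np.concatenate([mx, mn, mx - mn, frames.mean(axis=0)])


def pca_reduce(X: np.ndarray, n_components: int) -> np.ndarray:
    """Project rows of X onto the top principal components (via SVD)."""
    Xc = X - X.mean(axis=0)                      # center each feature
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:n_components].T              # scores in the reduced space


# Synthetic stand-in for real speech: 20 utterances, 50 frames each,
# 17 contours per frame (assumed: energy, pitch, 3 formants, 12 MFCC).
rng = np.random.default_rng(0)
utterances = [rng.normal(size=(50, 17)) for _ in range(20)]

X = np.stack([utterance_functionals(u) for u in utterances])  # (20, 68)
Z = pca_reduce(X, n_components=10)                            # (20, 10)
```

In a pipeline like the one the abstract describes, the reduced vectors in `Z` would then be concatenated with the glottal features before classification; how the paper orders these steps in detail is not stated in this record.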
