Authors: LI Haoxuan (李昊璇)[1], SHI Honghui (师宏慧), QIAO Xiaoyan (乔晓艳)[1] (College of Physics and Electronics Engineering, Shanxi University, Taiyuan 030006, China)
Affiliation: [1] College of Physics and Electronics Engineering, Shanxi University, Taiyuan 030006, Shanxi, China
Source: Journal of Test and Measurement Technology (《测试技术学报》), 2017, No. 1, pp. 8-16 (9 pages)
Funding: Shanxi Province Research Project for Returned Overseas Scholars (2014-010); Natural Science Foundation of Shanxi Province (2013011016-2)
Abstract: To improve the accuracy of speech emotion recognition, this paper studies two new spectral features of the glottal waveform, the parabolic spectral parameter (PSP) and the harmonic richness factor (HRF), and applies them to speech emotion recognition. First, for speech signals covering six emotions (anger, fear, happiness, neutral, sadness, and surprise), the speaking rate is extracted together with the maximum, minimum, range, and mean of the short-time energy, pitch frequency, first three formants, and 12th-order Mel-frequency cepstral coefficients (MFCC); these conventional features form a feature vector whose dimensionality is reduced by principal component analysis (PCA). Next, PSP and HRF are extracted from the glottal waveform, and their ability to express emotion is analyzed. Finally, a deep-learning stacked autoencoder classifies both the conventional features alone and the conventional features fused with the glottal spectral features. The results show that fusing the glottal waveform spectral features yields a higher recognition rate.
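The conventional-feature step described in the abstract (four statistics per frame-level contour, concatenated with the speaking rate) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function names `functionals` and `build_feature_vector`, the contour names, and the toy values are all hypothetical, and the PCA, glottal-feature, and stacked-autoencoder stages are not shown.

```python
from statistics import fmean


def functionals(contour):
    """Return [max, min, range, mean] of one frame-level contour."""
    hi, lo = max(contour), min(contour)
    return [hi, lo, hi - lo, fmean(contour)]


def build_feature_vector(speaking_rate, contours):
    """Concatenate the speaking rate with the four statistics of each contour.

    `contours` maps a (hypothetical) contour name to its per-frame values.
    """
    vector = [speaking_rate]
    # Iterate in sorted name order so dimensions line up across utterances.
    for name in sorted(contours):
        vector.extend(functionals(contours[name]))
    return vector


# Toy values for illustration only; these are not data from the paper.
example = build_feature_vector(
    4.0,
    {"energy": [0.5, 1.5, 1.0], "pitch_hz": [200.0, 220.0, 210.0]},
)
print(example)
```

If one assumes a single contour each for short-time energy and pitch, three formant contours, and 12 MFCC contours, this layout would give 1 + 4 × 17 = 69 dimensions before PCA; the abstract itself does not state the dimensionality.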
Keywords: glottal waveform signal; parabolic spectral parameter; harmonic richness factor; stacked autoencoder; speech emotion recognition
Classification code: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]