Speech Emotion Recognition System Based on Improved Linear Prediction Pitch Frequency

Cited by: 5

Authors: WANG Lan-lan; CAI Chang-xin [1]

Affiliation: [1] Electronic Information College, Yangtze University, Jingzhou 434023, China

Source: Science Technology and Engineering (《科学技术与工程》), 2022, No. 26, pp. 11524-11532 (9 pages)

Funding: National Natural Science Foundation of China (62173049).

Abstract: When common speech feature extraction methods are applied in real environments, the extracted features contain noise interference, which in turn causes classification ambiguity in emotion recognition. To address this, a new speech feature extraction method is proposed: linear prediction pitch frequency feature extraction. It builds a model from the linear prediction coefficients (LPC) and uses that model to eliminate the vocal tract response information and suppress noise interference. Because this method alone did not sufficiently reduce classification ambiguity, LPC Mel cepstral coefficients (LPCMCC), which are derived from the same model, are used to improve the linear prediction pitch frequency. Comparative speech emotion recognition experiments were designed using the linear prediction pitch frequency, its improved feature, and LPCMCC, each combined with support vector machines (SVM). The experiments show that the improved feature reaches an average accuracy of up to 84% in emotion recognition, which is 22% and 14% higher than the linear prediction pitch frequency and LPCMCC, respectively. To test the classification performance of the improved feature in a real environment, a speech emotion recognition system based on MATLAB GUI technology was designed on top of it. The experimental results show that the improved feature effectively reduces classification ambiguity in emotion recognition, and that the speech emotion recognition system built on it can broadly recognize the speaker's emotion under noise interference.
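The abstract describes the core idea only at a high level: an LPC model is fitted and then used to remove the vocal tract response before the pitch frequency is measured. The following is a minimal sketch of that general idea, not the authors' implementation. It assumes a Levinson-Durbin LPC fit, inverse filtering to obtain the residual, and autocorrelation peak picking on the residual; all function names, the LPC order, and the 60-400 Hz search range are illustrative assumptions that do not come from the paper. It is written in plain Python (standard library only) rather than MATLAB so that it is self-contained.

```python
def autocorr(x, max_lag):
    """Autocorrelation r[0..max_lag] of the sequence x (unnormalized)."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]


def lpc(frame, order):
    """Prediction-error filter a[0..order] (a[0] = 1) via Levinson-Durbin."""
    r = autocorr(frame, order)
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # silent or degenerate frame: stop early
            break
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a


def lpc_residual(frame, a):
    """Inverse-filter the frame with A(z) to strip the vocal tract envelope."""
    p = len(a) - 1
    return [
        sum(a[j] * frame[n - j] for j in range(min(p, n) + 1))
        for n in range(len(frame))
    ]


def pitch_from_residual(residual, fs, fmin=60.0, fmax=400.0):
    """Pitch estimate (Hz) from the strongest residual autocorrelation lag.

    The search range fmin..fmax is an assumed typical speech range.
    """
    lo, hi = int(fs / fmax), int(fs / fmin)
    r = autocorr(residual, hi)
    best_lag = max(range(lo, hi + 1), key=lambda k: r[k])
    return fs / best_lag


# Synthetic voiced frame: a 100 Hz impulse train through a two-pole resonator.
fs = 8000
excitation = [1.0 if n % 80 == 0 else 0.0 for n in range(480)]
frame = []
for n, e in enumerate(excitation):
    s1 = frame[n - 1] if n >= 1 else 0.0
    s2 = frame[n - 2] if n >= 2 else 0.0
    frame.append(e + 1.3 * s1 - 0.8 * s2)

a = lpc(frame, order=8)
f0 = pitch_from_residual(lpc_residual(frame, a), fs)
print("estimated pitch (Hz):", f0)
```

Working on the residual rather than the raw waveform is the point of the sketch: the resonances of the vocal tract are largely removed by the inverse filter, so the autocorrelation peak reflects the excitation period instead of a formant. How the paper combines this feature with LPCMCC and feeds it to the SVM is not specified in the abstract and is not modeled here.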

Keywords: noise interference; linear prediction pitch frequency; LPCMCC; SVM; improved feature; MATLAB GUI technology

Classification code: TP391.9 [Automation and Computer Technology: Computer Application Technology]
