Authors: WANG Lan-lan (汪兰兰); CAI Chang-xin (蔡昌新)[1] (Electronic Information College, Yangtze University, Jingzhou 434023, China)
Source: Science Technology and Engineering (《科学技术与工程》), 2022, No. 26, pp. 11524-11532 (9 pages)
Funding: National Natural Science Foundation of China (62173049).
Abstract: When common speech feature extraction methods are applied in real environments, the extracted features contain noise interference, which leads to ambiguous classification in emotion recognition. To address this, a new speech feature extraction method, the linear prediction pitch frequency feature extraction method, is proposed. It builds a model from linear prediction coefficients and uses that model to eliminate vocal tract response information and suppress noise interference. Because this method alone does not sufficiently reduce classification ambiguity, LPC Mel cepstral coefficients (LPCMCC), which share the same model, are used to improve the linear prediction pitch frequency feature. Comparative speech emotion recognition experiments were designed using linear prediction pitch frequency, its improved feature, and LPCMCC, each combined with support vector machines (SVM). The experiments indicate that the average accuracy of the improved feature in emotion recognition is up to 84%, which is 22% and 14% higher than that of linear prediction pitch frequency and LPCMCC, respectively. To test the classification performance of the improved feature in a real environment, a speech emotion recognition system based on MATLAB GUI technology was designed on top of it. Experimental results show that the improved feature effectively reduces classification ambiguity in emotion recognition, and that the speech emotion system based on it can broadly recognize the speaker's emotion in the presence of noise interference.
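The abstract's core idea, using a linear prediction model to remove the vocal tract response before estimating pitch, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract gives no model order, frame length, or pitch search range, so `order=4`, the 60-400 Hz range, the autocorrelation peak picker, and the synthetic test frame (`synth_voiced`) are all assumptions chosen for illustration. The sketch fits LPC coefficients with the Levinson-Durbin recursion, inverse-filters the frame to obtain the prediction residual, and takes the strongest autocorrelation lag of that residual as the pitch period.

```python
import math

def autocorr(x, max_lag):
    """Biased autocorrelation r[0..max_lag] of the sequence x."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve the LPC normal equations. Returns a[0..order] with a[0] = 1,
    so the prediction error is e[n] = sum_k a[k] * x[n - k]."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0:  # silent or degenerate frame: stop early
            break
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return a

def lpc_residual(frame, order):
    """Inverse-filter the frame with its own LPC model; the residual
    approximates the excitation with the vocal tract response removed."""
    a = levinson_durbin(autocorr(frame, order), order)
    return [sum(a[k] * frame[n - k] for k in range(order + 1) if n - k >= 0)
            for n in range(len(frame))]

def estimate_pitch(frame, fs, order=4, f_min=60.0, f_max=400.0):
    """Pitch in Hz from the strongest residual autocorrelation lag.
    order, f_min and f_max are illustrative assumptions."""
    res = lpc_residual(frame, order)
    lag_lo, lag_hi = int(fs / f_max), int(fs / f_min)
    r = autocorr(res, lag_hi)
    best_lag = max(range(lag_lo, lag_hi + 1), key=lambda k: r[k])
    return fs / best_lag

def synth_voiced(fs=8000, f0=100.0, n=400, radius=0.9, fc=500.0):
    """Hypothetical test signal: an impulse train at f0 passed through a
    two-pole resonator standing in for a single vocal tract formant."""
    period = int(round(fs / f0))
    theta = 2 * math.pi * fc / fs
    b1, b2 = 2 * radius * math.cos(theta), -radius * radius
    x = []
    for i in range(n):
        pulse = 1.0 if i % period == 0 else 0.0
        y1 = x[i - 1] if i >= 1 else 0.0
        y2 = x[i - 2] if i >= 2 else 0.0
        x.append(pulse + b1 * y1 + b2 * y2)
    return x

fs = 8000
f0 = estimate_pitch(synth_voiced(fs=fs, f0=100.0), fs)
```

On the synthetic frame the estimate should land close to the 100 Hz excitation rate. Real speech would additionally need framing, windowing, and a voiced/unvoiced decision, none of which the abstract details.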
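Keywords: noise interference; linear prediction pitch frequency; LPCMCC; SVM; improved feature; MATLAB GUI technology
Classification code: TP391.9 [Automation and Computer Technology: Computer Application Technology]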