检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:周玥媛 孔钦[1] ZHOU Yue-yuan;KONG Qin(Nanjing University Jinling College,Nanjing 210089,China)
出 处:《计算机技术与发展》2020年第5期76-83,共8页Computer Technology and Development
基 金:全国高校计算机基础教学研究与改革课题(AFCEC-2016-18);南京大学金陵学院重点教改项目(0010521816,0010521806)。
摘 要:声纹识别技术实现的关键点在于从语音信号中提取语音特征参数,此参数具备表征说话人特征的能力。基于GMM-UBM模型,通过Matlab实现文本无关的声纹识别系统,对主流静态特征参数MFCC、LPCC、LPC以及结合动态参数的MFCC,从说话人确认与说话人辨认两种应用角度进行性能比较。在取不同特征参数阶数、不同高斯混合度和使用不同时长的训练语音与测试语音的情况下,从理论识别效果、实际识别效果、识别所用时长、识别时长占比等多个方面进行了分析与研究。最终结果表明:在GMM-UBM模式识别方法下,三种静态特征参数中MFCC绝大多数时候具有最佳识别效果,同时其系统识别耗时最长;识别率与语音特征参数的阶数之间并非单调上升关系。静态参数在结合较佳阶数的动态参数时能够提升识别效果;增加动态参数阶数与提高系统识别效果之间无必然联系。The key of the voiceprint recognition technology is to extract speech feature parameters from speech signals,which have the capacity of representing speakers’ features. Based on Gaussian mixture model-universal background model,the mainstream static feature parameters which are MFCC,LPCC,LPC and MFCC combining with dynamic parameters are compared in terms of speaker identification and speaker verification according to text-independent voiceprint recognition systems realized by Matlab. In the case of taking different feature parameter order,different Gaussian mixture degree and using different time length of training and testing speech,we analyze and research in several aspects such as theoretical recognition effect,actual recognition effect,time consumption for recognizing and its ratio of components etc. The final results show that in the pattern recognition method GMM-UBM,MFCC do have best recognition effect and largest time consumption for system recognizing in the most of time. There is no monotonically ascending relation between the recognition rate and the order of speech feature parameters. MFCC combining with better order of dynamic parameters can improve recognition effect. Increasing the order of dynamic parameters is not definitely associate with improving recognition effect.
关 键 词:GMM-UBM 声纹识别 特征参数性能 说话人确认 说话人辨认
分 类 号:TP301[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:52.14.189.148