检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:应武[1]
出 处:《电子测量与仪器学报》2007年第3期48-51,共4页Journal of Electronic Measurement and Instrumentation
摘 要:说话人识别从本质上看是从语音信息中提取说话人特征,并通过一定的方式进行模式识别的过程。辨别说话人的方法很多,本文认为先从语音中提出元音,再通过计算元音的MFCC(美尔频标倒谱系数)特征参数,并与DTW(动态时间规整)结合进行多人多单词试验,实验证明这种识别方式能提高识别率5%左右——从原字平均识别率为83%提高到取元音后平均识别率为88%。In essence, speaker recognition is a process of extracting speaker's features from speech information and carrying out pattern recognition through certain method. There are many ways to identify the speaker. This paper proposes that we first extract vowels from the speech sounds, calculate MFCC (Mel Frequency Cepstral Coefficient) parameters of the vowels, which are combined with DTW (Dynamic Time Warping), and then carry out multi person, multi word trial. Experiment result shows that the proposed recognition method can improve the recognition rate about 5%, that is, after extracting vowels the average word recognition rate increases from 83% to 88%.
分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.8