检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:LIN Wei YANG Lili XU Boling
出 处:《Progress in Natural Science:Materials International》2006年第10期1072-1078,共7页自然科学进展·国际材料(英文版)
基 金:Supported by National Natural Science Foundation of China (Grant Nos. 60272037 and 60340420325)
摘 要:In this paper, the frequency characteristics of Chinese whispered speech were investigated by a filter bank analysis. It was shown that the first and the third formants were more important than the other formants in the speaker identification of Chinese whispered speech. The experiment showed that the 800-1200 Hz and 2800-3200 Hz ranges were the most significant frequency ranges in discriminating the speaker. Based on this result, a new feature scale named whisper sensitive scale (WSS) was proposed to replace the common scale, Mel scale, and to extract the cepstral coefficient from whispered speech signal. Furthermore, a speaker identification system in whispered speech was presented based on the modified Hidden Markov Models integrating advantages of WSCC (the whisper sensitive cepstral coefficient) and LPCC. And the new system performed better in solving the problem of speaker identification of Chinese whispered speech than the traditional method.在这篇论文,中国低声说的讲话的频率特征被过滤器银行分析调查。这被显示出第一并且第三共振峰比在汉语的说话者鉴定的另外的共振峰低语是更重要的讲话。实验证明 800-1200 Hz 和 2800-3200 Hz 范围是在区别说话者的最重要的频率范围。基于这结果,新特征规模把耳语称为敏感规模(WSS ) 被建议代替普通规模,并且从低声说的讲话信号提取 cepstral 系数, Mel 规模。而且,在低声说的讲话的一个扬声器鉴定系统基于集成 WSCC 的优点的修改隐藏的 Markov 模型被介绍(耳语敏感 cepstral 系数) 并且 LPCC。并且新系统在比传统的方法解决中国低声说的讲话的说话者鉴定的问题更好表现了。
关 键 词:speaker identification Chinese whispered speech whisper sensitive scale.
分 类 号:TN912[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222