基于K-L散度模型聚类的快速说话人辨识方法被引量：5

K-L Divergence Based Model Clustering Method for Fast Speaker Identification

机构地区：[1]哈尔滨工业大学计算机科学与技术学院,哈尔滨150001 [2]青岛科技大学信息科学与技术学院,青岛266035

出　　处：《模式识别与人工智能》2010年第6期856-861,共6页Pattern Recognition and Artificial Intelligence

基　　金：国家973计划项目(No.2007CB311100);国家863计划重点项目(No.2006AA010103)资助

摘　　要：在网络应用环境下,需要处理的音频数据和注册说话人急剧增加,传统说话人辨识方法难以满足实时性要求.文中提出采用K-L散度的说话人模型聚类方法,从而构造一个分级辨识模型,提高辨识效率.研究利用类辨识信息估计置信度的方法,可尽早有效排除集外说话人.实验结果显示,文中方法可使辨识速度平均提高3.2倍,而闭集辨识错误率平均只有0.9%的增加.采用类辨识置信度进一步提高开集辨识速度,并且在保持集内错误率不变的情况下,使集外错误率相对下降5.1%.With the increase of enrolled speakers and audio data to be recognized,the conventional speaker identification methods can not meet the real-time demand for internet application environment.A K-L divergence based speaker model clustering method is proposed to construct a hierarchical identification system,which remarkably improves the recognition efficiency.Moreover,the confidence measure using class-level identification information is also investigated to effectively exclude out-of-set speaker as early as possible.The experimental results show the proposed method averagely increases the identification speed by 3.2 times while the error rate of closed-set identification only increases about 0.9% compared with the conventional method.The open-set identification can be speeded up by using class-level confidence measure and a relatively 5.1% error rate reduction can be achieved on out-of-set speakers identification while keeping the identification performance of in-set speakers unchanged.

关键词：K-L散度模型聚类置信度说话人辨识网络环境

分类号：TN912.34[电子电信—通信与信息系统]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于K-L散度模型聚类的快速说话人辨识方法被引量：5

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于K-L散度模型聚类的快速说话人辨识方法 被引量：5

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于K-L散度模型聚类的快速说话人辨识方法被引量：5