Speaker Recognition Based on DNN and Pitch Period (cited by: 5)

Authors: ZHANG Xue-xiang (张学祥); LEI Ju-yang (雷菊阳) [1] (School of Mechanical and Automobile Engineering, Shanghai University of Engineering Science, Shanghai 201620, China)

Affiliation: [1] School of Mechanical and Automobile Engineering, Shanghai University of Engineering Science

Source: Computer and Modernization (《计算机与现代化》), 2020, No. 1, pp. 122-126 (5 pages)

Abstract: Traditional speaker recognition frameworks are mostly built on the Gaussian mixture model (GMM), but this shallow learning model cannot effectively represent the high-order correlations among data features, so recognition performance is poor. This paper proposes a speaker recognition method that combines a deep neural network (DNN) with the pitch period (PP). The main recognition path takes log Mel filter bank features as the DNN input and extracts the speaker's voiceprint features by training the DNN model. To counter the subjectivity of the manually set decision threshold in the DNN model, dynamic time warping (DTW) is used to match the speaker's pitch period as an auxiliary recognition step. Experimental results show that this dual recognition method achieves an equal error rate (EER) of 1.6%, which is 1.2% and 2.4% lower than the EER of the DNN system and the EM-GMM system, respectively, and that the method remains robust in noisy environments.
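The auxiliary step named in the abstract, matching pitch-period sequences with dynamic time warping, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `dtw_distance`, the absolute-difference local cost, the unconstrained step pattern, and the sample pitch-period values (in milliseconds) are all assumptions chosen for illustration, since the abstract does not give these details.

```python
import math


def dtw_distance(a, b):
    """Accumulated DTW cost between two 1-D sequences.

    Assumptions (not taken from the paper): absolute difference as the
    local cost, and the classic step pattern (insert, delete, match)
    with no window constraint.
    """
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")

    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# Hypothetical pitch-period contours in milliseconds (illustrative only).
enrolled = [8.0, 8.1, 8.3, 8.2, 8.0]
same_speaker_slower = [8.0, 8.0, 8.1, 8.3, 8.3, 8.2, 8.0]  # time-stretched copy
other_speaker = [5.0, 5.1, 5.2, 5.1, 5.0]

d_same = dtw_distance(enrolled, same_speaker_slower)
d_other = dtw_distance(enrolled, other_speaker)
print(d_same, d_other)
```

Because DTW lets one sequence repeat frames to line up with the other, the time-stretched contour aligns with the enrolled one at zero cost, while the contour with a different pitch range accumulates a large cost. In a dual scheme like the one the abstract describes, such a distance could serve as a second check alongside the DNN score; how the two decisions are combined is not stated in the abstract.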

Keywords: deep neural network; pitch period; speaker recognition; dynamic time warping; dual recognition

Classification: TP391 [Automation and Computer Technology - Computer Application Technology]
