基于内容的x⁃vector文本相关SV研究  被引量:1

Content⁃Dependent x⁃vector for Text⁃Dependent Speaker Verification

在线阅读下载全文

作  者:陈亚峰[1] 郭武[1] CHEN Yafeng;GUO Wu(University of Science and Technology of China,National Engineering Laboratory for Speech and Language Information Processing,Hefei,230027,China)

机构地区:[1]中国科学技术大学语音及语言信息处理国家工程实验室,合肥230027

出  处:《数据采集与处理》2020年第5期850-857,共8页Journal of Data Acquisition and Processing

摘  要:x‑vector系统将一段不定长的语音通过神经网络映射成固定维的矢量来表征说话人信息,该系统在文本无关的说话人确认(Speaker verification,SV)任务中取得了优异的性能。本文将其应用到文本相关的SV任务中,在x‑vector模型选择上,采用残差神经网络以获得更有区分性的x‑vector;在包含多字符的语句中,对每个字训练一个残差神经网络;在提取过程中,每一字单独提取一个x‑vector并单独进行说话人判决,最后将多个判决得分进行融合后给出最终的识别结果。实验是在数据库RSR2015 PartⅢ上进行的,提出的方法在男性和女性测试集上等错误率分别有15.34%、19.7%的下降。The x-vector system maps a variable-length speech to a fixed-dimensional speaker embeddings via neural networks,and performs well in text-independent speaker verification.Here,it is applied to the textdependent speaker verification and different x-vectors are extracted according to different contents in one sentence.In model selection,deep residual network(DRN)is used to obtain more discriminative x-vector.For a sentence with multiple words,word-dependent DRNs are trained to extract word-dependent x-vectors,which are separately fed to different backend classifiers.Finally,multiple scores are fused to obtain the final verification results.Experiments on PartⅢof the RSR2015 dataset show that the proposed method can achieve equal error rate(EER)reduction of 15.34%and 19.7%for male and female,respectively.

关 键 词:说话人确认 文本相关 深度神经网络 声纹特征 

分 类 号:TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象