检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:王泳 苏卓艺 朱铮宇 WANG Yong;SU Zhuoyi;ZHU Zhengyu(School of Cyberspace Security,Guangdong Polytechnic Normal University,Guangzhou 510665,China;Audio,Speech and Vision Processing Laboratory,South China University of Technology,Guangzhou 510641,China)
机构地区:[1]广东技术师范大学网络空间安全学院,广东广州510665 [2]华南理工大学音频、语音与视觉处理实验室,广东广州510641
出 处:《西安电子科技大学学报》2021年第4期168-175,共8页Journal of Xidian University
基 金:国家自然科学基金(61672173);广东省普通高校青年创新人才类项目(2018KQNCX140)。
摘 要:语音变换欺骗是指利用语音处理算法改变原说话人的语音特征,从而导致说话人识别系统产生极高的错误拒绝率,达到隐藏说话人身份的目的。其实现成本低廉,并且已集成在众多的音频处理工具中,对社会安全带来严重威胁。然而,目前对于变换欺骗的检测研究仍然不足。为此,提出了一种基于密集卷积神经网络的语音变换欺骗检测方法,以区分欺骗语音和真实语音。该网络总共包含135层的网络层,通过最大化短路径地连接强化数据传输,可同时利用深层和浅层的边缘特征进行分类,抑制退化现象,从而进一步提高检测的准确率。实验结果表明,该算法对不同欺骗因子下的欺骗语音的检测准确率超过了98%。Voice transformation(VT)spoofing refers to the operations for hiding the speaker’s identity which change a speaker’s acoustic features by speech processing algorithms and result in extremely high false reject rates for automatic speaker recognition(ASR)systems.VT spoofing is implemented with a low cost and has been integrated in many audio editing tools,thus presenting serious threats to social security.However,the research on VT spoofing detection is still insufficient.Hence,in this paper we propose a dense convolutional neural network(DenseNet)based VT detection method for distinguishing spoofed voices and genuine ones.The proposed network consists of 135 layers in total.By maximizing the skip-layers,the data transmission can be enhanced,and both the deep and shallow edge features can be used for classification,so as to alleviate the degradation phenomenon and further to improve detection accuracy.Experimental results show that the detection accuracy with various spoofing factors is over 98%.
分 类 号:TP39[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.40