检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:张雅欣 张连海[2] ZHANG Yaxin;ZHANG Lianhai(Zhongyuan Network Security Research Institute, Zhengzhou University, Zhengzhou 450001, China;Information Engineering University, Zhengzhou 450001, China)
机构地区:[1]郑州大学中原网络安全研究院,河南郑州450001 [2]信息工程大学,河南郑州450001
出 处:《信息工程大学学报》2020年第6期664-669,共6页Journal of Information Engineering University
基 金:国家自然科学基金资助项目(61673395)。
摘 要:基于SVTTS架构的语音克隆系统采用d-vector描述说话人编码特征,由于该特征提取过程中没有考虑到整段句子的语音信息,从而影响了克隆语音的相似度。针对此问题,提出一种基于x-vector说话人特征的语音克隆方法。该方法采用x-vector作为表征目标说话人的嵌入向量,拼接到合成器中,并通过声码器克隆出目标说话人的语音。实验结果表明采用x-vector的方法提取嵌入向量的相似度更高;与传统方法相比,该方法克隆语音的自然度和相似性分别提升了0.32和0.14。The voice cloning system based on the speaker verification to multi-speaker text-to-speech(SVTTS)architecture adopts the speaker encoding feature described by d-vector.The speech information of the entire sentence is not considered in the feature extraction process,which affects the similarity of the cloned voice.To address this problem,this paper proposes a method of voice cloning based on x-vector speaker characteristics.This method uses x-vector as the embedding vector characterizing the target speaker,splices it into the synthesizer,and clones the target speaker’s voice through the vocoder.The experimental results show that the x-vector method is used to extract the embedding vector with higher similarity.Compared with the traditional method,the naturalness and similarity of the cloned voice of the proposed method are improved by 0.32 and 0.14,respectively.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.188.92.213