Supported in part by the National Natural Science Foundation of China (Grant Nos. 11274092, 61271335); the Fundamental Research Funds for the Central Universities (Grant Nos. 2011B11114, 2011B11314, 2012B07314, 2012B04014); the National Natural Science Foundation for Young Scholars of China (Grant Nos. 61101158, 61201301, 31101643); the Jiangsu Province Natural Science Foundation for Young Scholars of China (Grant No. BK20130238); and the Open Research Fund of the Key Lab of Broadband Wireless Communication and Sensor Network Technology (Nanjing University of Posts and Telecommunications), Ministry of Education (Grant No. NYKL201305).
In the voice conversion (VC) literature, the method based on the statistical Gaussian mixture model (GMM) serves as a benchmark. However, one well-known inherent drawback of the GMM is the discontinuity problem,...