supported in part by the National Natural Science Foundation of China (61501249, 61071167, 41601601);the Key Research and Development Program of Jiangsu Province (BE2016775);the Natural Science Foundation of Jiangsu Province for Youth (BK20150855);Research Project of Science and Technology Department of Jiangsu Province (BY2015011-1);the Natural Science Foundation for Jiangsu Higher Education Institutions (15KJB510022);the Nanjing University of Posts and Telecommunications Science Foundation (NY214143)
Voice conversion (VC) based on Gaussian mixture model (GMM) is the most classic and common method which converts the source spectrum to target spectrum. However this method is prone to over-fitting because of its ...