Efficient fundamental frequency transformation for voice conversion  

用于语音转换的有效基音频率转换算法(英文)

在线阅读下载全文

作  者:宋鹏[1] 金赟[1,2] 包永强[3] 赵力[1] 邹采荣[1] 

机构地区:[1]东南大学水声信号处理教育部重点实验室,南京210096 [2]徐州师范大学物理与电子工程学院,徐州221l16 [3]南京工程学院通信工程学院,南京211167

出  处:《Journal of Southeast University(English Edition)》2012年第2期140-144,共5页东南大学学报(英文版)

基  金:The National Natural Science Foundation of China(No. 60975017);the Natural Science Foundation of Guangdong Province (No. 10252800001000001);the Natural Science Foundation of Higher Education Institutions of Jiangsu Province (No. 10KJB510005)

摘  要:In order to improve the performance of voice conversion, the fundamental frequency (F0) transformation methods are investigated, and an efficient F0 transformation algorithm is proposed. First, unlike the traditional linear transformation methods, the relationships between F0s and spectral parameters are explored. In each component of the Gaussian mixture model (GMM), the F0s are predicted from the converted spectral parameters using the support vector regression (SVR) method. Then, in order to reduce the over- smoothing caused by the statistical average of the GMM, a mixed transformation method combining SVR with the traditional mean-variance linear (MVL) conversion is presented. Meanwhile, the adaptive median filter, prevalent in image processing, is adopted to solve the discontinuity problem caused by the frame-wise transformation. Objective and subjective experiments are carried out to evaluate the performance of the proposed method. The results demonstrate that the proposed method outperforms the traditional F0 transformation methods in terms of the similarity and the quality.为了改善语音转换的性能,对基音频率转换方法进行了研究,并提出了一种有效的转换算法. 首先,不同于传统的线性变换方法,对基音频率和频谱特征的内在关系进行了分析,在 GMM 中的每一分量,基音频率通过 SVR 方法从转换后的频谱特征预测得到. 然后,为了缓解 GMM 统计平均带来的过平滑问题,将传统的均值-方差转换方法和 SVR 方法相结合. 同时,引入广泛应用于图像处理的自适应中值滤波来解决由基于帧转换引起的不连续问题. 通过主客观评价方法对转换后的语音质量进行了测试,结果表明: 该方法无论在语音的相似度还是转换语音的质量上,都取得了比传统方法更好的效果.

关 键 词:F0 prediction support vector regression meanvariance linear conversion adaptive median filter 

分 类 号:TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象