维吾尔语情感语音韵律转换研究  被引量:2

Rearch on prosodic hierarchy conversion for Uyghur emotional speech

在线阅读下载全文

作  者:杜楠楠[1] 赵晖[1] DU Nannan;ZHAO Hui(College of Information Sciences and Engineering, Xinjiang University, Urumqi 830046, China)

机构地区:[1]新疆大学信息科学与工程学院,乌鲁木齐830046

出  处:《计算机工程与应用》2016年第19期154-160,共7页Computer Engineering and Applications

基  金:国家自然科学基金(N0.61261037)

摘  要:面向维吾尔语情感语音转换,提出一种韵律建模转换方法。该方法结合了维吾尔语韵律特点及语言特点,首次利用离散余弦变换(DCT)分别参数化维吾尔语音节和韵律短语的情感基频。采用高斯混合模型(GMM)训练中性-情感基频联合特征,同时合成中性语速情感语音和情感语速情感语音,主观评测结果显示情感语速更有助于表达情感效果。主客观实验结果显示转换方法可有效进行维吾尔语情感韵律转换,三种情感下,音节和韵律短语的结果均达到75%以上,韵律短语的转换效果要稍优于音节。A prosody conversion method is proposed for transforming neutral speech to emotional speech of Uyghur. Themethod uses the Discrete Cosine Transform(DCT)to parameterize the emotion fundamental frequency of the Uyghur syllablesand prosodic phrases for the first time, which combining the Uyghur prosodic features and language features. Usingthe Gaussian Mixture Model(GMM)to train the joint characteristics of the neutral and emotional frequency, and then synthesizeemotional speech with neutral speed and emotional speed. The listening test results show that emotional speed ismore helpful to express the emotional speech. The objective evaluation and the listening test results show that method canactualize Uyghur emotional prosody conversion effectively, the conversion results of syllables and prosodic phrases ofthree emotions achieve accuracy of more than 75% in listening test, and the prosodic phrases is better than that of syllables.

关 键 词:基频 情感语音转换 离散余弦变换 高斯混合模型 音节 韵律短语 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象