维吾尔语韵律建模  

Prosody modeling for Uyghur TTS

在线阅读下载全文

作  者:古力米热.依玛木 姑丽加玛丽.麦麦提艾力 玛依努尔.阿吾力提甫 艾斯卡尔.艾木都拉[4] Gulmire Imam;Guljamal Mamateli;Maynur Ablitip;Askar Hamdulla(School of Literature, Xinjiang Normal University Urumqi 830054, China;School of Mathematical Sciences, Xinjiang Normal University, Urumqi 830054, China;Xinjiang Normal University Library, Urumqi 830054, China;Institute of Information Science and Engineering, Xinjiang University, Urumqi 830046, China)

机构地区:[1]新疆师范大学文学院,乌鲁木齐830054 [2]新疆师范大学数学科学学院,乌鲁木齐830054 [3]新疆师范大学图书馆,乌鲁木齐830054 [4]新疆大学信息科学与工程学院,乌鲁木齐830046

出  处:《清华大学学报(自然科学版)》2017年第12期1259-1264,共6页Journal of Tsinghua University(Science and Technology)

基  金:教育部社科基金资助项目(10YJA740027);教育部新世纪优秀人才支持计划资助项目(NCET-10-0969);国家自然科学基金地区项目(61462087;61065005)

摘  要:对维吾尔语的韵律结构进行了全面的研究,从维吾尔语语音合成(text to speech,TTS)语音库中提取了音节的时长、能量、基频均值、最大值、最小值和基频范围等韵律特征参数,分析了其在音节处于不同韵律层次时的变化规律。提取了语音数据中韵律边界前后的音节延长量、音高重置和无声段等声学特征参数,并对它们的分布规律进行了统计分析。实验结果表明:不同韵律层级之间时长延长量和音高差值随着边界层级的提高而增加;韵律词边界之间没有显著地停顿,韵律短语和语调短语层级边界之间的平均停顿时长分别是154.2和212.8ms。The prosodic features of syllables such as duration, energy, mean pitch, maximum pitch, minimum pitch and pitch range were extracted from a Uyghur text to speech (TTS) database with analyses of their variations for different prosodic hierarchies. The pitch reset, pre-boundary lengthening, and silence duration of different prosodic boundaries were also analyzed. The results of acoustic experiments show that the pitch reset and pre-boundary lengthening are much greater as the prosodic boundary degree increases. No obvious pause can be perceived at the prosodic word (PW) boundary and the average silence duration at the prosodic phrase (PP) and intonation phrase (INP) boundaries are 154.2 and 212.8 ms.

关 键 词:维吾尔语 语音合成 韵律结构 声学特征分析 

分 类 号:TN912.33[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象