The Uncertainty in Prosody of Natural Speech and Its Application in Speech Synthesis  Cited by: 2


Author: Chu Min (初敏)[1]

Affiliation: [1] Microsoft Research Asia, Beijing 100080

Source: Journal of Chinese Information Processing (《中文信息学报》), 2004, No. 4, pp. 66-71 (6 pages)

Abstract: This paper gives a preliminary exploration of the uncertainty in the prosodic organization of natural speech and of its effect on the naturalness of synthesized speech. On that basis, it proposes replacing the traditional maximum generation probability criterion in prosody prediction with a minimum error probability criterion, so that multiple equivalent prosodic realizations are retained in the prediction result. It further proposes an algorithm that combines prosody prediction based on the minimum error criterion with unit selection: first, the samples least likely to cause prosodic errors are filtered out of all candidate units according to the minimum error criterion.

This paper explores the uncertainty of prosody in a speech corpus that contains two read versions of 1000 sentences, recorded by a professional voice talent under the same linguistic and affective planning. Corresponding prosodic features in the two versions are found to vary over a rather wide range. The scope of local variation can be as large as 45-50% of the overall variation range of a speaker. Based on this observation, the paper proposes a minimum error rate criterion (MERC) to replace the traditional maximum correct rate criterion in prosody generation. Furthermore, it proposes an approach for integrating the MERC into the unit selection algorithm. Among all instances of a speech unit, those with the lowest probability of resulting in unnatural prosody are picked out first; the most suitable path is then selected from all prosodically equivalent candidates under a smoothness criterion, to ensure the smoothest concatenation of all units on this path.
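The two-stage selection described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the `Unit` fields, the per-candidate `error` score, the `max_error` threshold, and the pitch-difference `join_cost` are all hypothetical stand-ins, since the abstract does not say how prosodic error probability or smoothness is computed. Stage 1 discards candidates judged likely to sound prosodically wrong; stage 2 treats the survivors as equivalent and picks the smoothest path by dynamic programming.

```python
from typing import List, NamedTuple, Tuple


class Unit(NamedTuple):
    name: str     # identifier of a recorded instance (hypothetical)
    error: float  # assumed score: likelihood of unnatural prosody
    pitch: float  # assumed single boundary pitch value, in Hz


def filter_candidates(cands: List[Unit], max_error: float) -> List[Unit]:
    """Stage 1: keep candidates unlikely to cause a prosodic error.

    Falls back to the single lowest-error candidate if none pass.
    """
    kept = [c for c in cands if c.error <= max_error]
    return kept or [min(cands, key=lambda c: c.error)]


def join_cost(a: Unit, b: Unit) -> float:
    """Assumed smoothness measure: pitch mismatch at the join."""
    return abs(a.pitch - b.pitch)


def smoothest_path(slots: List[List[Unit]]) -> Tuple[float, List[Unit]]:
    """Stage 2: dynamic programming over the filtered lattice."""
    # best holds, per candidate of the current slot, (total cost, path).
    best = [(0.0, [c]) for c in slots[0]]
    for slot in slots[1:]:
        new_best = []
        for c in slot:
            cost, path = min(
                ((pc + join_cost(pp[-1], c), pp) for pc, pp in best),
                key=lambda t: t[0],
            )
            new_best.append((cost, path + [c]))
        best = new_best
    return min(best, key=lambda t: t[0])


def select_units(lattice: List[List[Unit]], max_error: float):
    """Filter every slot first, then search only among the survivors."""
    filtered = [filter_candidates(s, max_error) for s in lattice]
    cost, path = smoothest_path(filtered)
    return path, cost


lattice = [
    [Unit("a1", 0.1, 100.0), Unit("a2", 0.9, 120.0)],
    [Unit("b1", 0.2, 150.0), Unit("b2", 0.3, 105.0), Unit("b3", 0.8, 100.0)],
    [Unit("c1", 0.1, 102.0), Unit("c2", 0.2, 160.0)],
]
path, cost = select_units(lattice, max_error=0.5)
print([u.name for u in path], cost)
```

In this toy lattice, `b3` would give the smoothest joins, but it is removed in stage 1 because its error score exceeds the threshold. That ordering, prosodic acceptability first and smoothness second, is the point the abstract makes; everything else here is an illustrative assumption.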

Keywords: computer application; Chinese information processing; speech; uncertainty of prosody; unit selection; minimum error criterion

Classification code: TP391.42 [Automation and Computer Technology: Computer Application Technology]

 
