Mandarin-Tibetan bilingual speech synthesis based on speaker adaptive training (cited by: 5)

Realizing Mandarin-Tibetan bilingual speech synthesis by speaker adaptive training

Source: Journal of Tsinghua University (Science and Technology), 2013, No. 6, pp. 776-780 (5 pages)

Funding: National Natural Science Foundation of China (61263036; 61262055); Gansu Province Outstanding Youth Fund (1210RJDA007)

Abstract: Exploiting the similarities between Tibetan and Mandarin pronunciation, this paper proposes a hidden Markov model (HMM)-based method for Mandarin-Tibetan bilingual speech synthesis. Initials and finals serve as the synthesis units. Speaker adaptive training (SAT) on a large multi-speaker Mandarin corpus and a small single-speaker Tibetan corpus yields a mixed-lingual Mandarin-Tibetan average voice model. A speaker adaptation transformation then derives a speaker-dependent (SD) Mandarin or Tibetan model from the mixed-lingual average voice model, and Mandarin or Tibetan speech is synthesized from that model. Experiments show that, when only a small number of Tibetan training utterances are available, Tibetan speech synthesized by this method is clearly better than Tibetan speech synthesized from a Tibetan SD model alone.
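The two-stage idea in the abstract (train a shared average voice model, then adapt it to one speaker) can be illustrated with a deliberately tiny sketch. This is not the paper's implementation: the abstract does not specify the transforms used, so the sketch assumes a scalar "acoustic feature" per initial/final unit and a single additive bias per speaker in place of HMM state distributions and linear adaptation transforms. All names (`train_average_voice`, `adapt_to_speaker`, the unit labels, and the numbers) are hypothetical.

```python
from collections import defaultdict


def train_average_voice(data, iterations=50):
    """Toy speaker adaptive training.

    data: list of (speaker, unit, value) observations.
    Assumed model: value = mean[unit] + bias[speaker].
    Alternates between re-estimating unit means (with the current
    speaker biases removed) and re-estimating speaker biases.
    """
    bias = defaultdict(float)
    mean = {}
    for _ in range(iterations):
        # Unit means from speaker-normalized observations.
        unit_sum, unit_n = defaultdict(float), defaultdict(int)
        for spk, unit, x in data:
            unit_sum[unit] += x - bias[spk]
            unit_n[unit] += 1
        mean = {u: unit_sum[u] / unit_n[u] for u in unit_sum}
        # Speaker biases from residuals against the unit means.
        spk_sum, spk_n = defaultdict(float), defaultdict(int)
        for spk, unit, x in data:
            spk_sum[spk] += x - mean[unit]
            spk_n[spk] += 1
        bias = defaultdict(float, {s: spk_sum[s] / spk_n[s] for s in spk_sum})
    return mean


def adapt_to_speaker(mean, adaptation_data):
    """Shift every unit mean by one offset estimated from (unit, value) pairs."""
    offset = sum(x - mean[u] for u, x in adaptation_data) / len(adaptation_data)
    return {u: m + offset for u, m in mean.items()}


# Hypothetical toy data: units "a", "b", "c" occur in Mandarin, "d" only in
# Tibetan; the Tibetan speaker "t" recorded only units "a" and "d".
true_unit = {"a": 1.0, "b": 2.0, "c": 3.0, "d": 4.0}
true_spk = {"m1": 0.5, "m2": -0.5, "t": 3.0}
coverage = {"m1": "abc", "m2": "abc", "t": "ad"}
data = [(s, u, true_unit[u] + true_spk[s])
        for s, units in coverage.items() for u in units]

avg = train_average_voice(data)
tibetan = adapt_to_speaker(avg, [(u, x) for s, u, x in data if s == "t"])
print(tibetan["b"])  # estimate for a unit the Tibetan speaker never recorded
```

In this toy setting, the adapted model gives the Tibetan speaker an estimate for unit "b" even though that speaker never produced it, because the unit mean was learned from the Mandarin speakers. That is the intuition behind pooling a large Mandarin corpus with a small Tibetan one; the real system works on HMM parameters rather than scalars.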

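Keywords: speech synthesis; hidden Markov model (HMM); speaker adaptive training; multilingual speech synthesis; Tibetan speech synthesis; Mandarin-Tibetan bilingual speech synthesis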

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]
