Statistical model based on probability frequency for Mandarin prosodic structure prediction (cited by: 3)

Source: Journal of Tsinghua University (Science and Technology), 2006, No. 1, pp. 78-81 (4 pages)

Funding: National "863" High-Tech Program (2004AA117010); National Natural Science Foundation of China (60275014)

Abstract: To further improve the accuracy of prosodic structure prediction in text-to-speech (TTS) conversion systems, a statistical model based on probability frequency is proposed to predict a two-level prosodic structure: prosodic word boundaries and prosodic phrase boundaries. The method first extracts linguistic features related to these boundaries (part of speech, grammatical (lexical) words, length, position, and so on), then computes the probability frequency of each feature from training samples, and finally builds separate statistical models for prosodic words and prosodic phrases. Experiments show that the model predicts prosodic word and prosodic phrase boundaries with accuracies of up to 90.6% and 84.6%, respectively, an improvement of more than 10% over the decision tree and transformation-based learning (TBL) algorithms.
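The abstract does not give the formula for the probability frequency or say how the per-feature values are combined, so the following is only a minimal sketch under stated assumptions, not the authors' model. It assumes that the probability frequency of a feature value is the relative frequency with which a boundary follows that value in the training samples, and that a candidate position is scored by averaging these values. The names `train`, `predict`, `threshold`, and `default`, and the toy samples, are hypothetical.

```python
from collections import defaultdict


def train(samples):
    """Estimate a per-feature boundary frequency (assumed definition).

    samples: iterable of (features, is_boundary), where features is a dict
    such as {"pos": "n", "length": 2} and is_boundary is a bool.
    Returns a dict mapping (feature_name, value) to the fraction of
    training samples with that feature value that were boundaries.
    """
    counts = defaultdict(lambda: [0, 0])  # key -> [boundary count, total count]
    for features, is_boundary in samples:
        for key in features.items():
            counts[key][0] += int(is_boundary)
            counts[key][1] += 1
    return {key: b / n for key, (b, n) in counts.items()}


def predict(model, features, threshold=0.5, default=0.5):
    """Score a candidate position by averaging its feature frequencies.

    Averaging is an assumption of this sketch; unseen feature values
    fall back to `default`. Returns (is_boundary, score).
    """
    scores = [model.get(item, default) for item in features.items()]
    score = sum(scores) / len(scores) if scores else default
    return score >= threshold, score


# Toy, made-up samples: part of speech and word length before the position.
samples = [
    ({"pos": "n", "length": 2}, True),
    ({"pos": "n", "length": 1}, True),
    ({"pos": "v", "length": 1}, False),
    ({"pos": "n", "length": 1}, False),
]
model = train(samples)
noun_result = predict(model, {"pos": "n", "length": 2})
verb_result = predict(model, {"pos": "v", "length": 1})
```

Under these assumptions, two such models would be trained separately, one on prosodic word boundary labels and one on prosodic phrase boundary labels, mirroring the two-level structure described in the abstract.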

Keywords: text information processing; prosodic word; prosodic phrase; probability frequency; statistical model

Classification: TP391.1 [Automation and Computer Technology: Computer Application Technology]
