基于声学相关特征与词典语法相关特征的汉语重音检测  被引量:8

Mandarin Stress Detection Using Acoustic,Lexical and Syntactic Features

在线阅读下载全文

作  者:倪崇嘉[1,2] 张爱英[1] 刘文举[2] 

机构地区:[1]山东财政学院统计与数理学院,济南250014 [2]中国科学院自动化研究所模式识别国家重点实验室,北京100190

出  处:《计算机学报》2011年第9期1638-1649,共12页Chinese Journal of Computers

基  金:国家自然科学基金(90820303;60675026;90820011);国家"八六三"高技术研究发展计划项目基金(20060101Z4073;2006AA01Z194);国家"九七三"重点基础研究发展规划项目基金(2004CB318105)资助~~

摘  要:重音对提高语音合成系统的自然度、可懂度以及语音识别系统的正确率等方面扮演着非常重要的作用.该文基于大规模韵律标注的语料库,利用声学相关特征及词典语法相关特征对汉语重音进行检测.采用Boosting集成分类回归树对当前音节的声学相关特征以及词典语法相关特征进行建模,Boosting集成分类回归树充分利用了当前音节的特性.同时还对词典语法相关特征采用条件随机场方法建模,条件随机场很好地利用了当前音节的上下文特性.最后,将Boosting集成分类回归树模型和条件随机场模型加权组合获得识别率更高的混合模型.该混合模型克服了Boosting集成分类回归树模型的不足,实现了Boosting集成分类回归树和条件随机场的优势互补.实验结果表明该方法具有较好的分类效果,在ASCCD语料库上能够获得84.82%重音检测正确率.同时,与之前其他人的工作在相同的条件下(相同的训练集和测试集)对比,在正确率方面,该方法分别有4.01%和1.67%的提高.另外,该文中,对英语的重音检测和汉语的重音检测做了对比,并通过特征分析方法从另一个层面验证了一些语言学上的结论.The stress is important to improve the naturalness, understandability and intelligibili ty of speech synthesis system and the correct rate of automatic speech recognition system. In this paper, we conduct stress detection by using the acoustic, lexical and syntactic features based on large scale prosodic annotation corpus. Boosting classification and regression tree is utilized to model the acoustic, lexical and syntactic features, which adequately utilizes the property of the current syllable. Conditional random fields (CRFs) are utilized to model the lexical and syntactic features, which adequately utilize the contextual property of the current syllable. The combina- tion of boosting classification and regression tree and conditional random fields achieves better classification effect when compared with boosting classification and regression tree model or conditional random fields. The combined model overcomes the efficiency of boosting classification and regression tree model, and realizes the complementarities with the advantages of boosting classification and regression tree and conditional random fields. The experimental results indicate that the proposed method acquires better classification effect, and achieves 84.82% stress detection accuracy rate on ASCCD. Compared with the previous counterpart work in the same conditions (the same training set and testing set), there are 4.01%and 1.67% improvements respectively in terms of the correct rate. In this paper, we also compare the differences and the similarities between Mandarin stress detection and English pitch accent detection. Based on the feature analysis on the large scale prosodic annotation corporus, we also verify some linguistie conclusions in a different way

关 键 词:重音 Boosting集成分类回归树 条件随机场 神经网络 分类回归树 

分 类 号:TP319[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象