基于优化检测网络和MLP特征改进发音错误检测的方法  被引量:2

Mispronunciation detection with an optimized detection network and multi-layer perception based features

在线阅读下载全文

作  者:袁桦[1] 钱彦旻[1] 赵军红[2] 刘加[1] 

机构地区:[1]清华大学电子工程系,清华信息科学与技术国家实验室,北京100084 [2]中国科学院电子学研究所,传感技术国家重点实验室,北京100190

出  处:《清华大学学报(自然科学版)》2012年第4期557-560,570,共5页Journal of Tsinghua University(Science and Technology)

基  金:国家自然科学基金资助项目(60931160443,90920302,N-CUHK414/09);国家科技支撑计划项目(2009BAH41B01)

摘  要:该文基于优化的检测网络和多层感知(multi-layerperception,MLP)特征,提出一种可以更加准确地检测出错误发音类型的方法。首先,从第二语言学习的语音库中提取出基本的发音规则以及组合的发音规则,并相应地计算它们发生的先验概率,再将这些具有先验概率的规则用于构建基于多发音的扩展检测网络。然后在检测过程中,引入基于发音特征的MLP特征来描述发音概率,替代了传统的语音声学特征。最后使用基于MLP特征的GMM-HMM框架从检测网络中识别出最可能的发音音素串。实验表明:该方法将音素识别正确率提高了3.11%,错误类型准确率提高了7.42%。This paper describes an optimized detection network for multi-layer pereeptron (MLP) features to more accurately capture mispronunciations. First, the basic and combined phonological rules are extracted from the L2 speech corpus with computation of their prior probability of occurrence. The prior probability rules are then used to build a multiple pronunciation based extended detection network. Then, articulatory based MLP features are introduced to describe the pronunciation probability instead of the conventional speech acoustic features during detection. Finally, the GMM-HMM framework with MLP features is used to pick the most probable pronunciation phoneme sequences from the detection network. Tests show that this approach improves phoneme recognition accuracy by 3.11% and the mispronunciation type accuracy by 7.42%.

关 键 词:发音错误检测 发音规则 多层感知(MLP) 发音特征 

分 类 号:TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象