普通话发音检错系统中TANDEM方法的研究与实现  

RESEARCH AND REALISATION OF TANDEM IN MANDARIN SPEECH ERROR DETECTION SYSTEM

在线阅读下载全文

作  者:龚澍[1] 魏思[1] 胡国平[1] 刘庆峰[1] 

机构地区:[1]中国科学技术大学电子工程与信息科学系,安徽合肥230027

出  处:《计算机应用与软件》2011年第7期233-236,共4页Computer Applications and Software

摘  要:语音识别中常用的HMM/GMM框架由于训练准则和算法的限制,对模式的辨识能力较差;另一种HMM/ANN框架虽具有极强的模式分类能力,但缺乏成熟有效的优化手段。将一种综合两者优点的TANDEM方法应用到普通话发音检错系统中,通过使用区分性训练的神经网络去估计音素级后验概率,经过一系列后续处理将原始MFCC特征转化为TANDEM特征,作为基于HMM统计模型的发音检错系统的输入,进而完成评测过程。实验结果证明,TANDEM方法使系统的检错性能有了较大的提升,结合MLLR等自适应方法的使用效果会更为明显。The HMM/GMM framework which is often used in voice recognition is comparatively poor at mode recognition due to its limitation on training rule and algorithm.Another framework,the HMM/ANN,is better at mode classification but lacks mature and effective optimisation approaches.In the thesis,the authors apply TANDEM which holds the advantages of both frameworks to the mandarin speech error detection system.At the beginning a discriminatingly training neural network estimates rhythm level pro-check probability,then with a series of processes converts the original MFCC feature to TANDEM feature as the input to the HMM statistical model based speech error detection system,consequently completes the evaluation process.Experiment shows the TANDEM approach can greatly improve the error detection performance of the system.It works even better when cooperates with an adaptive approach such as MLLR.

关 键 词:语音识别评测 发音检错 TANDEM 多层感知器 最大似然线性回归 

分 类 号:TN912.33[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象