Grading the Severity of Mispronunciations in CAPT Based on Statistical Analysis and Computational Speech Perception  

Grading the Severity of Mispronunciations in CAPT Based on Statistical Analysis and Computational Speech Perception

在线阅读下载全文

作  者:贾珈 梁伟俭 吴育昊 张秀龙 王昊 蔡莲红 蒙美玲 

机构地区:[1]Department of Computer Science and Technology, Tsinghua University [2]Tsinghua National Laboratory for Information Science and Technology, Tsinghua University [3]Key Laboratory of Pervasive Computing,Ministry of Education [4]Human-Computer Communications Laboratory, Department of Systems Engineering and Engineering Management The Chinese University of Hong Kong [5]Tsinghua-CUHK Joint Research Center for Media Sciences,Technologies and Systems

出  处:《Journal of Computer Science & Technology》2014年第5期751-761,共11页计算机科学技术学报(英文版)

基  金:supported by the National Basic Research 973 Program of China under Grant No.2013CB329304;the National Natural Science Foundation of China under Grant No.61370023;the Major Project of the National Social Science Foundation of China under Grant No.13&ZD189;partially supported by the General Research Fund of the Hong Kong SAR Government under Project No.415511;the CUHK Teaching Development Grant

摘  要:Computer-aided pronunciation training(CAPT) technologies enable the use of automatic speech recognition to detect mispronunciations in second language(L2) learners' speech. In order to further facilitate learning, we aim to develop a principle-based method for generating a gradation of the severity of mispronunciations. This paper presents an approach towards gradation that is motivated by auditory perception. We have developed a computational method for generating a perceptual distance(PD) between two spoken phonemes. This is used to compute the auditory confusion of native language(L1). PD is found to correlate well with the mispronunciations detected in CAPT system for Chinese learners of English,i.e., L1 being Chinese(Mandarin and Cantonese) and L2 being US English. The results show that auditory confusion is indicative of pronunciation confusions in L2 learning. PD can also be used to help us grade the severity of errors(i.e.,mispronunciations that confuse more distant phonemes are more severe) and accordingly prioritize the order of corrective feedback generated for the learners.Computer-aided pronunciation training(CAPT) technologies enable the use of automatic speech recognition to detect mispronunciations in second language(L2) learners' speech. In order to further facilitate learning, we aim to develop a principle-based method for generating a gradation of the severity of mispronunciations. This paper presents an approach towards gradation that is motivated by auditory perception. We have developed a computational method for generating a perceptual distance(PD) between two spoken phonemes. This is used to compute the auditory confusion of native language(L1). PD is found to correlate well with the mispronunciations detected in CAPT system for Chinese learners of English,i.e., L1 being Chinese(Mandarin and Cantonese) and L2 being US English. The results show that auditory confusion is indicative of pronunciation confusions in L2 learning. PD can also be used to help us grade the severity of errors(i.e.,mispronunciations that confuse more distant phonemes are more severe) and accordingly prioritize the order of corrective feedback generated for the learners.

关 键 词:second language learning computer-aided pronunciation training mispronunciation computational speech perception 

分 类 号:TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象