调型作预处理器的普通话双音词识别方案  

A Pitch-Contour-Based Preprocessor for Recognition of Mandarin Bisyllabic Words

在线阅读下载全文

作  者:许利群[1] 陈永彬[2] 

机构地区:[1]南京工学院无线电系 [2]东南大学无线电系

出  处:《通信学报》1989年第3期56-60,51,共6页Journal on Communications

摘  要:基于超音段信息在语音感知中的显著作用。本文提出了一种新颖的汉语双音节词(二字词)识别方案。首先将输入语音调型进行时、频归一化处理,并将其和参考调型匹配;再对由此得到的候选集进行精确的谱匹配。在这步处理中结合了动态能量信息,并采用了修正的动态规划算法。实验结果表明,这种方案对于高混淆性汉语二字词识别十分有效。In this paper a novel approach is presented for the recognition of Mandarin bisyllabic words, which is based on the essential role the suprasegmentals play in speech perception. The pitch of the input utterance is first extracted, after undergoing time- and frequency- normalized procedure, it is then compared with a set of reference pitch contours, thus a search for the optimal candidates in the following fine matching is guided, in which a modified One-Stage Dynamic Programing algorithm is adopted by using a compound distortion metric of spectral and dynamic energy shape. The results show that the approach in question is particular suitable for recognizing highly confused Mandarin bisyllabic words.

关 键 词:调型 预处理器 汉语 双音词识别 

分 类 号:TN912.34[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象