汉语连续语音识别的语速自适应算法  被引量:7

The speaking rate adaptation algorithm in Putonghua continuous speech recognition

在线阅读下载全文

作  者:王作英[1] 李健[1] 

机构地区:[1]清华大学电子工程系,北京100084

出  处:《声学学报》2003年第3期229-234,共6页Acta Acustica

摘  要:在连续语音中,不同的说话者在不同语境下说话的速度差异是很大的。偏离正常语速往往会造成识别错误,使识别性能下降。考虑到语速对于语音单元段长的影响是同步增长或同步下降的,相邻语音单元的段长之间存在很强的相关性,本文从利用段长的相关信息出发,在基于段长分布的隐含马尔可夫模型(DDBHMM:Duration Distribution Based HMM)的框架上,提出了一种语速自适应算法。对数字串和大词汇量连续语音识别的试验表明这个算法是有效的。In continuous speech, the difference of speaking rates is big among speakers in different speaking environment. The variation of the speaking rates can cause recognition errors and affect the performance of LVCSR(Large Vocabulary Continuous Speech Recognition) systems. It is noted that the duration of neighboring speech units, which is affected by speaking rates, increases or decreases synchronously and a strong correlation exits between them. Based on the framework of DDBHMM (Duration Distribution Based HMM), a speaking rate adaptation algorithm is proposed. For utilizing the correlation information between duration of neighboring speech units. The experiments on connected digit and large vocabulary continuous speech show that the new algorithm is effective.

关 键 词:汉语 连续语音识别 语速 自适应算法 隐含马尔可夫模型 语音信号处理 

分 类 号:TN912.34[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象