基于REMOS的远距离语音识别模型补偿方法被引量：3

REMOS-based method for model domain compensation in remote speech recognition

出　　处：《重庆邮电大学学报（自然科学版）》2014年第1期117-123,130,共8页Journal of Chongqing University of Posts and Telecommunications(Natural Science Edition)

基　　金：重庆市自然科学基金(CSTC 2007BB2445);重庆市教委科学技术研究项目(KJ110522);重庆邮电大学科研基金(A2009-26)~~

摘　　要：封闭环境中远距离语音识别会受到混响效果的影响,从而降低语音识别率。混响建模(reverberation modeling for speech recognition,REMOS)是一种在模型域进行混响补偿的新方法,该方法在已知声源位置的情况下能有效提升远距离语音识别精度。但在实际应用中,往往难以预测声源的位置。利用最大后验概率的原理,基于对房间不同区域进行有区别补偿的思想,在按帧的隐马尔可夫模型(hidden Markov model,HMM)补偿的基础上,提出一种在封闭环境中新的模型补偿方法。该方法利用K均值聚类K-means算法对房间冲击响应(room impulse response,RIR)的优化集进行聚类,对所属相同类的混响模型进行合并处理,再把合并后的混响模型载入维特比算法中,对清晰语音的HMM模型进行按帧补偿。最后采用后验概率方法选择最佳补偿,使得模型域的混响补偿能最接近精确补偿。实验证明,该方法能进一步提升远距离语音识别的精度。The distant-talking speech recognition would be affected by reverb in a enclosed environment. As a result, the recognition rate would be greatly reduced. Reverberation modeling for speech recognition（REMOS） is a new method for re- verberate compensation in the model domain; it can improve distant-talking speech recognition accuracy effectively if the sound source location is already known. But in a real application, location of sound source can be hardly to predicted. Based on the principle of maximum a posteriori probability and frame-wise hidden Markov model（HMM） model compensa- tion, a new method for model compensation in a enclosed environment is proposed in this paper. In this method, K-means clustering algorithm is used to cluster room impulse response （RIR） optimized sets, and merge the reverberation model which is in a same kind class, then Viterbi decoding algorithm is loaded, and frame-wise compensation is implemented to the clear speech HMM model. At last, the best compensate model is selected through the maximum a posteriori estimation. It makes model domain reverberate compensation to be closest to the accurate compensation. The experimental results prove that the method can enhance distant-talking speech recognition accuracy further.

关键词：混响混响建模(REMOS) K—means 房间冲击响应模型补偿

分类号：TP391.4[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于REMOS的远距离语音识别模型补偿方法被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于REMOS的远距离语音识别模型补偿方法 被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于REMOS的远距离语音识别模型补偿方法被引量：3