噪声环境下畸变模型线性化处理的顽健语音识别方法  

Linearized distortion model for robust speech recognition in noisy environments

在线阅读下载全文

作  者:何勇军[1,2] 韩纪庆[1] 

机构地区:[1]哈尔滨工业大学计算机科学与技术学院,黑龙江哈尔滨150001 [2]哈尔滨理工大学计算机科学与技术学院,黑龙江哈尔滨150080

出  处:《通信学报》2010年第9期8-14,共7页Journal on Communications

基  金:国家高技术研究发展计划("863"计划)基金资助项目(2006AA010103);国家重点基础研究发展计划("973"计划)基金资助项目(2007CB311100)~~

摘  要:针对噪声环境下语音识别的顽健性问题,考虑到梅尔倒谱系数(MFCC,Mel-frequency cepstral coefficient)域的畸变模型高度非线性且难以处理,用分段线性插值函数代替对数函数,提出了一种新的线性畸变模型。在此基础上,导出了噪声参数估计和声学模型补偿方法,无需采用矢量泰勒级数(VTS,vector Taylor series)展开作近似处理,有效避免了模型误差的引入,增强了系统在噪声环境下的顽健性。The robustness of speech recognition system in noisy environments was investigated.The distortion model in Mel-frequency cepstral coefficient(MFCC) domain is highly non-linear and difficult to deal with.A new linear distortion model was proposed by replacing the logarithm operation with its piecewise linear interpolation function.Then the esti-mation of noise parameters and compensation of acoustic models were provided.The proposed method can avoid model error introduced by utilizing linearization methods based on vector Taylor series(VTS) expansion,and significantly im-prove the robustness of recognizer in noisy environments.

关 键 词:语音识别 顽健性 畸变模型 线性化 

分 类 号:TN912[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象