Maximum likelihood polynomial regression for robust speech recognition  

Maximum likelihood polynomial regression for robust speech recognition

在线阅读下载全文

作  者:LU Yong WU Zhenyang 

机构地区:[1]School of Information Science and Engineering, Southeast University Nanjing 210096

出  处:《Chinese Journal of Acoustics》2011年第3期358-370,共13页声学学报(英文版)

基  金:supported by the 973 Program of China(2002CB312102);the National Natural Science Foundation of China(60672094)

摘  要:The linear hypothesis is the main disadvantage of maximum likelihood linear re- gression (MLLR). This paper applies the polynomial regression method to model adaptation and establishes a nonlinear model adaptation algorithm using maximum likelihood polynomial regression (MLPR) for robust speech recognition. In this algorithm, the nonlinear relationship between training and testing Gaussian means in every Mel channel is approximated by a set of polynomials and the polynomial coefficients are estimated from adaptation data in test envi- ronment using the expectation- maximization (EM) algorithm and maximum likelihood (ML) criterion. The experimental results show that the second-order polynomial can approximate the actual nonlinear function better and in noise compensation and speaker adaptation, the word error rates of MLPR are significantly lower than those of MLLR. The proposed MLPR algorithm overcomes the limitation of linear hypothesis well and can decrease the impact of noise, speaker and other factors simultaneously. It is especially suitable for joint adaptation of speaker and noise.The linear hypothesis is the main disadvantage of maximum likelihood linear re- gression (MLLR). This paper applies the polynomial regression method to model adaptation and establishes a nonlinear model adaptation algorithm using maximum likelihood polynomial regression (MLPR) for robust speech recognition. In this algorithm, the nonlinear relationship between training and testing Gaussian means in every Mel channel is approximated by a set of polynomials and the polynomial coefficients are estimated from adaptation data in test envi- ronment using the expectation- maximization (EM) algorithm and maximum likelihood (ML) criterion. The experimental results show that the second-order polynomial can approximate the actual nonlinear function better and in noise compensation and speaker adaptation, the word error rates of MLPR are significantly lower than those of MLLR. The proposed MLPR algorithm overcomes the limitation of linear hypothesis well and can decrease the impact of noise, speaker and other factors simultaneously. It is especially suitable for joint adaptation of speaker and noise.

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术] O212.1[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象