Maximum likelihood polynomial regression for robust speech recognition

Maximum likelihood polynomial regression for robust speech recognition

机构地区：[1]School of Information Science and Engineering, Southeast University Nanjing 210096

出　　处：《Chinese Journal of Acoustics》2011年第3期358-370,共13页声学学报（英文版）

基　　金：supported by the 973 Program of China(2002CB312102);the National Natural Science Foundation of China(60672094)

摘　　要：The linear hypothesis is the main disadvantage of maximum likelihood linear re- gression （MLLR）. This paper applies the polynomial regression method to model adaptation and establishes a nonlinear model adaptation algorithm using maximum likelihood polynomial regression （MLPR） for robust speech recognition. In this algorithm, the nonlinear relationship between training and testing Gaussian means in every Mel channel is approximated by a set of polynomials and the polynomial coefficients are estimated from adaptation data in test envi- ronment using the expectation- maximization （EM） algorithm and maximum likelihood （ML） criterion. The experimental results show that the second-order polynomial can approximate the actual nonlinear function better and in noise compensation and speaker adaptation, the word error rates of MLPR are significantly lower than those of MLLR. The proposed MLPR algorithm overcomes the limitation of linear hypothesis well and can decrease the impact of noise, speaker and other factors simultaneously. It is especially suitable for joint adaptation of speaker and noise.The linear hypothesis is the main disadvantage of maximum likelihood linear re- gression （MLLR）. This paper applies the polynomial regression method to model adaptation and establishes a nonlinear model adaptation algorithm using maximum likelihood polynomial regression （MLPR） for robust speech recognition. In this algorithm, the nonlinear relationship between training and testing Gaussian means in every Mel channel is approximated by a set of polynomials and the polynomial coefficients are estimated from adaptation data in test envi- ronment using the expectation- maximization （EM） algorithm and maximum likelihood （ML） criterion. The experimental results show that the second-order polynomial can approximate the actual nonlinear function better and in noise compensation and speaker adaptation, the word error rates of MLPR are significantly lower than those of MLLR. The proposed MLPR algorithm overcomes the limitation of linear hypothesis well and can decrease the impact of noise, speaker and other factors simultaneously. It is especially suitable for joint adaptation of speaker and noise.

分类号：TP391.41[自动化与计算机技术—计算机应用技术] O212.1[自动化与计算机技术—计算机科学与技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

Maximum likelihood polynomial regression for robust speech recognition

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

Maximum likelihood polynomial regression for robust speech recognition

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索