基于最大似然多项式回归的鲁棒语音识别被引量：3

Maximum likelihood polynomial regression for robust speech recognition

出　　处：《声学学报》2010年第1期88-96,共9页Acta Acustica

基　　金：国家973计划(2002CB312102);国家自然科学基金(60672094)资助项目

摘　　要：本文针对最大似然线性回归算法线性假设的缺点,将多项式回归方法用于模型自适应,构建了基于最大似然多项式回归的非线性模型自适应算法。该算法在对数谱域用多项式回归方法,逼近每个Mel子带上识别环境模型均值与训练环境模型均值之间的非线性关系。多项式系数通过EM算法和最大似然准则从识别环境下的少量自适应数据中估计。实验结果表明,二阶多项式就可以较好地逼近模型均值的非线性环境变换关系。在噪声补偿和说话人自适应实验中,最大似然多项式回归算法的误识率都明显低于最大似然线性回归算法。本文算法较好地克服了线性模型自适应算法线性假设的缺陷,可同时减小噪声,和说话人的改变或其它因素对语音识别系统的影响,尤其适合说话人和噪声的联合自适应。The linear hypothesis is the main disadvantage of maximum likelihood linear regression （MLLR）. This paper applies the polynomial regression method to model adaptation and establishes a nonlinear adaptation algorithm using maximum likelihood polynomial regression （MLPR） for robust speech recognition. In this algorithm, the nonlinear relationship between training and testing mean vectors in every Mel-band is approximated by a set of polynomials. The polynomial coefficients are estimated from small adaptation data in test environment by the expectation-maximization （EM） algorithm and maximum likelihood （ML） criterion. The experimental results show that the second-order polynomial can approximate the nonlinear function of training and testing mean vectors perfectly. In noise compensation and speaker adaptation, the word error rates of MLPR are significantly lower than those of MLLR. The proposed algorithm overcomes the limitation of linear hypothesis well and can decrease the impact of noise, speaker and other factors simultaneously. It is especially suitable for joint adaptation of speaker and noise.

关键词：最大似然准则语音识别系统多项式回归线性回归算法说话人自适应模型自适应非线性模型自适应算法

分类号：TN912.34[电子电信—通信与信息系统]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于最大似然多项式回归的鲁棒语音识别被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于最大似然多项式回归的鲁棒语音识别 被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于最大似然多项式回归的鲁棒语音识别被引量：3