基于码本映射和GMM的语音带宽扩展  

Speech Bandwidth Extension Based on Codebook Mapping and GMM

在线阅读下载全文

作  者:王迎雪[1] 于莹莹[1] 赵胜辉[1] 匡镜明[1] 

机构地区:[1]北京理工大学信息与电子学院,北京100081

出  处:《北京理工大学学报》2017年第9期970-974,共5页Transactions of Beijing Institute of Technology

基  金:国际合作研究项目

摘  要:采用传统的高斯混合模型(Gaussian mixture model,GMM)进行语音带宽扩展时,会出现所估计的特征参数过平滑的问题,其主要原因是协方差估计不准确而导致扩展的高频特征细节信息的丢失,因此本文提出了码本映射(codebook mapping,CM)与高斯混合模型相结合的语音带宽扩展算法.提取高、低频特征参数,并训练高斯混合模型,基于高斯混合模型参数训练偏移矢量的码本;在扩展阶段,利用偏移矢量的码本将低频偏移矢量映射为高频偏移矢量,再将高频偏移矢量与高斯混合模型估计部分相加作为估计的高频特征参数.对利用该方法进行带宽扩展后的语音质量进行主观/客观评测.实验结果表明,相比传统的GMM语音带宽方法,CM-GMM合成的高频语音更接近原始高频语音,明显消除了高频过平滑现象.Speech bandwidth extension(BWE)based on the conventional Gaussian mixture model(GMM)often suffers from the overly smoothed problem,and the main reason is the low accuracy of the estimated covariance which results in the loss of specific high frequency feature.Thus,a speech bandwidth extension base on codebook mapping(CM)and GMM was proposed in this paper.Firstly,the feature of low frequency(LF)and high frequency(HF)were extracted,and the GMM model was trained.Then,an offset vector codebook was designed based on the trained GMM parameters.In the reconstruction phase,LF offset vectors were transformed to HF offset vectors according to the trained offset vector codebook.The final HF feature parameter was obtained by adding the HF offset vectors to the estimated part by GMM.It is shown by subjective evaluations and objective evaluations that the CM-GMM significantly overcomes the overly smoothed problem and obviously improves the quality of the synthesized speech signals compared with the conventional GMM-based BWE method.

关 键 词:语音带宽扩展 高斯混合模型 码本映射 

分 类 号:TN929.53[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象