基于GMM的甚低码率语音编码器  被引量:2

A Very Low Bit-rate Speech Coder Based on GMM

在线阅读下载全文

作  者:李平[1] 曾毓敏[1] 吴婷婷[1] 吴华玉[1] 

机构地区:[1]南京师范大学物理科学与技术学院,南京210097

出  处:《光电子技术》2007年第2期110-114,共5页Optoelectronic Technology

基  金:南京师范大学留学回国启动基金(2006102XLH0131)

摘  要:提出了一种新颖的基于高斯混合模型(GMM)的甚低码率语音编码系统。该编码器利用GMM对短时语音谱包络进行拟合的方法来对语音进行参数化表示。编码时,语音经预处理、分帧加窗后,再经FFT分析得到分帧语音的信号频谱,并获得平滑谱包络。然后采用GMM对谱包络进行拟合,用GMM参数(均值、方差、权重)对语音谱加以表示。由于GMM参数较少,从而可以使得码率甚低。解码时,根据编码逆运算生成谱包络,浊音信号利用正弦模型加以合成,清音信号经IFFT合成。实验仿真结果表明:该编码器在传输码率降低到2.35 kb/s时,仍可获得音质令人满意的解码语音。A novel very low bit-rate speech coder based on Gaussian mixture model (GMM), which is used to parameterize the short-time speech spectrum envelope, is proposed in this paper. In the coding procedure, speech signal is firstly pre-emphasized and segmented. Secondly, the segmented speech is transformed to spectrum domain and the spectrum envelope of the segmented speech is obtained. Then the spectrum envelope is parameterized by GMM. So the segmented speech is represented by the means, covariances and mixture weights of GMM. In the decoding procedure, the spectrum envelope of segmented speech is reconstructed with the inverse method of the coding. Then the speech is synthesized based on the reconstructed spectrum envelope, in which the voiced speech is synthesized by sinusoid model and the unvoiced speech is just synthesized by inverse FFT. Since the segmented speech can be rep- resented by very few parameters of GMM, the bit-rate of the coder is very low. The result of the experiment shows that the proposed speech coder presents a good performance. The quality of the synthesized speech is still satisfying when the bit-rate of the coder is reduced to 2, 35 kb/s.

关 键 词:语音编码 高斯混合模型 甚低码率 谱包络 

分 类 号:TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象