检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:李平[1] 曾毓敏[1] 吴婷婷[1] 吴华玉[1]
机构地区:[1]南京师范大学物理科学与技术学院,南京210097
出 处:《光电子技术》2007年第2期110-114,共5页Optoelectronic Technology
基 金:南京师范大学留学回国启动基金(2006102XLH0131)
摘 要:提出了一种新颖的基于高斯混合模型(GMM)的甚低码率语音编码系统。该编码器利用GMM对短时语音谱包络进行拟合的方法来对语音进行参数化表示。编码时,语音经预处理、分帧加窗后,再经FFT分析得到分帧语音的信号频谱,并获得平滑谱包络。然后采用GMM对谱包络进行拟合,用GMM参数(均值、方差、权重)对语音谱加以表示。由于GMM参数较少,从而可以使得码率甚低。解码时,根据编码逆运算生成谱包络,浊音信号利用正弦模型加以合成,清音信号经IFFT合成。实验仿真结果表明:该编码器在传输码率降低到2.35 kb/s时,仍可获得音质令人满意的解码语音。A novel very low bit-rate speech coder based on Gaussian mixture model (GMM), which is used to parameterize the short-time speech spectrum envelope, is proposed in this paper. In the coding procedure, speech signal is firstly pre-emphasized and segmented. Secondly, the segmented speech is transformed to spectrum domain and the spectrum envelope of the segmented speech is obtained. Then the spectrum envelope is parameterized by GMM. So the segmented speech is represented by the means, covariances and mixture weights of GMM. In the decoding procedure, the spectrum envelope of segmented speech is reconstructed with the inverse method of the coding. Then the speech is synthesized based on the reconstructed spectrum envelope, in which the voiced speech is synthesized by sinusoid model and the unvoiced speech is just synthesized by inverse FFT. Since the segmented speech can be rep- resented by very few parameters of GMM, the bit-rate of the coder is very low. The result of the experiment shows that the proposed speech coder presents a good performance. The quality of the synthesized speech is still satisfying when the bit-rate of the coder is reduced to 2, 35 kb/s.
分 类 号:TN912.3[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.170