一种用于WI语音编码的相位预测式矢量量化方法  被引量:4

A Predictive Phase Vector Quantization Method in WI Speech Coding

在线阅读下载全文

作  者:陈悦[1] 鲍长春[1] 

机构地区:[1]北京工业大学电子信息与控制工程学院,北京100022

出  处:《电子与信息学报》2007年第11期2672-2675,共4页Journal of Electronics & Information Technology

基  金:国家自然科学基金(60372063);北京市自然科学基金(4042009);北京市教委科技发展计划项目(KM200710005001)资助课题

摘  要:在传统的低比特率语音编码中,考虑到人耳对相位信息不敏感而经常忽略相位信息,这将导致语音粗糙、刺耳甚至音调发生改变。为了获得高质量的声码器,语音的相位信息是不能不考虑的。该文在散布相位矢量量化方法的基础上进一步去除了相位冗余,在波形内插(Waveform Interpolation,WI)编码模型中对相邻帧慢渐变波形(Slowly Evolving Waveform,SEW)的相位谱差值进行预测式矢量量化。实验发现,该方法大大改善了重建语音效果,明显提高了语音的自然度和清晰度。主观A/B测试结果显示,该方法与固定相位法相比,经4~6bit的相位量化可使合成语音质量得到显著的改善,相比散布相位矢量量化方法,女声的语音合成质量有所改进。In traditional low bit-rate speech coding, considering that ears are not sensitive to phase information, the phase information is often neglected, and this will result in coarse and harsh speech quality, and it even may lead to inflection in pitch. In order to obtain a high-quality speech eodee, the phase information of speech should be included in codec. In this paper, the phase redundancy is reduced further based on the dispersion phase vector quantization method. In the waveform interpolation (WI) speech coding model, the difference of SEW's phase spectra of conjoint frames is quantized using predictive vector quantization. The result of this scheme reveals that the speech quality is improved, and its naturalness and articulation are increased greatly. Subjective A/B listening test indicates that the reconstructed speech's quality of this method is better than that of fixed phase with 4-6 bit. Compared with the dispersion phase vector quantization method, the synthesis speech is slightly improved for female speakers.

关 键 词:语音编码 波形内插 矢量量化 

分 类 号:TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象