在波形网络中融合相位信息的骨导语音增强  被引量:5

Bone-conducted speech enhancement using WaveNet fused with phase information

在线阅读下载全文

作  者:郑昌艳 杨吉斌 张雄伟 孙蒙 ZHENG Changyan;YANG Jibin;ZHANG Xiongwei;SUN Meng(High-tech Institute,Fan Gong-ting South Street on the 12th,Qingzhou 262500;Army Engineering University,Nanjing 210007)

机构地区:[1]火箭军士官学校,青州262500 [2]陆军工程大学,南京210007

出  处:《声学学报》2021年第2期309-320,共12页Acta Acustica

基  金:国家自然科学基金项目(61471394);江苏省优秀青年基金项目(BK20180080)资助。

摘  要:已有骨导语音增强算法重点关注语音幅度谱增强,在波形合成时会因为相位不匹配导致语音质量下降。为解决该问题,提出了一种融合相位信息的波形网络(WaveNet)模型实现骨导语音增强波形生成。该方法以频带扩展WaveNet为基础,融合骨导语音相位谱信息与增强的语音幅度谱作为模型的条件特征,根据融合特征生成增强语音波形,实现了相位信息的有效利用。仿真实验综合对比了群时延谱和瞬时频率偏差谱相位特征,主客观结果表明,不论是采用串联融合还是卷积融合方式,骨导语音相位信息均有效补充了原有幅度谱条件特征,改善了语音增强效果。利用串联方式融合群时延谱特征可得到最佳结果,相比于原始骨导语音,平均意见得分(MOS)提升了约54.3%。The existing bone-conducted speech enhancement algorithms mainly focus on the enhancement of speech magnitude,and use the mismatch phase to synthesize waveform,which leads to the degradation of speech quality.In order to solve this problem,a WaveNet model based on phase information fusion is proposed to generate the enhanced waveform.The proposed method is based on bandwidth extended WaveNet,and combines the phase information of bone-conducted speech and the magnitude of enhanced speech as the conditional features.The waveform is generated under the fused feature conditions,where the phase information is effectively utilized.The performances of group delay spectrum and instantaneous frequency deviation spectrum are compared in experiments.The results show that the phase information of bone-conducted speech can effectively complement the original magnitude condition and improve the performance of speech enhancement,no matter whether they are fused by concatenation or convolution.The best result is obtained by fusing the group delay spectrum by concatenation.Compared with the original bone-conducted speech,the Mean Opinion Score(MOS) score is improved by 54.3%.

关 键 词:语音增强 幅度谱 群时延 频带扩展 语音波形 瞬时频率 骨导 波形合成 

分 类 号:TN912.35[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象