采用扩展型双线性变换法将耳语音转换为正常语音的研究  被引量:4

Research of conversion from whispered speech to normal speech by the extended bilinear transformation

在线阅读下载全文

作  者:陶智[1] 赵鹤鸣[2] 谈雪丹[1] 顾济华[1] 张晓俊[1] 吴迪[1] 

机构地区:[1]苏州大学物理科学与技术学院能源学院,江苏215006 [2]苏州大学电子信息学院,江苏215006

出  处:《声学学报》2012年第6期651-658,共8页Acta Acustica

基  金:国家自然科学基金(61271359;61071215);苏州市科技发展计划(SYG201001);苏州大学捷美生物医学工程仪器联合重点实验室资助项目

摘  要:提出了一种采用扩展型双线性变换将耳语音转换为正常语音的方法。根据耳语音在不同频段的共振峰偏移程度不同,将耳语音的频谱进行分段处理,在此基础上建立耳语音转换为正常语音的转换函数。由于耳语音在各频段相对于正常语音非线性偏移,在双线性变换函数中引入扩展因子,使其对频谱的非线性偏移与对共振峰带宽的压缩更加符合耳语音转换为正常语音的实际转换需求,有效减小了转换语音与正常语音的谱失真距离。实验结果表明,本文的转换语音在音质和可懂度上均得到了有效提高。One method of conversion from whispered speech to formal speech based on the extended bilinear trans- formation is proposed. On account of the different deviation degrees of the whisper's formants in different frequency bands, the spectrum of the whispered speech will be processed in the separate partitions of this paper. On the basis of this spectrum, we will establish a conversion function able to usefully convert whispered speech to formal speech. Because of the whisper's non-linear offset in relation to normal speech, this paper introduces an expansion factor in the bilinear transform function making it correspond more closely to the actual conversion demands of whispered speech to formal speech. The introduction of this factor takes the non-linear move of the spectrum and the compression of the formant bandwidth into consideration, thus effectively reducing the spectrum distortion distance in the conversion. The experiment results show that the conversion presented in this paper effectively improves both the sound quality and the intelligibility of whispered speech.

关 键 词:双线性变换法 语音转换 扩展型 耳语音 分段处理 转换函数 扩展因子 变换函数 

分 类 号:TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象