基于角度压制比谱减的环境自适应双麦语音增强  

Environment adaptive dual-microphone speech enhancement basedon direction mitigation ratio spectral subtraction

在线阅读下载全文

作  者:张家扬 何伟[1,2] 童峰 卢荣富[3] 冯万健[3] ZHANG Jiayang;HE Wei;TONG Feng;LU Rongfu;FENG Wanjian(College of Ocean and Earth Sciences,Xiamen University,Xiamen 361005,China;National and Local Joint Engineering Research Center for Navigation and Location Service Technology of Xiamen University,Xiamen 361005,China;Yealink,Xiamen 361015,China)

机构地区:[1]厦门大学海洋与地球学院,福建厦门361005 [2]导航与位置服务技术国家地方联合工程研究中心(厦门大学),福建厦门361005 [3]厦门亿联网络技术股份有限公司,福建厦门361015

出  处:《厦门大学学报(自然科学版)》2024年第2期296-304,共9页Journal of Xiamen University:Natural Science

基  金:上海市科委“科技创新行动计划”项目(21DZ1205502);厦门市海洋产业项目(22CZB012HJ13)。

摘  要:[目的]针对智能终端小型化、使用场景多样化的发展趋势,研制一种既能满足严苛的尺度、算力、存储空间限制,又能实现环境自适应的双麦语音增强算法.[方法]考虑到麦克风阵列波束形成算法可以增强期望方向信号,同时抑制非期望方向的噪声,但小尺寸阵列波束主瓣波束宽度较宽、影响增强效果.在小尺寸双麦对目标方向进行波束对准增强的基础上,参考干扰方向噪声,进一步对目标方向语音进行谱减处理,并引入角度压制比实时检测干扰方向噪声的能量估计,实现对不同混响、噪声类型的自适应处理,从而提升语音增强效果.[结果]角度压制比随混响时间增加而增大,与信噪比不相关.相对于原始带噪信号、滤波-累加波束形成(filter-and-sum beamforming, FSB)信号、FSB结合固定对向谱减的语音增强信号,通过FSB结合角度压制比自适应对向谱减得到的语音增强信号,在不同噪声类型、不同信噪比和不同混响时间下,均能得到最高的分段信噪比得分和大多数的最高客观语音质量评估得分.[结论]角度压制比能一定程度地反映不同的混响情况,利用角度压制比得到的谱减阈值具有一定的环境适应性.[Objective]The voice front-end plays an important role in collecting and ensuring the quality of speech signals so that different types of speech processing can be supported.The increasing application of small size intelligent terminals in highly diverse application scenarios brings significant challenges to the speech enhancement performance of the voice front-ends under complicated reverberant and noisy environments.As the beam directivity of microphone array beamforming algorithm depends highly on microphone array sizes and element numbers,dual-microphones that are popularly adopted in small size intelligent terminals endure substantial performance degradation.In this paper,an environment adaptive dual-microphone speech enhancement algorithm based on direction mitigation ratio spectral subtraction is proposed to improve the speech-enhancement performance of dual-microphone array under different environments.[Methods]First,a least-squares(LS)driven filter-and-sum(FSB)dual-microphone beamformer is designed to yield the preliminary speech enhancement with its signal beam and noise beam aiming at desired directions and undesired directions,respectively.Then,the noise reference collected by the noise beam is used to remove residual noises that are contained in the beamforming enhanced speech by the way of spectral subtraction.Specifically,a direction mitigation ratio(DMR)parameter is defined to carry the environmental information,which is calculated in each frame to determine the spectral subtraction threshold.Thus,by updating the DMR in real time,the spectral subtraction processing between the enhanced speech and noise reference is adaptively controlled to achieve environmental prediction and achieve improved effects of residual noise removing.[Results]For the purpose of performance evaluation and comparison,practical experiments are carried out in anechoic laboratory,in which speakers located in different directions are used as artificial noise resources to generate environmental noises with different signal-to-n

关 键 词:双麦 麦克风阵列 波束形成 谱减 角度压制比 

分 类 号:TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象