检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:解元 邹涛 余锦视 孙为军 XIE Yuan;ZOU Tao;YU Jinshi;SUN Weijun(Department of Robotics Engineering,School of Mechanical and Electrical Engineering,Guangzhou University,Guangzhou,Guangdong 510006,China;Department of Automatic Control,School of Automation,Guangdong University of Technology,Guangzhou,Guangdong 510006,China)
机构地区:[1]广州大学机械与电气工程学院机器人工程系,广东广州510006 [2]广东工业大学自动化学院自动控制系,广东广州510006
出 处:《信号处理》2024年第12期2238-2248,共11页Journal of Signal Processing
基 金:广州市基础与应用基础研究项目(SL2022A04J00289);国家自然科学基金(62003095,52171331);广东省基础与应用基础研究基金(2023A1515011311);广州市市校联合实验室项目(2023A03J0120)。
摘 要:语音增强的目的是从受噪声干扰的语音信号中提取纯净的目标语音信号。然而,在混响环境下接收到的声源信号是目标源信号和许多延迟与衰减的反射的集合,这大大降低了目标语音的质量和可懂度。为了探索带噪声和声学混响场景下的语音增强问题,本文在目标语音和声学环境的先验信息未知的情况下,设计一种基于盲信号提取的无监督的多通道语音增强方法。首先,将后期反射产生的混响视为附加的、不相关的噪声分量,构建一个带噪声和声学混响的语音增强新模型,使用原始-对偶分裂算法,通过时频掩码对目标语音信号进行隐式建模。然后,利用倒谱阈值法增强目标语音信号的谐波结构,使得含噪声混响语音信号中的目标语音信号被增强,并且具有比目标语音信号小能量的其他分量被衰减。最后,由于每个信道上的干扰信号都被衰减,使得在每次迭代中提取的目标语音信号具有更好的排他性和非混合性,从而设计一种自适应时频类维纳掩蔽逆滤波器实现去混响去噪声的增强效果。实验部分,分别对噪声和混响条件下的实际语音信号进行了去混响去噪声的性能评估和分析,实验结果表明,所提算法具有很好的去混响去噪声的性能,同时对比于几种比较流行的多通道语音增强算法,验证了本文算法的增强效果更优越。The purpose of speech enhancement is to extract a pure target speech signal from a noisy speech signal.How‐ever,the sound source signal received in a reverberation environment is a collection of the target source signal and many delayed and attenuated reflections,which significantly reduces the quality and intelligibility of the target speech.To ex‐plore the problem of speech enhancement in noisy and acoustic reverberation scenarios,this study proposes an unsuper‐vised multichannel speech enhancement method based on blind signal extraction when the prior information of the target speech and acoustic environment is unknown.First,a new speech enhancement model with noise and acoustic reverbera‐tion is constructed by considering the reverberation generated by later reflections as additional and unrelated noise com‐ponents,and the target signal is implicitly modeled through a time-frequency mask using the primal-dual splitting algo‐rithm.Subsequently,the cepstrum threshold method is used to enhance the harmonic structure of the target speech sig‐nal,enhancing the target speech signal in the noisy reverberation speech signal and attenuating other components with less energy than the target speech signal.Finally,as the interference signal on each channel is attenuated,the extracted target speech signal in each iteration has better exclusivity and is unmixed,an adaptive time-frequency Wiener masking inverse filtering is designed to enhance dereverberation and denoising.An experiment was conducted to evaluate and ana‐lyze the performance of dereverberation and denoising for actual speech signals under noisy and reverberation condi‐tions.The experimental results demonstrated that the proposed algorithm has excellent performance in dereverberation and denoising.Additionally,we verified that the enhancement effect of the proposed algorithm is superior to several popular multi-channel speech enhancement algorithms.
关 键 词:语音增强 盲信号提取 声学混响 干扰语音消除 逆滤波
分 类 号:TN912.3[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.224.96.135