Authors: WANG Wenhua; XIA Xiuyu[1] (School of Electronic Information, Sichuan University, Chengdu 610064, China)
Source: Journal of Chengdu University of Information Technology (《成都信息工程大学学报》), 2024, No. 3, pp. 275-282 (8 pages)
Abstract: The human auditory system has a fine and intricate structure that lets it understand speech accurately even in noisy environments, so using a detailed cochlear model as the front end can improve speech processing. This paper uses the cascade of asymmetric resonators with fast-acting compression (CARFAC) as a model of the auditory periphery and combines it with the stabilized auditory image (SAI) to obtain an accurate precortical auditory model. Based on this auditory model, a more accurate pitch contour is extracted, the pitch information is used for acoustic scene analysis, and robust speech features are synthesized and fed into a neural network for supervised training to achieve speech enhancement. Experimental results show that, under noisy conditions, the features extracted by the auditory model perform well on a range of speech evaluation metrics, represent the speech signal better, and exhibit a degree of robustness.
Keywords: CARFAC model; stabilized auditory image; speech enhancement system; pitch extraction
Classification: TP391.4 [Automation and Computer Technology: Computer Application Technology]