检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:薛珮芸 师晨康 白静[1] 赵建星 汪思斌 XUE Peiyun;SHI Chenkang;BAI Jing;ZHAO Jianxing;WANG Sibin(College of Electronic Information and Optical Engineering,Taiyuan University of Technology,Jinzhong 030600,China;Postdoctor Workstation,Shanxi Institute of Advanced Innovation,Taiyuan 030002,China)
机构地区:[1]太原理工大学电子信息与光学工程学院,山西晋中030600 [2]山西高等创新研究院博士后科研工作站,山西太原030002
出 处:《无线电工程》2024年第4期1026-1033,共8页Radio Engineering
基 金:山西省应用基础研究计划项目(201901D111094);山西省基础研究计划(20210302124544)。
摘 要:针对单个语音去噪算法在去噪过程中关注点较为单一,而多个语音去噪算法在融合时存在细节信息被削弱、融合效果不理想的问题,提出一种多个语音去噪算法下的自适应门限融合策略,将带噪信号分别经过3种不同的去噪算法得到3个去噪信号;根据自适应门限值以帧为单位进行帧筛选,得到自适应门限融合策略下的去噪信号;为提高识别效果,采用倒谱提升器对Gammatone频率倒谱系数(Gammatone Frequency Cepstrum Coefficient, GFCC)进行改进,并联合支持向量机进行噪声环境下的语音识别。实验结果表明,在5、10、15、20 dB四种信噪比下,通过该融合策略所得到的去噪信号与目前主流的顺序融合及多级融合方式相比,在语音识别率方面平均提高3.6%,融合倒谱提升器的GFCC特征相比于GFCC特征平均提高了2.2%。To solve the problem that a single speech denoising algorithm has a single focus in the process of denoising,while the details of multiple speech denoising algorithms are weakened and the fusion effect is not ideal,an adaptive threshold fusion strategy under multiple speech denoising algorithms is proposed.Three denoising signals are obtained by using three different denoising algorithms.Frame filtering is carried out according to the adaptive threshold,and then the denoising signal under the adaptive threshold fusion strategy is obtained.On the other hand,in order to improve the recognition effect,the Gammatone Frequency Cepstrum Coefficient(GFCC)is improved by using a cepstrum lifter,and the speech recognition in noisy environment is carried out jointly with support vector machine.The experimental results show that under the four signal to noise ratios of 5,10,15 and 20 dB,the speech recognition rate of the denoised signals obtained by the fusion strategy is improved by 3.6%on average compared with the current mainstream sequential fusion and multistage fusion methods,and the GFCC feature using cepstrum lifter is improved by 2.2%on average compared with the GFCC feature.
关 键 词:语音去噪 自适应门限融合 带噪语音 帧筛选 Gammatone滤波器
分 类 号:TN912.35[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:13.59.173.30