检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:陈亮 邵玉斌[1] 龙华[1] 杜庆治[1] 彭艺[1] 唐维康 CHEN Liang;SHAO Yubin;LONG Hua;DU Qingzhi;PENG Yi;TANG Weikang(Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming,Yunnan 650500,China)
机构地区:[1]昆明理工大学信息工程与自动化学院,云南昆明650500
出 处:《信号处理》2022年第3期599-608,共10页Journal of Signal Processing
基 金:国家自然科学基金(61761025)。
摘 要:针对广播语种识别问题,提出一种语音时域滤波方法,用gammatone时域函数与预处理后的语音信号进行卷积滤波,再分帧加窗并求对数化能量得到时域GF(gammatone filterbank)特征。将特征参数图像化表示,然后通过VGG19和Resnet34分类网络进行语种识别实验。同时,也使用自动色阶算法对加噪语音的图像化特征参数进行去噪,并对比不同维数的特征参数以及不同噪声类型和信噪比对语种识别率的影响。结果表明,采用该特征参数的广播语种识别准确率高于使用传统的GFCC特征、GFCC-D-A特征、GFCC-SDC特征及Fbank特征,且在不同噪声类型和不同信噪比的广播语音识别场景下,语种识别准确率均有一定提升。A speech time-domain filtering method is proposed for the broadcast language identification problem,where the gammatone time-domain function is used to convolutionally filter the pre-processed speech signal,and the windowing and signal energy logarithmizing are then used to find the time-domain gammatone filterbank features in each separate frame.After that,the feature parameters are represented pictorially. With the obtained feature parameters,the language identification experiments are carried out by VGG19 and Resnet34 classification networks. The automatic color scale algorithm is also used to denoise the imaged feature parameters of noise-added speech and to compare the effect of different dimensional feature parameters and different noise types and signal-to-noise ratios on the performance of language identification accuracy. The results show that the language recognition accuracy with the proposed feature parameters is higher than that with the traditional GFCC feature,GFCC-D-A feature,GFCC-SDC feature and Fbank feature,and the language identification accuracy is also improved in different noise types and different signal-to-noise ratios under broadcast speech identification scenarios.
关 键 词:广播语种识别 gammatone时域滤波 时域gammatone filterbank 自动色阶算法
分 类 号:TN912.3[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.144.98.87