基于时域Gammatone滤波特征的广播语种识别  被引量:4

Language Identification for Broadcasting Signal Based on Time-domain Gammatone Filtering Features

在线阅读下载全文

作  者:陈亮 邵玉斌[1] 龙华[1] 杜庆治[1] 彭艺[1] 唐维康 CHEN Liang;SHAO Yubin;LONG Hua;DU Qingzhi;PENG Yi;TANG Weikang(Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming,Yunnan 650500,China)

机构地区:[1]昆明理工大学信息工程与自动化学院,云南昆明650500

出  处:《信号处理》2022年第3期599-608,共10页Journal of Signal Processing

基  金:国家自然科学基金(61761025)。

摘  要:针对广播语种识别问题,提出一种语音时域滤波方法,用gammatone时域函数与预处理后的语音信号进行卷积滤波,再分帧加窗并求对数化能量得到时域GF(gammatone filterbank)特征。将特征参数图像化表示,然后通过VGG19和Resnet34分类网络进行语种识别实验。同时,也使用自动色阶算法对加噪语音的图像化特征参数进行去噪,并对比不同维数的特征参数以及不同噪声类型和信噪比对语种识别率的影响。结果表明,采用该特征参数的广播语种识别准确率高于使用传统的GFCC特征、GFCC-D-A特征、GFCC-SDC特征及Fbank特征,且在不同噪声类型和不同信噪比的广播语音识别场景下,语种识别准确率均有一定提升。A speech time-domain filtering method is proposed for the broadcast language identification problem,where the gammatone time-domain function is used to convolutionally filter the pre-processed speech signal,and the windowing and signal energy logarithmizing are then used to find the time-domain gammatone filterbank features in each separate frame.After that,the feature parameters are represented pictorially. With the obtained feature parameters,the language identification experiments are carried out by VGG19 and Resnet34 classification networks. The automatic color scale algorithm is also used to denoise the imaged feature parameters of noise-added speech and to compare the effect of different dimensional feature parameters and different noise types and signal-to-noise ratios on the performance of language identification accuracy. The results show that the language recognition accuracy with the proposed feature parameters is higher than that with the traditional GFCC feature,GFCC-D-A feature,GFCC-SDC feature and Fbank feature,and the language identification accuracy is also improved in different noise types and different signal-to-noise ratios under broadcast speech identification scenarios.

关 键 词:广播语种识别 gammatone时域滤波 时域gammatone filterbank 自动色阶算法 

分 类 号:TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象