Voiceprint Feature Extraction SOC Design Based on Formant and Mel Cepstral  Cited by: 2


Authors: XI Qingyun; QIU Changjiang; TAO Bairui [2]; GUAN Xinyu; MIAO Fengjuan [2]

Affiliations: [1] XingAn League Branch, Inner Mongolia Radio and Television University, Ulanhot, Inner Mongolia 137400, China; [2] College of Communications and Electronic Engineering, Qiqihar University, Qiqihar, Heilongjiang 161006, China

Source: Chinese Journal of Sensors and Actuators, 2023, No. 5, pp. 782-787 (6 pages)

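Funding: Natural Science Foundation of Heilongjiang Province (ZD2019F004); Heilongjiang Province Higher Education Teaching Reform Project (SJGY20200781); Basic Research Special Project of the Heilongjiang Provincial Department of Education (135309115).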

Abstract: Vowel formants reflect the physical characteristics of the vocal tract (a resonator) and are not easily affected by the environment, so they characterize a speaker's voiceprint well. On this basis, a system-on-chip (SOC) design is proposed for speaker-formant-adaptive MFCC (Mel-frequency cepstral coefficient) feature extraction. First, three groups of formants are extracted from the vowels of the speaker's speech and used to design the Mel triangular filter bank. The speaker's speech features are then adaptively fused, based on the ratio of the matrix parameters of the traditional MFCC to those of the formant-improved MFCC, to obtain the improved MFCC. Performance simulation is carried out in MATLAB, the Verilog HDL code is designed in Quartus II, and the SOC design, compilation, simulation, and download verification are completed on an FPGA (field-programmable gate array) development board. The results show that, in relatively high signal-to-noise-ratio environments, the feature vectors obtained with the adaptively fused, formant-improved MFCC are more robust than those obtained with the traditional MFCC. The technique has considerable application value in the design of speaker voiceprint identification sensors.
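The abstract does not give the filter-bank construction or the fusion formula, so the following is only a minimal Python sketch of the general idea, not the authors' implementation. It builds a conventional Mel triangular filter bank, then a variant whose centers are moved onto three formant frequencies, and finally blends the two MFCC vectors. The function names, the "snap nearest center to the formant" rule, and the ratio-based weight in `fuse` are all assumptions made for illustration; the Mel formula is the common 2595·log10(1 + f/700) form.

```python
import math

def hz_to_mel(f_hz):
    # Common (HTK-style) Hz-to-mel mapping.
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_points(n_filters, sample_rate):
    """Return n_filters + 2 edge/center frequencies (Hz), equally spaced in mel."""
    top = hz_to_mel(sample_rate / 2.0)
    return [mel_to_hz(top * i / (n_filters + 1)) for i in range(n_filters + 2)]

def snap_to_formants(points_hz, formants_hz):
    """ASSUMPTION (not from the paper): move the nearest interior center
    onto each formant so that a filter peaks exactly at that formant."""
    pts = list(points_hz)
    for f in formants_hz:
        idx = min(range(1, len(pts) - 1), key=lambda i: abs(pts[i] - f))
        pts[idx] = f
    return sorted(pts)

def triangular_filterbank(points_hz, n_fft, sample_rate):
    """One triangular filter per interior point, sampled on the FFT bin grid."""
    bin_hz = [k * sample_rate / n_fft for k in range(n_fft // 2 + 1)]
    bank = []
    for lo, mid, hi in zip(points_hz, points_hz[1:], points_hz[2:]):
        row = []
        for f in bin_hz:
            rising = (f - lo) / (mid - lo)
            falling = (hi - f) / (hi - mid)
            row.append(max(0.0, min(rising, falling)))
        bank.append(row)
    return bank

def mfcc_from_power(power_spectrum, bank, n_coeffs):
    """Log filter-bank energies followed by an unnormalized DCT-II."""
    log_e = [math.log(sum(w * p for w, p in zip(row, power_spectrum)) + 1e-12)
             for row in bank]
    n = len(log_e)
    return [sum(e * math.cos(math.pi * k * (m + 0.5) / n)
                for m, e in enumerate(log_e))
            for k in range(n_coeffs)]

def fuse(c_trad, c_formant):
    """ASSUMPTION: placeholder fusion rule. Each coefficient pair is blended
    with a weight derived from the ratio of their magnitudes; the paper's
    actual ratio-based rule is not stated in the abstract."""
    fused = []
    for a, b in zip(c_trad, c_formant):
        total = abs(a) + abs(b)
        w = abs(a) / total if total else 0.5
        fused.append(w * a + (1.0 - w) * b)
    return fused
```

As a usage illustration, with a 16 kHz sample rate, a 512-point FFT, and hypothetical formants at 700, 1200, and 2600 Hz, one would call `mel_points(20, 16000)`, pass the result through `snap_to_formants`, build both banks with `triangular_filterbank`, compute two coefficient vectors with `mfcc_from_power` on the same frame's power spectrum, and combine them with `fuse`. Pure standard-library Python is used here only to keep the sketch self-contained; a fixed-point hardware design such as the FPGA one described above would differ substantially.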

Keywords: voiceprint recognition; formant; Mel frequency; adaptive fusion; system on chip

Classification No.: TN492 [Electronics and Telecommunications: Microelectronics and Solid-State Electronics]

 
