基于神经网络加速器的FPGA语音情感识别系统  

DESIGN OF FPGA SPEECH EMOTION RECOGNITION SYSTEM BASED ON NEURAL NETWORK ACCELERATOR

在线阅读下载全文

作  者:乔栋 陈章进[1,2] 邓良 张廓 Qiao Dong;Chen Zhangjin;Deng Liang;Zhang Kuo(Microelectronics Research and Development Center of Shanghai University,Shanghai 200444,China;Computing Center of Shanghai University,Shanghai 200444,China)

机构地区:[1]上海大学微电子研究与开发中心,上海200444 [2]上海大学计算中心,上海200444

出  处:《计算机应用与软件》2024年第10期163-169,246,共8页Computer Applications and Software

基  金:国家自然科学基金项目(61674100)。

摘  要:针对现有语音情感识别系统的部署功耗高、不具有便携性的缺点,提出一种基于神经网络加速器的FPGA语音情感识别系统设计。在FPGA上实现语音MFCC(Mel Frequency Cepstrum Coefficient)特征的提取,便于进行识别;为神经网络加速器设计指令生成算法,将网络模型部署在神经网络加速器实现语音情感识别。整个系统主要硬件资源消耗为37078个LUT和153个DSP,支持在主流FPGA平台上的部署。经过检验,语音情感识别系统的指令运算误差可达0.06以下,输出误差为0.0004以下,满足语音情感识别的需求。Aiming at the disadvantages of high-power consumption and no portability in the deployment of existing speech emotion recognition system,this paper proposes a design of FPGA speech emotion recognition system based on neural network accelerator.Mel frequency cepstrum coefficient(MFCC)feature extraction of speech was realized on FPGA,which was convenient for recognition.The instruction generation algorithm was designed for the neural network accelerator,and the network model was deployed in the neural network accelerator to realize speech emotion recognition.The main hardware resource consumption of the whole system is 37078 LUTs and 153 DSPs,which supports the deployment on the mainstream FPGA platform.After testing,the instruction operation error of speech emotion recognition system is less than 0.06,and the output error is less than 0.0004,which meets the needs of speech emotion recognition.

关 键 词:MFCC 语音情感识别 神经网络加速器 FPGA 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象