Design and Build of Identification System of Guizhou Dialect Based on Improved Long Short-Term Memory  Cited by: 3

Authors: AI Hu (艾虎) [1]; LI Fei (李菲) [2]

Affiliations: [1] Department of Criminal Technology, Guizhou Police College, Guiyang 550005, China [2] Faculty of Humanities, The Education University of Hong Kong, Hong Kong 999077, China

Source: Science Technology and Engineering (《科学技术与工程》), 2019, No. 5, pp. 203-210 (8 pages)

Funding: Supported by the Guizhou Provincial Science and Technology Program (黔科合【2016】支撑2847)

Abstract: Identifying Chinese dialects can provide important clues for criminal investigation. To identify Guizhou dialects, a Guizhou dialect identification system was designed and implemented. The system combines Client/Server and Browser/Server architectures. Its client is implemented in Matlab and includes an improved long short-term memory (LSTM) neural network algorithm; it is used mainly for dialect identification and for collecting dialect speech samples. Dialect samples were collected from six regions of Guizhou Province. First, Mel-frequency cepstral coefficients (MFCC) are extracted from the speech samples and from regional catchphrases; next, the MFCC of the corresponding region's catchphrase is appended to the MFCC of each speech sample; finally, singular value decomposition is applied to obtain the system's input data. The system's website is used mainly for storing and modifying training data and is built on ASP.NET using C#, JavaScript, and T-SQL. Experimental results show that the system identifies Guizhou dialects efficiently, is convenient for users, and gives objective and consistent identification results.
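The feature-preparation steps in the abstract (append the regional catchphrase MFCC to each sample's MFCC, then apply singular value decomposition) can be illustrated with a minimal Python sketch. This is not the authors' Matlab implementation. The function names `append_catchphrase` and `svd_features`, the 13-coefficient MFCC width, the frame counts, and the random stand-in data are all assumptions made for illustration. The abstract also does not say which part of the decomposition is fed to the network; using the singular values as a fixed-length vector is one plausible reading.

```python
import numpy as np


def append_catchphrase(sample_mfcc, phrase_mfcc):
    """Append catchphrase MFCC frames after the speech-sample MFCC frames.

    Both inputs are (n_frames, n_coeffs) arrays; the layout is an assumption.
    """
    if sample_mfcc.shape[1] != phrase_mfcc.shape[1]:
        raise ValueError("MFCC matrices must have the same number of coefficients")
    return np.vstack([sample_mfcc, phrase_mfcc])


def svd_features(mfcc):
    """Return the singular values of an MFCC matrix, in descending order.

    The result has length min(n_frames, n_coeffs), so it is a fixed-length
    vector as long as every recording has at least n_coeffs frames.
    """
    return np.linalg.svd(mfcc, compute_uv=False)


# Random stand-ins for real MFCC matrices (hypothetical sizes).
rng = np.random.default_rng(0)
sample = rng.normal(size=(120, 13))   # speech sample: 120 frames x 13 coefficients
phrase = rng.normal(size=(40, 13))    # regional catchphrase: 40 frames x 13 coefficients

combined = append_catchphrase(sample, phrase)
features = svd_features(combined)
print(combined.shape, features.shape)  # (160, 13) (13,)
```

Reducing each variable-length recording to its singular values gives every sample the same dimensionality, which is one common reason to place such a step ahead of a classifier. Whether the paper keeps the singular values, the singular vectors, or a truncated reconstruction cannot be determined from the abstract alone.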

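Keywords: Chinese dialect identification system; Mel-frequency cepstral coefficients; regional catchphrases; singular value decomposition; long short-term memory neural network; ASP.NET; C#; Matlab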

Classification number: TP391.42 [Automation and Computer Technology: Computer Application Technology]

 
