基于CTC-GRU模型的长沙方言识别

Changsha Dialect Recognition Based on CTC -GRU Model

作　　者：梁小林沈湘菲梁曌邱海琳 LIANG Xiaolin;SHEN Xiangfei;LIANG Zhao;QIU Hailin(School of Mathematics and Statistics Science,Changsha University of Science and Technology,Changsha 410114,China)

机构地区：[1]长沙理工大学数学与统计学院,湖南长沙410114

出　　处：《吉首大学学报（自然科学版）》2022年第2期45-52,共8页Journal of Jishou University(Natural Sciences Edition)

基　　金：国家自然科学基金面上资助项目(61972055);湖南省教育厅重点项目(17A003,18A145)。

摘　　要：为了识别大词汇量下连续长沙话方言语音,提出了基于CTC算法的门控线性单元神经网络模型.先通过梅尔倒谱系数提取语音的特征参数,再把提取的特征参数输入门控线性单元神经网络,用CTC算法进行训练优化,得到输入序列整个的预测标签.最后在自建的长沙话方言语料库上,以词错率作为评价指标,对CTC模型、GRU模型和CTC-GRU模型进行对比,结果表明CTC-GRU模型相对于其他2个模型收敛速度更快,结果更精准.In order to recognize continuous speech in Changsha dialect with a large vocabulary,a gated linear element neural network model based on Connectionist Temporal Classification(CTC)algorithm is proposed.Firstly,the characteristic parameters of speech are extracted by Mel-scale Frequency Cepstral Coefficients(MFCC),and then the extracted characteristic parameters are input into gated linear unit neural network.CTC algorithm is used for training and optimization,and the whole prediction label of input sequence is obtained.Finally,the results of the CTC model,the GRU model and the CTC-GRU model are compared on the self-built corpus of Changsha dialect,and the Word Error Rate(WER)is taken as the evaluation index.The results show that the CTC-GRU model can achieve faster convergence and greater accuracy compared with the other two models.

关键词：CTC-GRU模型梅尔倒谱系数长沙话方言识别词错率

分类号：O213[理学—概率论与数理统计] TP181[理学—数学]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于CTC-GRU模型的长沙方言识别

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于CTC-GRU模型的长沙方言识别

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索