Research on End-to-end Speech Recognition System for Uyghur (端到端维吾尔语语音识别研究)

Cited by: 2

Authors: DING Feng-lin (丁枫林), GUO Wu (郭武) [1], SUN Jian (孙健) [1] (National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027, China)

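Affiliation: [1] National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China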

Source: Journal of Chinese Computer Systems (《小型微型计算机系统》), 2020, No. 1, pp. 19-23 (5 pages)

Funding: Supported by a 2016 project of the National Key R&D Program of the Ministry of Science and Technology (YF2100060003)

Abstract: In recent years, end-to-end speech recognition systems have been widely adopted because they are structurally simpler and easier to train than conventional hybrid models, and they have achieved remarkable results on widely spoken languages such as Chinese and English. This paper combines the self-attention mechanism with the Connectionist Temporal Classification (CTC) loss function and applies the resulting end-to-end model to Uyghur speech recognition. Uyghur is a typical agglutinative language whose rich word-formation patterns make its vocabulary extremely large, so Byte Pair Encoding (BPE) is introduced to generate suitable modeling units for the end-to-end output layer. On the King-ASR450 Uyghur data set, the proposed method clearly outperforms both the classic hybrid system based on the Hidden Markov Model (HMM) and the end-to-end model based on a bidirectional long short-term memory (BiLSTM) network, reaching a final word accuracy of 91.35%.

Keywords: speech recognition; Uyghur; end-to-end; self-attention; byte pair encoding; connectionist temporal classification

Classification number: TP183 [Automation and Computer Technology: Control Theory and Control Engineering]
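
Illustrative note: the abstract states that Byte Pair Encoding is used to derive the output units, but this record gives no implementation details. The following is a minimal sketch of the generic BPE merge-learning procedure, not the authors' code. The function names (`learn_bpe`, `merge_pair`, `segment`), the `</w>` end-of-word marker, the tie-breaking rule, and the toy romanized word forms with their counts are all assumptions made for illustration.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    out = []
    i = 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def learn_bpe(word_freqs, num_merges):
    """Learn up to `num_merges` merge rules from a {word: count} dictionary."""
    # Start from characters, with an end-of-word marker (an assumed convention).
    vocab = {tuple(word) + ("</w>",): freq for word, freq in word_freqs.items()}
    merges = []
    for _ in range(num_merges):
        # Count how often each adjacent symbol pair occurs, weighted by word count.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break  # every word is already a single symbol
        # Pick the most frequent pair; ties are broken by symbol order (assumption).
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        new_vocab = {}
        for symbols, freq in vocab.items():
            merged = merge_pair(symbols, best)
            new_vocab[merged] = new_vocab.get(merged, 0) + freq
        vocab = new_vocab
    return merges


def segment(word, merges):
    """Split a word into subword units by replaying the learned merges in order."""
    symbols = tuple(word) + ("</w>",)
    for pair in merges:
        symbols = merge_pair(symbols, pair)
    return list(symbols)


# Toy counts for a shared stem with different suffixes (hypothetical data).
toy_counts = {"kitab": 5, "kitablar": 3, "kitabim": 2}
toy_merges = learn_bpe(toy_counts, num_merges=10)
toy_units = segment("kitablar", toy_merges)
```

With a larger number of merges the units approach whole words, and with fewer they approach single characters, so the merge count controls the size of the output layer in a setup like the one the abstract describes.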
