Uyghur speech recognition based on deep neural network (基于深度神经网络的维吾尔语语音识别)

Cited by: 13


Authors: 其米克.巴特西 (Qimike Batexi), 黄浩 (Huang Hao) [1], 王羡慧 (Wang Xianhui) [1]

Affiliation: [1] School of Information Science and Engineering, Xinjiang University, Urumqi, Xinjiang 830046, China

Source: Computer Engineering and Design (《计算机工程与设计》), 2015, No. 8, pp. 2239-2244 (6 pages)

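Funding: National Natural Science Foundation of China (61365005; 60965002); Xinjiang University Doctoral Graduate Research Start-up Fund (2014211B009); Xinjiang University Autonomous Region Natural Science Foundation project (BS120124)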

Abstract: Speech recognition is currently implemented mainly with hidden Markov models (HMMs). Once triphones are taken into account, the number of model parameters grows sharply, and with limited training data these parameters cannot be trained well, which lowers the recognition rate. To improve the recognition rate, a speech recognition method based on a deep neural network is proposed. Using Kaldi as the test platform, a neural network with four hidden layers was trained and then applied to Uyghur speech recognition. Experimental results show that, compared with the basic monophone HMM and the triphone HMM, the deep neural network model reduces the Uyghur speech recognition error rate by 31.09% and 8.68%, respectively, and that all existing model optimization algorithms remain applicable to this model.
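To make the phrase "a neural network with four hidden layers" concrete, the following is a minimal sketch of a forward pass through such a network, mapping one acoustic feature frame to a posterior distribution over HMM states. It is not the authors' Kaldi recipe. The layer sizes (39 inputs, 64 hidden units, 10 output states), the sigmoid activation, the random initialization, and all function names are assumptions chosen only to keep the illustration small and dependency-free.

```python
import math
import random


def make_layer(n_in, n_out, rng):
    """Return (weights, biases): n_out rows of n_in small random weights, zero biases."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def affine(layer, x):
    """Compute W x + b for one layer."""
    weights, biases = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]


def sigmoid(z):
    """Element-wise logistic activation (an assumed choice of nonlinearity)."""
    return [1.0 / (1.0 + math.exp(-v)) for v in z]


def softmax(z):
    """Numerically stable softmax, so the outputs sum to 1."""
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


def build_network(n_input, n_hidden, n_states, n_hidden_layers=4, seed=0):
    """Stack n_hidden_layers hidden layers plus one output layer."""
    rng = random.Random(seed)
    sizes = [n_input] + [n_hidden] * n_hidden_layers + [n_states]
    return [make_layer(a, b, rng) for a, b in zip(sizes, sizes[1:])]


def state_posteriors(network, features):
    """Forward one feature frame through the hidden layers, then softmax over states."""
    h = features
    for layer in network[:-1]:
        h = sigmoid(affine(layer, h))
    return softmax(affine(network[-1], h))


# Hypothetical sizes: 39-dimensional frame, 64 hidden units, 10 HMM states.
network = build_network(n_input=39, n_hidden=64, n_states=10)
frame = [0.0] * 39  # placeholder feature vector, not real speech data
posteriors = state_posteriors(network, frame)
```

In a typical DNN-HMM setup, per-state posteriors of this kind stand in for the Gaussian mixture likelihoods of the HMM states while the HMM topology and decoder stay unchanged; whether the paper uses exactly this arrangement is not stated in the abstract.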

Keywords: speech recognition; model; deep neural network; triphone; hidden Markov model

Classification: TP391.42 [Automation and Computer Technology: Computer Application Technology]

 
