Authors: DAI Wei (代伟) [1]; LIU Hong (刘洪) [2]
Affiliations: [1] College of Artificial Intelligence, Neijiang Normal University, Neijiang 641112, Sichuan; [2] College of Computer Science, Sichuan University, Chengdu 610065, Sichuan
Source: Journal of Sichuan Normal University (Natural Science) (《四川师范大学学报(自然科学版)》), 2022, No. 1, pp. 131-135 (5 pages)
Funding: National Natural Science Foundation of China (71573184).
Abstract: This paper studies an end-to-end Chinese speech recognition algorithm based on neural networks. The raw speech signal is first converted to a spectrogram. On top of the spectrogram, a deep learning model that combines a convolutional neural network with a recurrent neural network is designed and implemented to transcribe Chinese audio into text. The model uses individual Chinese characters as labels and is trained iteratively with a training algorithm and a sequence loss function. Experiments on an open-source dataset test how the network structure affects recognition performance and compare the model with traditional speech recognition algorithms. The results show that the proposed model recognizes speech more accurately and needs less training time.
Classification: TP391 [Automation and Computer Technology: Computer Application Technology]
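
The first step named in the abstract, turning a speech signal into a spectrogram, can be illustrated with a minimal sketch. This is not the authors' implementation: the frame length, hop size, Hann window, and the test tone are all assumptions chosen for illustration, and the naive DFT stands in for the FFT routine a real pipeline would use.

```python
import cmath
import math


def hann_window(n):
    """Return an n-point symmetric Hann window (n >= 2)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def spectrogram(signal, frame_len=64, hop=32):
    """Magnitude spectrogram via a naive short-time DFT.

    frame_len and hop are illustrative assumptions, not values from the paper.
    Returns a list of frames; each frame holds frame_len // 2 + 1 magnitudes.
    """
    window = hann_window(frame_len)
    n_bins = frame_len // 2 + 1
    frames = []
    # Slide a window over the signal, keeping only complete frames.
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + i] * window[i] for i in range(frame_len)]
        mags = []
        for k in range(n_bins):
            # Direct DFT sum for frequency bin k (O(N^2), fine for a sketch).
            acc = sum(
                frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                for n in range(frame_len)
            )
            mags.append(abs(acc))
        frames.append(mags)
    return frames


# Hypothetical input: a pure tone chosen to fall exactly on DFT bin 8,
# since 1000 Hz / (8000 Hz / 64 samples) = 8.
sample_rate = 8000
tone_hz = 1000
signal = [math.sin(2 * math.pi * tone_hz * t / sample_rate) for t in range(256)]

spec = spectrogram(signal)
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
```

Each row of `spec` is one time frame and each column one frequency bin, which is the two-dimensional layout a convolutional front end would consume. For the test tone, the strongest bin in the first frame is the one the tone was placed on.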