Research on Improving Phoneme Recognition Rate Based on Subspace Gaussian Mixture Model and Deep Neural Network Combination

Cited by: 1

Authors: JIA Bingbing (贾兵兵), CAO Hui (曹辉) [1], QIN Chijie (秦驰杰) (School of Physics and Information Technology, Shaanxi Normal University, Xi'an 710119, China)

Affiliation: [1] School of Physics and Information Technology, Shaanxi Normal University

Source: Computer Engineering and Applications (《计算机工程与应用》), 2019, No. 24, pp. 117-121, 127 (6 pages)

Funding: National Natural Science Foundation of China (No. 1202020368, No. 11074159, No. 11374199)

Abstract: To reduce the phoneme recognition error rate obtained with acoustic features in speech recognition systems and to improve system performance, a feature extraction method that combines a Subspace Gaussian Mixture Model (SGMM) with a Deep Neural Network (DNN) is proposed. The parameter scale of the SGMM is analyzed and, after its computational complexity has been reduced, the SGMM is connected in series with the DNN to further improve the phoneme recognition rate. Speech data that have undergone a nonlinear feature transformation are fed into the model, the best configuration of the DNN structure is identified, and a network model that learns and trains more reliably is built for feature extraction. System performance is judged by comparing phoneme recognition error rates. Simulation experiments show that the features extracted with this system clearly outperform those of traditional acoustic models.

Keywords: acoustic features; phoneme recognition; subspace Gaussian mixture model; deep neural network

Classification code: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]
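
The abstract judges system performance by comparing phoneme recognition error rates. This record does not say which scoring tool the authors used, so the following is only a minimal sketch of the conventional definition, assuming the error rate is the total edit distance (substitutions, deletions, insertions) divided by the number of reference phonemes. The function names and the phoneme labels in the usage note are hypothetical and chosen for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two phoneme label sequences."""
    # prev[j] holds the distance between the processed part of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference phoneme missing in hyp
                curr[j - 1] + 1,     # insertion: extra phoneme in hyp
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def phoneme_error_rate(refs, hyps):
    """Corpus-level error rate: total edit distance / total reference length."""
    total_errors = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total_ref = sum(len(r) for r in refs)
    if total_ref == 0:
        raise ValueError("reference transcripts contain no phonemes")
    return total_errors / total_ref
```

For instance, scoring the hypothetical reference `["sil", "k", "ae", "t", "sil"]` against the hypothesis `["sil", "k", "ah", "t"]` counts one substitution and one deletion over five reference phonemes, so `phoneme_error_rate` returns 0.4.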

 
