改进粒子滤波跟踪的视听双模态语音识别仿真  

Simulation of Audiovisual Bimodal Speech Recognition Based on Improved Particle Filter Tracking

在线阅读下载全文

作  者:岳莉[1] 李柯景[1] 赵剑[1] YUE Li;LI Ke-jing;ZHAO Jian(College of Computer Science and Technology,Changchun University,Changchun Jilin 130022,China)

机构地区:[1]长春大学计算机科学技术学院,吉林长春130022

出  处:《计算机仿真》2024年第9期213-216,345,共5页Computer Simulation

基  金:吉林省教育厅科研项目(JJKH20220600KJ)。

摘  要:噪声环境下视听语音不易被识别,为提升语音识别效果,提出改进粒子滤波跟踪的视听双模态语音识别方法。采用谱减法去除噪声数据,完成视听双模态语音的消噪处理;根据人语和唇动信息之间的相关性,采用改进粒子滤波跟踪方法提取视听双模态语音特征信息,构建transformer语音识别模型,将提取的特征信息输入到模型内实施并行训练,实现视听双模态语音的有效识别。实验结果表明,通过对上述方法开展信噪比测试、识别性能测试,验证了上述方法的可行性高、可靠性强。In noisy environments,audio-visual speech is not easily recognized.To improve speech recognition performance,an improved particle filter tracking audio-visual bimodal speech recognition method is proposed.Firstly,spectral subtraction was adopted to remove noise data,thus completing the noising removal of audiovisual dual-modal speech.Based on the correlation between human speech and lip movement information,an improved particle filter tracking method was adopted to extract audiovisual dual-modal speech feature information,and then a transformer speech recognition model was constructed.Finally,the extracted information was input into the model for parallel training,thus achieving the effective recognition for audiovisual dual-modal speech.The experimental results show that the proposed method show high feasibility and strong reliability after the signal-to-noise ratio test and recognition performance test.

关 键 词:语音识别模型 谱减法 去噪处理 识别训练 

分 类 号:TP399[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象