检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:郭明琦 GUO Mingqi(Yellow River Conservancy Technical Institute,Kaifeng,Henan 475000,China)
出 处:《计算机应用文摘》2024年第2期96-99,共4页Chinese Journal of Computer Application
摘 要:人工智能概念的提出,让语音识别迎来了新的生机。随着相关知识与技能的飞速发展,神经网络带动了语音识别领域相关知识的革新。文章使用语音识别中常见的LPCC特征、MFCC特征和PLP特征对同一段语音进行特征提取,通过特征图像化可以直观展示其特征的优劣势。其中,LPCC特征对频谱包络变化较为敏感;MFCC特征具有较好语音信号的短时频谱,对信号的语音干扰和音量变化等抗干扰能力较好,但高频细节不够清晰;PLP特征具有较好的鲁棒性,对信号的语音干扰和音量变化等有很好的抗干扰能力,且对高频部分的细节信息表示更为准确。The introduction of the concept of artificial intelligence has ushered in new vitality for speech recognition.With the rapid development of related knowledge and skills,neural networks have driven the innovation of relevant knowledge in the field of speech recognition.This article uses common LPCC features,MFCC features,and PLP features in speech recognition to extract features from the same segment of speech.Through feature visualization,the advantages and disadvantages of these features can be visually displayed.Among them,LPCC features are more sensitive to changes in spectral envelope.MFCC features have a good short-term spectrum of speech signals,and have good anti-interference ability against speech interference and volume changes,but high-frequency details are not clear enough.PLP features have good robustness and have good anti-interference ability against speech interference and volume changes in signals,and are more accurate in representing detailed information in high-frequency parts.
分 类 号:TN912[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.20.238.29