基于谱峰值点特征的汉语音节匹配算法  

Syllable Matching Algorithm with Spectral Peak Point Feature for Chinese Speech

在线阅读下载全文

作  者:唐维康 邵玉斌[1] 龙华[1] 杜庆治[1] 彭艺[1] 陈亮 Tang Weikang;Shao Yubin;Long Hua;Du Qingzhi;Peng Yi;Chen Liang(School of Information Engineering and Automation,Kunming University of Science and Technology,Kunming,Yunnan 650500,China)

机构地区:[1]昆明理工大学信息工程与自动化学院,云南昆明650500

出  处:《激光与光电子学进展》2022年第7期121-129,共9页Laser & Optoelectronics Progress

基  金:国家自然科学基金(61761025)。

摘  要:为提升噪声环境中汉语语音音节的匹配效果,依据汉语语音的谱峰值点特征,提出了一种音节匹配算法。采用离散余弦变换提取语音信号包络语谱图,利用人耳掩蔽效应进行谱能量判决,获取每一帧谱能量的极大值点;接着在对数频率范围内作二值量化,将音节信号对应为二进制序列;然后根据二进制序列的模板对比,确定音节匹配结果。本算法对无噪汉语语音的音节匹配效果优于传统方法,且在低信噪比情况下仍具有较高的匹配准确率。Based on the spectral peak point characteristics of Chinese speech,this study proposes a syllable matching algorithm to improve the matching effect of Chinese speech syllables in noisy environments.First,a discrete cosine transform is used to extract the speech signal envelope spectrogram,and the human ear masking effect is used for spectral energy judgment to obtain the extreme value points of spectral energy in each frame.Then,the syllable signal is corresponded to a binary sequence by performing binary quantization in the logarithmic frequency range.Finally,the syllable matching result is determined based on the template comparison of the binary sequence.The results show that the proposed algorithm outperforms the conventional methods for matching syllables in the noiseless Chinese speech.Additionally,it has a high matching accuracy at low signal-to-noise ratios.

关 键 词:信号处理 音节匹配 极值点 对数频率域 

分 类 号:TN912.34[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象