一种适用于说话人识别的改进Mel滤波器  被引量:8

An Improved Mel-frequency Filter for Speaker Recognition

在线阅读下载全文

作  者:项要杰[1,2] 杨俊安[1,2] 李晋徽[1,2] 陆俊[1,2] 

机构地区:[1]电子工程学院信息系,合肥230037 [2]安徽省电子制约技术重点实验室,合肥230037

出  处:《计算机工程》2013年第11期214-217,222,共5页Computer Engineering

基  金:国家自然科学基金资助项目(60872113)

摘  要:Mel倒谱系数(MFCC)侧重提取语音信号的低频信息,对语音信号的频谱分布特性描述不充分,不能有效区分说话人个性信息。为此,通过分析语音信号各频段所含说话人个性信息的不同,结合Mel滤波器和反Mel滤波器在高低频段的不同特性,提出一种适于说话人识别的改进Mel滤波器。实验结果表明,改进Mel滤波器提取的新特征能够获得比传统Mel倒谱系数以及反Mel倒谱系数(IMFCC)更好的识别效果,并且基本不增加说话人识别系统训练和识别的时间开销。Mel-frequency Cepstral Coefficient(MFCC) focuses on extracting information in the lower frequency of speech signal, and fails to describe the distribution of a speech spectrum sufficiently, so it cannot effectively distinguish speaker's specific information. By analyzing the distribution of speaker specific information in different frequency bands of the speech signal, different characters of mel-filterbank and inverted rnel-filterbank are combined in high and low frequency bands, and an improved filterbank is presented, which is more suitable for speaker recognition. Experimental results show that features are extracted using the improved filterbank achieve better recognition rates compared with the traditional MFCC and Inverted MFCC, and without increasing the computing time obviously.

关 键 词:说话人识别 MEL倒谱系数 个性信息 反Mel倒谱系数 频谱分布 语音信号 

分 类 号:TN912.34[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象