基于模糊分类器及多层前馈神经网络混合结构的说话人辨认被引量：4

Hybrid architecture based on fuzzy classifier and multiplayer feed-forward neural network for speaker identification

出　　处：《通信学报》2005年第11期68-75,共8页Journal on Communications

基　　金：江苏省"青蓝工程"跨世纪学术带头人专项基金资助项目(QL003YZ)南京邮电大学科研发展基金资助项目(2001院17)

摘　　要：提出了基于模糊超椭球聚类算法的说话人辨认新方法。该算法首先将某一类的训练数据分成若干子类, 对每一子类在其中心周围定义具有超椭球区域的模糊规则。实验表明,该系统可以较快的聚类速度取得与HMM 相当的识别效果。进一步的研究表明,基于模糊超椭球聚类算法的说话人辨认系统与传统的基于HMM的识别方法存在一个共同的缺点,即抗噪性能较差。为此,通过引入多层前馈神经网络(MLFNN)与模糊超椭球分类器构成混合模型,使系统的识别性能和抗噪能力显著提高。A novel method for speaker identification was proposed which was based on a fuzzy classifier with hyperellipsoidal regions. First, the training data for each class were divided into several clusters. Then, for each cluster, a fuzzy rule with a hyperellipsoidal region was defined around a cluster center. The evaluation experiments had been conducted to compare the fuzzy hyperellipsoidal classifier with the HMM. It was found that the former classifier can achieve a comparable speaker identification performance to the latter with higher clustering speed. Further research showed that both fuzzy hyperellipsoidal classifier and the HMM worsened the recognition ability when the test data contained noise. To overcome this problem, a hybrid architecture based on fuzzy classifier and multilayer feed-forward neural network （MLFNN） was developed for speaker recognition. The experimental results showed that the new method can achieve a much better identification performance and robustness to the additive rioise.

关键词：说话人辨认模糊超椭球分类器多层前馈神经网络

分类号：TN912.3[电子电信—通信与信息系统]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于模糊分类器及多层前馈神经网络混合结构的说话人辨认被引量：4

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于模糊分类器及多层前馈神经网络混合结构的说话人辨认 被引量：4

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于模糊分类器及多层前馈神经网络混合结构的说话人辨认被引量：4