模糊矢量量化在语音情感识别中的应用  

Improved Fuzzy VQ Algorithm in Speech Emotion Recognition

在线阅读下载全文

作  者:狄金海[1] 赵艳[2] 赵力[2] 

机构地区:[1]浙江工贸职业技术学院,浙江温州325003 [2]东南大学信息科学与工程学院,江苏南京210096

出  处:《电声技术》2008年第10期49-51,55,共4页Audio Engineering

基  金:国家自然基金(60472058);教育部博士点基金(20050286001);教育部"新世纪优秀人才支持计划"

摘  要:提出了一种将改进的模糊C均值聚类算法与矢量量化相结合的语音情感识别方法,实现了对4种情感的识别:高兴、生气、悲伤和惊奇。首先提取情感语句全局结构和时序结构特征参数并进行性别规整,再利用改进后的模糊矢量量化方法来设计码本,最后对待识别语音进行辩识。该算法不但解决了模糊C均值算法对初始值敏感、易陷入局部最优的问题,而且性别规整改善了特征参数的有效性,使识别率得以进一步提高。实验结果表明该算法能够有效改善识别率。A method which combines improved fuzzy c-mean clustering and VQ(Veetor Quantization) is proposed. Four emotions, namely happiness, angry, sadness and surprise, are recognized. Firstly, globe and time sequence features are extracted from speech signals, and modified according to the gender difference. Then code book is designed by improved fuzzy VQ. Finally the emotion of the speech is recognized. The problem of sensitive to initial condition is settled, and the local optimization is also avoided. In addition, the features validity is improved by gender modification. The result shows the better recognition rate.

关 键 词:模糊C均值 矢量量化 语音情感识别 性别规整 

分 类 号:TN912[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象