融合功能性副语言比例系数的语音情感识别  

Speech Emotion Recognition Fusing Functional Paralanguage Proportion Coefficient

在线阅读下载全文

作  者:孙颖 周雅茹 张雪英 SUN Ying;ZHOU Ya-ru;ZHANG Xue-ying(College of Information and Computer,Taiyuan University of Technology,Taiyuan 030024,China)

机构地区:[1]太原理工大学信息与计算机学院,山西太原030024

出  处:《东北大学学报(自然科学版)》2024年第1期40-48,共9页Journal of Northeastern University(Natural Science)

基  金:国家自然科学基金资助项目(62271342);山西省自然科学基金资助项目(201901D111096).

摘  要:语言中的非言语发声如笑声、叹息、抽泣等,称为功能性副语言,对情感表达起重要作用,但现有研究很少考虑多种功能性副语言在一种情感中的协同作用.针对该问题,提出了融合功能性副语言比例系数(functional paralanguage proportion coefficient,FPPC)的情感识别系统.首先,提取能体现多种功能性副语言在情感语句中出现的频率快慢和持续时间长短的FPPC特征;然后,搭建基于注意力机制的集成学习(attention stacking)为不同的基分类器赋予不同权重,并对FPPC特征进行训练;最后,通过自适应熵权重决策融合方法将传统语音情感识别与基于FPPC特征情感识别进行融合.实验结果显示,融合了FPPC特征后的情感识别结果提高了16.84%,证明融合FPPC特征能有效提高系统整体识别率.Nonverbal vocalizations such as laughter,sighs,and sobs in speech are called functional paralanguage and play an important role in emotional expression.However,existing research has rarely considered the synergistic effect of multiple functional paralanguages in a single emotion.To address this issue,an emotion recognition system integrating functional paralanguage proportion coefficients(FPPC)is proposed.Firstly,FPPC features that reflect the frequency and duration of multiple functional paralanguages appearing in emotional statements are extracted.Then,an attention mechanism-based ensemble learning is constructed to assign different weights to different base classifiers and train the FPPC features.Finally,the adaptive entropy weight decision fusion method is used to fuse traditional speech emotion recognition with emotion recognition based on FPPC features.Experimental results show a 16.84%improvement in emotion recognition after integrating FPPC features,proving that integrating FPPC features can effectively improve the overall recognition rate of the system.

关 键 词:语音情感识别 比例系数 功能性副语言 注意力机制 自适应熵权重决策融合 

分 类 号:TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象