Authors: 任国凤 (REN Guofeng) [1,2]; 张雪英 (ZHANG Xueying); 李东 (LI Dong) [1]; 闫建政 (YAN Jianzheng)
Affiliations: [1] School of Information Engineering, Taiyuan University of Technology, Taiyuan, Shanxi 030600, China; [2] Department of Electronics, Xinzhou Teachers University, Xinzhou, Shanxi 034000, China
Source: 《现代电子技术》 (Modern Electronics Technique), 2018, No. 14, pp. 182-186 (5 pages)
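Funding: National Natural Science Foundation of China (61371193); Shanxi Province Graduate Innovation Fund (2015BY24); Shanxi Province Education Reform and Innovation Project (J2016097)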
Abstract: Bimodal Mandarin Chinese databases that combine articulatory movement parameters with emotional speech are very scarce. To address this, a Mandarin speech corpus covering four emotions (neutral, happy, angry, and sad) was designed. The corpus consists of 1,440 segments of audio and articulatory movement data recorded by 10 subjects, and the texts are of two lengths: disyllabic words and sentences. To ensure the validity of the database, 10 evaluators with good Mandarin and normal hearing were invited to form an evaluation panel and assess every audio file in the database. The files were then screened using the panel's results together with the stability of the articulatory movement data, yielding a bimodal emotional speech database with good speech quality and stable articulatory movement parameters. The database can be used to study articulatory movements in emotional speech, and its data can serve, separately or jointly, as sample data for emotional speech recognition algorithms, which should help improve the emotional speech recognition rate.
Keywords: database; emotional speech; articulatory movement parameters; Mandarin Chinese; signal processing; Mandarin speech corpus
Classification number: TN912.34 [Electronics and Telecommunications: Communication and Information Systems]