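Affiliations: [1] College of Computer Science, Chongqing University of Posts and Telecommunications, Chongqing 400065 [2] School of Computer Science, Southwest Jiaotong University, Chengdu 610031
Source: Journal of Chongqing University of Posts and Telecommunications (Natural Science Edition), 2008, No. 5, pp. 597-602 (6 pages)
Funding: New Century Excellent Talents Support Program; Natural Science Foundation of Chongqing (CSTC2007BB2445); Open Project Fund of the Chongqing Key Laboratory of Computer Network and Communication Technology, "Research on Key Technologies of Emotion Recognition"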
Abstract: Emotional speech carries rich information and has broad applications in human-computer interaction (HCI). The Mel frequency scale is derived from the characteristics of human hearing and maps nonlinearly to frequency in Hz. Mel-frequency cepstral coefficients (MFCC) are Hz-spectrum features computed through this mapping, and they are widely used in speech recognition. Because the mapping is nonlinear, the computational accuracy of MFCC falls as frequency rises, so applications usually keep only the low-frequency MFCCs and discard the mid- and high-frequency ones. This paper studies that problem: it amends the nonlinear Hz-Mel mapping to improve the accuracy of the mid- and high-frequency coefficients, and uses them as a complement to the low-frequency MFCCs in speech emotion recognition. Experiments show that, compared with the classical algorithm, the improved algorithm raises the recognition rate to varying degrees across different feature combinations, which supports the effectiveness of the Mid MFCC feature computation.
Classification No.: TP391.42 [Automation and Computer Technology: Computer Application Technology]
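The abstract attributes the loss of accuracy at higher frequencies to the nonlinear Hz-Mel mapping. The sketch below illustrates that effect with the commonly used textbook formula mel = 2595 * log10(1 + f / 700); this record does not state which formula the paper starts from, and it does not give the amended mapping, so neither is reproduced here. The function names, the 0 to 4000 Hz range, and the choice of 10 filters are assumptions made only for illustration.

```python
import math


def hz_to_mel(f_hz: float) -> float:
    """Commonly used Hz -> Mel formula (assumed here; not taken from the paper)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel: float) -> float:
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_band_edges(f_low: float, f_high: float, n_filters: int) -> list:
    """Return n_filters + 2 band edges in Hz, spaced uniformly on the Mel scale."""
    m_low, m_high = hz_to_mel(f_low), hz_to_mel(f_high)
    step = (m_high - m_low) / (n_filters + 1)
    return [mel_to_hz(m_low + i * step) for i in range(n_filters + 2)]


# Hypothetical setup: 10 filters between 0 Hz and 4000 Hz.
edges = mel_band_edges(0.0, 4000.0, 10)
for lo, hi in zip(edges, edges[1:]):
    # The Hz span between neighbouring edges grows with frequency, so each
    # filter covers a wider band and resolves the spectrum more coarsely.
    print(f"{lo:8.1f} Hz to {hi:8.1f} Hz  (span {hi - lo:7.1f} Hz)")
```

Uniform spacing in Mel gives narrow bands at low frequencies and progressively wider bands higher up, which is consistent with the abstract's remark that mid- and high-frequency coefficients are computed less accurately under the classical mapping.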