Source: Journal of Southwest Jiaotong University (《西南交通大学学报》), 2013, No. 4, pp. 756-760 (5 pages)
Funding: National Natural Science Foundation of China (61001089); Natural Science Foundation of Chongqing (2010BB2049)
Abstract: This paper examines the Mel-cepstral distance measure (Mel-CD), an objective speech quality evaluation method based on Mel-frequency cepstral coefficients (MFCC). Following psychoacoustic principles, the human auditory model proposed by Johannesma and a nonlinear compression transform are introduced into the MFCC extraction process, with a Gammatone filter bank used to simulate the basilar membrane of the human ear. Using the improved MFCC as the speech feature parameters, the paper proposes the Mel-cepstral gammatone filter bank distance measure (Mel-GD), an objective evaluation method that agrees more closely with human auditory perception. Performance tests show that the proposed algorithm has the same time complexity as Mel-CD, while the correlation between subjective and objective evaluation results improves by 4.9% and the average estimation bias improves by 45.5%.
Keywords: speech quality; MFCC; Gammatone filter bank; nonlinear transform
Classification: TN912 [Electronics and Telecommunications: Communication and Information Systems]
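The abstract names two building blocks, a Gammatone filter and a cepstral distance, without giving formulas. The sketch below illustrates the textbook forms of both, not the paper's own implementation. Assumptions not taken from the record: the Glasberg-Moore ERB bandwidth formula, a filter order of 4, the bandwidth factor 1.019, and the commonly used mel-cepstral distortion expression in dB that skips the 0th (energy) coefficient. All function names are hypothetical.

```python
import math


def erb_hz(fc_hz):
    """Equivalent rectangular bandwidth (Glasberg-Moore approximation; an assumption here)."""
    return 24.7 * (4.37 * fc_hz / 1000.0 + 1.0)


def gammatone_impulse_response(fc_hz, fs_hz, n_samples, order=4):
    """Sampled, unnormalized gammatone impulse response:
    g(t) = t^(order-1) * exp(-2*pi*b*t) * cos(2*pi*fc*t), with b = 1.019 * ERB(fc).
    """
    b = 1.019 * erb_hz(fc_hz)
    response = []
    for k in range(n_samples):
        t = k / fs_hz
        envelope = t ** (order - 1) * math.exp(-2.0 * math.pi * b * t)
        response.append(envelope * math.cos(2.0 * math.pi * fc_hz * t))
    return response


def mel_cepstral_distance_db(ref_frame, test_frame):
    """Common mel-cepstral distortion between two cepstral vectors, in dB.
    The 0th coefficient is skipped; this may differ from the paper's Mel-CD definition.
    """
    if len(ref_frame) != len(test_frame):
        raise ValueError("cepstral vectors must have the same length")
    squared = sum((r - t) ** 2 for r, t in zip(ref_frame[1:], test_frame[1:]))
    return (10.0 / math.log(10.0)) * math.sqrt(2.0 * squared)
```

A distance-based quality score would typically average `mel_cepstral_distance_db` over aligned frames of the reference and degraded signals; how Mel-GD maps that average to a quality estimate is not stated in this record.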