检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:黄心仪 谢凌云[2] 王鑫[1] HUANG Xinyi;XIE Lingyun;WANG Xin(School of Music and Recording Arts,Communication University of China,Beijing 100024,China;School of Information and Communication Engineering,Communication University of China,Beijing 100024,China)
机构地区:[1]中国传媒大学音乐与录音艺术学院,北京100024 [2]中国传媒大学信息与通信工程学院,北京100024
出 处:《中国传媒大学学报(自然科学版)》2023年第4期62-68,共7页Journal of Communication University of China:Science and Technology
摘 要:随着三维声的应用逐渐广泛,对三维声进行双耳渲染成为了新的技术热点,如何有效地评价三维声双耳渲染算法成为关键问题。本文针对6种三维声双耳渲染算法进行了音质维度的主观评价实验,对实验数据进行方差分析和回归分析。通过对双耳录音的实验素材进行客观特征的提取和筛选,与主观评价结果进行偏最小二乘回归分析,建立了总体音质评价维度的客观评测模型,并探究了主观感知与客观特征之间的关联。主观实验结果表明,进行双耳渲染算法处理会对音质造成损伤,但对音质进行算法补偿,可以在一定程度上弥补渲染算法造成的音质损伤。客观预测模型表明音质与2560~5120Hz和40~320Hz这两个频段的时频特征高度相关,例如谱通量和谱滚降等。低频段的双耳互相关系数和侧向声能比也是影响音质维度的重要特征。As the application of 3D sound becomes increasingly widespread,binaural rendering of 3D sound has emerged as a new technological focus.The effective evaluation of binaural rendering algorithms for 3D sound has become a key issue.In this paper subjective quality assessment experiments on six different binaural rendering algorithms for 3D sound were conducted,followed by variance analysis and regression analysis of the experimental data.Objective features were extracted and selected from binaural recordings,and a partial least squares regression analysis was performed to establish an objective evaluation model for overall sound quality dimensions.The relationship between subjective perception and objective features was also explored.The subjective experimental results indicate that the binaural rendering algorithm processing can have a negative impact on sound quality.However,compensating for sound quality using algorithmic adjustments can partially mitigate the sound quality degradation caused by the rendering algorithm.The objective prediction model reveals that sound quality is highly correlated with time-frequency features in the frequency ranges of 2560-5120Hz and 40-320Hz,such as spectral flux and spectral rolloff.Additionally,the interaural cross-correlation coefficient and lateral sound energy ratio in the low-frequency range are important features influencing sound quality dimensions.
关 键 词:三维声 双耳渲染算法 主观评价 客观评测模型 音质
分 类 号:TN912.2[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.17.74.181