机构地区:[1]特种光纤与光接入网省部共建重点实验室,新型显示技术及应用集成教育部重点实验室,上海大学通信与信息工程学院,上海200444
出 处:《信号处理》2025年第4期759-769,共11页Journal of Signal Processing
基 金:国家自然科学基金(62071287,62020106011,62371279,62371278);上海市科学技术委员会(20DZ2290100)。
摘 要:无参考全景图像质量评价旨在客观衡量全景图像的人类视觉感知质量,而无需依赖原始图像的质量信息。随着虚拟现实技术的迅猛发展,全景图像质量评价的重要性日益凸显。然而,现有全景图像质量评价算法仍存在着一些限制,如不能很好模拟观察者的浏览过程、未能有效考虑观看者的立体感知过程等。这严重影响了全景图像质量评价的准确性。为解决这一问题,本文提出一种基于沉浸式立体感知和视口感知交互的无参考全景图像质量评价算法。首先,设计一种视口提取策略,通过在球形域上提取特征视点,选择具有较高被观察概率的视点。对选定的视点提取相应的视口内容,并将多个视口内容并行输入特征编码器,以实现多尺度视口特征的提取。随后,鉴于当前实现多个视口间信息交互的方式尚存在局限性,本文提出一个视口特征交互模块,旨在实现对输入的多个视口内容进行跨视口的信息交互。最后,本文还探索了在缺乏视口采样的情况下,利用整个全景图像实现对立体感信息的获取,以实现对立体感过程建模从而提高整体评价性能。实验结果证明了本文提出算法的有效性,与当前最先进的质量评价算法相比之下,斯皮尔曼等级相关系数(Spearman Rank Order Correlation Coefficient,SROCC)指标和皮尔逊线性相关系数(Linear Pearson Correlation Coefficient,PLCC)在公开数据集CVIQD上分别达到0.72%和0.70%的提升,而在数据集OIQA上分别达到了1.10%和0.54%的提升。Blind omnidirectional image quality assessment(BOIQA)aims to objectively assess the human-perceived quality of omnidirectional images without relying on original image quality information.With the continuous evolution of virtual reality(VR)technology,the importance of BOIQA is increasingly pronounced.However,extant algorithms for omnidirectional image quality assessment exhibit certain constraints,including inadequacies in accurately simulating the browsing behavior of a viewer and deficiencies in effectively incorporating the stereoscopic perception processes of the viewer.These limitations impede the precision of omnidirectional image quality evaluation algorithms.To solve this problem,this paper proposes an algorithm for omnidirectional image quality assessment based on viewport perception and immersive stereoscopic perception interaction.In particular,the proposed methodology integrates the SPHORB algorithm,which enables the formulation of a systematic approach for the extraction of viewports.Leveraging this algorithm,a plethora of significant viewpoints can be meticulously extracted from the spherical domain.Subsequent to the acquisition of multiple feature-rich viewpoints,a meticulous selection procedure is executed to discern the ultimate 20 crucial viewpoints.These meticulously chosen viewpoints function as the pivotal coordinates for viewport sampling,encapsulating regions with the highest propensity for human visual focus and attention.Following the completion of viewport content filtering and sampling,the extracted contents are concurrently fed into the feature encoder to facilitate the extraction of viewport-specific features.Acknowledging the significance of both shallow and deep features in quality score prediction,this study endeavors to extract multi-scale features for each viewport,thereby augmenting the perceptual feature space.However,current methodologies for facilitating information exchange among multiple viewport contents exhibit certain limitations.To address this,we introduce a viewport f
分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...