基于神经辐射场的稀疏视角三维重建方法

Sparse perspective 3D reconstruction method based on neural radiation field

作　　者：张超[1] 袁亮[1,2] 肖文东[1] 冉腾吕凯 Zhang Chao;Yuan Liang;Xiao Wendong;Ran Teng;Lyu Kai(College of Intelligent Manufacturing Modern Industry,Xinjiang University,Urumqi 830017,China;School of Cultural and Creative Industries,Shanghai Jiao Tong University,Shanghai 200240,China)

机构地区：[1]新疆大学智能制造现代产业学院,乌鲁木齐830017 [2]上海交通大学文化创意产业学院,上海200240

出　　处：《电子测量技术》2024年第20期159-166,共8页Electronic Measurement Technology

基　　金：国家自然科学基金(52275003);新疆维吾尔自治区重大科技专项(2023A03001)资助。

摘　　要：针对神经辐射场在稀疏视角输入条件下渲染结果过于平滑,细节缺失严重等问题,提出一个基于信息关注抑制模块和双阶段损失的网络模型。首先,为解决细节缺失问题,提出一个信息关注抑制模块,该模块在全连接层各层之间采用特征向量归一化模块过滤权重异常值,并以残差网络级联全局信息和局部信息,最后利用通道注意力将将融合后的信息根据重要度进行区分,有效提高了采样点特征向量的准确性。然后,为了解决渲染结果过于平滑导致感知精度低的问题,设计了一种双阶段损失函数,将训练过程划分为两个阶段,粗阶段仅以RGB损失和深度损失指导训练,细阶段在此基础上还引入感知损失和全变分损失,通过渐进优化的方式,充分利用图片的高级特征,提升图像感知能力。本文算法与其他经典方法进行对比,在LLFF数据集上,定量结果表明,整体性能取得最优值,比次优算法性能提升1.9%,在DTU数据集上,定性结果显示,Scan37、Scan55和Scan63等场景重建的完整性和细节水平具有明显优势。In order to address the issue of the neural radiation field rendering results being overly smooth when sparse viewpoint input conditions are present,resulting in a lack of detail,a network model based on an information attention suppression module and a two-stage loss function has been proposed.The first step is to propose an information attention suppression module,which uses a feature vector normalization module to filter outliers in the weights between layers of MLP.It also uses a residual network to cascade global and local information and employs channel attention to differentiate fused information based on its degree of importance.This process improves the accuracy of the sampling points′feature vectors.To address the issue of low perceptual accuracy resulting from overly smooth rendering,a two-stage loss function is proposed.This function partitions the training phase into two stages.In the initial coarse stage,training is guided by RGB and depth loss.Subsequently,in the fine stage,perceptual loss and TV loss are incorporated.This approach enables the utilisation of high-level image features,thereby enhancing the image perception ability via gradual optimization.This paper′s algorithm is compared with other classical methods,and on the LLFF dataset,the quantitative results demonstrate that the overall performance reaches its optimal value,which is 1.9%superior to the performance of the sub-optimal algorithm.Furthermore,on the DTU dataset,the qualitative results indicate that the reconstruction′s completeness and detail level,as observed in Scan37,Scan55,and Scan63,are notably enhanced.

关键词：三维重建稀疏视角神经辐射场注意力机制双阶段损失

分类号：TP391.4[自动化与计算机技术—计算机应用技术] TN-9[自动化与计算机技术—计算机科学与技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于神经辐射场的稀疏视角三维重建方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于神经辐射场的稀疏视角三维重建方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索