Multi-view Reconstruction with Local Self-attention and Deep Optimization

Authors: YE Senhui; WANG Lei [1]

Affiliation: [1] School of Information Engineering, East China University of Technology, Nanchang 330013, Jiangxi, China

Source: Computer and Modernization (《计算机与现代化》), 2024, No. 5, pp. 92-98 (7 pages)

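Funding: National Natural Science Foundation of China (42001411); Jiangxi Engineering Technology Research Center of Nuclear Geoscience Data Science and System (JELRGBDT202202); Open Fund of the Jiangxi Engineering Laboratory on Radioactive Geoscience and Big Data Technology (JELRGBDT202103).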

Abstract: To address the excessive memory and time consumption and the poor completeness of high-resolution reconstruction in multi-view 3D reconstruction, we propose a deep learning-based multi-view reconstruction network. The network consists of a feature extraction module, a cascaded Patchmatch module, and a depth map optimization module. First, we design a U-shaped feature extraction module that extracts multi-stage feature maps, and we introduce a local self-attention layer with relative position encoding at each stage to capture local details and global context in the images, which improves the feature extraction performance of the network. Second, we design a deep residual network that fuses features through dense connections and residual structures, making full use of prior knowledge from the color image to constrain the depth map and improve the accuracy of depth estimation. We test the network on the public DTU (Technical University of Denmark) dataset, and the experimental results show that 3D reconstruction quality is effectively improved. Compared with PatchmatchNet, the network improves completeness by 6.1% and the overall score by 2.5%. Compared with other SOTA (state-of-the-art) methods, it also achieves substantial gains in both completeness and the overall score.
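
Illustrative note (not from the paper): the abstract does not give the formulation of the local self-attention layer, so the following is only a minimal NumPy sketch of one common form, in which each pixel attends to a small window around it and a scalar bias indexed by relative offset is added to the attention logits. The function name `local_self_attention`, the projection matrices, the window size, and the use of a scalar bias per offset are all assumptions made for illustration and may differ from the authors' design.

```python
import numpy as np

def local_self_attention(x, w_q, w_k, w_v, rel_bias):
    """Windowed self-attention over a feature map (hypothetical sketch).

    x        : (H, W, C) feature map
    w_q/k/v  : (C, D) projection matrices (assumed learnable)
    rel_bias : (k, k) bias per relative offset, k odd (assumed learnable)
    returns  : (H, W, D) attended features
    """
    H, W, _ = x.shape
    k = rel_bias.shape[0]
    r = k // 2                                   # window radius
    q, key, v = x @ w_q, x @ w_k, x @ w_v        # each (H, W, D)
    d = q.shape[-1]
    out = np.zeros_like(q)
    for i in range(H):
        for j in range(W):
            # Clip the window at the image border instead of padding.
            i0, i1 = max(i - r, 0), min(i + r + 1, H)
            j0, j1 = max(j - r, 0), min(j + r + 1, W)
            k_win = key[i0:i1, j0:j1].reshape(-1, d)
            v_win = v[i0:i1, j0:j1].reshape(-1, d)
            # Select the bias entries matching the offsets still in range.
            bias = rel_bias[i0 - i + r:i1 - i + r,
                            j0 - j + r:j1 - j + r].reshape(-1)
            logits = k_win @ q[i, j] / np.sqrt(d) + bias
            weights = np.exp(logits - logits.max())  # stable softmax
            weights /= weights.sum()
            out[i, j] = weights @ v_win
    return out

# Usage on random data: a 3x3 window over a 5x6 map with 4 channels.
rng = np.random.default_rng(0)
feat = rng.normal(size=(5, 6, 4))
proj = rng.normal(size=(4, 3))
attended = local_self_attention(feat, proj, proj, proj, np.zeros((3, 3)))
print(attended.shape)
```

The explicit double loop keeps the windowing and the offset indexing easy to follow; a practical layer would vectorize this and normally use separate projections for queries, keys, and values.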

Keywords: deep learning; 3D reconstruction; local self-attention; multi-view stereo; depth estimation

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]
