Voxel-based 3D Object Detection Network Based on Multi-level Feature Fusion

Cited by: 2

Authors: ZHANG Wu-ran; HU Chun-yan [1]; CHEN Ze-lai; LI Fei-fei

Affiliations: [1] School of Optical-electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China; [2] School of Medical Instrument and Food Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China

Source: Packaging Engineering (《包装工程》), 2022, No. 15, pp. 42-53 (12 pages)

Funding: Program for Professor of Special Appointment (Eastern Scholar) at Shanghai Institutions of Higher Learning (ES2015XX).

Abstract: Objective: To accurately determine the location and category of objects in point cloud scenes, a voxel-based 3D object detection network based on multi-level feature fusion is proposed. Methods: The two-stage detector Voxel-RCNN was used as the baseline model. In the first stage, a Sparse Feature Residual Dense Fusion Module (SFRDFM) was added to propagate and reuse features level by level, from shallow to deep, so that the 3D features are fully fused through interaction. A Residual Light-weight and Efficient Channel Attention (RL-ECA) mechanism was added to the 2D backbone network to explicitly enhance channel feature representation. A multi-level feature and multi-scale kernel adaptive fusion module was proposed to adaptively extract the relation weights of the features at each level and to fuse them strongly by weighting. In the second stage, a Triple Feature Fusion Strategy (TFFS) was designed: neighborhood features are aggregated with a Manhattan distance search algorithm, and a Deep Fusion Module (DFM) and a Coarse to Fine Fusion Module (CTFFM) are embedded to improve the quality of the grid-point features. Results: The algorithm was tested on the KITTI autonomous driving dataset. Compared with the baseline network at three difficulty levels, the 3D average precision for pedestrians of the first-stage detection model improved by 3.97%, and the 3D average precision for cyclists of the second-stage detection model improved by 3.37%. Conclusion: The experimental results show that the proposed method effectively improves object detection performance, and that each module is portable and can be flexibly embedded into voxel-based 3D detection models to bring corresponding improvements.

Keywords: 3D object detection; residual fusion; adaptive fusion; feature enhancement; triple feature fusion

Classification code: TP311 [Automation and Computer Technology: Computer Software and Theory]
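
The abstract states that the second stage aggregates neighborhood features with a Manhattan distance search. The following is a minimal sketch of that idea, not the authors' implementation. The function names, the dictionary representation of non-empty voxels, the distance threshold, the sample cap, and the use of channel-wise max pooling as the aggregator are all assumptions made for illustration; the abstract does not specify them.

```python
from itertools import product


def manhattan_neighbors(center, occupied, max_dist, max_samples):
    """Return up to max_samples occupied voxel coordinates whose Manhattan
    distance to `center` is at most max_dist, nearest first.

    `occupied` is any container of integer (x, y, z) tuples (hypothetical layout).
    """
    cx, cy, cz = center
    found = []
    offsets = range(-max_dist, max_dist + 1)
    for dx, dy, dz in product(offsets, repeat=3):
        dist = abs(dx) + abs(dy) + abs(dz)
        if dist > max_dist:
            continue  # outside the Manhattan ball
        coord = (cx + dx, cy + dy, cz + dz)
        if coord in occupied:
            found.append((dist, coord))
    found.sort()  # by distance first, then by coordinate
    return [coord for _, coord in found[:max_samples]]


def aggregate_max(center, voxel_features, max_dist=2, max_samples=16):
    """Channel-wise max over the features of the neighbors around `center`.

    Max pooling is an assumed aggregator; returns None if no neighbor exists.
    """
    neighbors = manhattan_neighbors(center, voxel_features, max_dist, max_samples)
    if not neighbors:
        return None
    channels = zip(*(voxel_features[c] for c in neighbors))
    return [max(ch) for ch in channels]


# Toy example: four non-empty voxels with 2-channel features.
voxels = {
    (0, 0, 0): [0.1, 0.9],
    (1, 0, 0): [0.5, 0.2],
    (1, 1, 1): [0.7, 0.3],  # Manhattan distance 3 from the origin
    (3, 0, 0): [9.0, 9.0],  # Manhattan distance 3 from the origin
}
near = manhattan_neighbors((0, 0, 0), voxels, 2, 16)  # -> [(0, 0, 0), (1, 0, 0)]
pooled = aggregate_max((0, 0, 0), voxels)             # -> [0.5, 0.9]
```

Enumerating offsets inside a fixed Manhattan ball keeps the cost per grid point independent of the total number of voxels, which is one plausible reason to prefer such a query over a Euclidean ball search on raw points.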
