基于BEV视角的多传感融合3D目标检测  

3D Object Detection for Multi-sensor Fusion Based on BEV Perspective

在线阅读下载全文

作  者:张津 朱冯慧 王秀丽 朱威[1,2] ZHANG Jin;ZHU Fenghui;WANG Xiuli;ZHU Wei(College of Information Engineering,Zhejiang University of Technology,Hangzhou 310023,China;Joint Key Laboratory of Embedded Systems of Zhejiang Province,Hangzhou 310023,China)

机构地区:[1]浙江工业大学信息工程学院,杭州310023 [2]浙江省嵌入式系统联合重点实验室,杭州310023

出  处:《计算机测量与控制》2024年第10期77-85,共9页Computer Measurement &Control

基  金:国家自然科学基金青年项目(62303414);浙江省自然科学基金探索青年项目(LQ23F030016)。

摘  要:3D目标检测是自动驾驶在道路环境感知任务中的重要环节,现有主流框架通过搭载多种感知设备获取多模态的数据信息来实现多传感融合检测;传统相机与激光雷达的传感器融合过程中存在几何失真,以及信息优先级不对等,导致传感融合的3D目标检测性能不足;对此,提出了一种基于鸟瞰视角(BEV)的多传感融合3D目标检测算法;利用提升-展开-投射(LSS)方式,获取图像的潜在深度分布建立图像在BEV空间下的特征;采用PV-RCNN的集合抽象法建立点云在BEV空间下的特征;该算法在统一的BEV共享空间中设计了低复杂度的特征编码网络融合多模态特征实现3D目标检测;实验结果表明,所提出的算法在检测精度上相较于纯激光方法提升4.8%,相较于传统的融合方案减少了47%的参数,并保持了相近的精度,较好地满足了自动驾驶系统道路环境感知任务的检测要求。3D object detection is an important part in the road environment perception of autonomous driving.Existing mainstream framework is to obtain multi-modal data by using multiple sensing devices,which achieves multi-sensor fusion.There are the shortages of geometric distortions and unequal information priorities in the fusion process of traditional cameras and LiDARs,resulting in insufficient 3D object detection performance of sensor fusion.To address this issue,a multi-sensor fusion 3D object detection algorithm based on bird s-eye view(BEV)is proposed.The lift-splat-shot(LSS)method is used to obtain the potential depth distribution of the image,and establish the feature map of the image in the BEV space.The set abstraction method of point-voxel region convolutional neural networks(PV-RCNN)is used to establish the feature map of the point cloud in the BEV space.A low-complexity feature encoding network is designed for fusing multi-modal features in a unified BEV space in the proposed method to achieve 3D object detection.Experimental results show that the proposed method improves the accuracy by 4.8%compared to the LiDAR method,reduces the parameters by 47%compared to the traditional fusion methods,and maintains similar accuracy.The proposed method meets the detection requirements of the road environment perception of autonomous driving system.

关 键 词:3D目标检测 鸟瞰图视角 多传感融合 自动驾驶 道路环境感知 

分 类 号:TP212.9[自动化与计算机技术—检测技术与自动化装置] TP242[自动化与计算机技术—控制科学与工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象