3D object detection algorithm with multi-channel cross attention fusion

Cited by: 1

Authors: LU Bin[1,2]; YANG Zhenyu[1,2]; SUN Yang; LIU Yawei[1,2]; WANG Minghan

Affiliations: [1] School of Control and Computer Engineering, North China Electric Power University, Baoding 071000, Hebei, China; [2] Hebei Key Laboratory of Knowledge Computing for Energy & Power, North China Electric Power University, Baoding 071000, Hebei, China

Source: CAAI Transactions on Intelligent Systems (《智能系统学报》), 2024, No. 4, pp. 885-897 (13 pages)

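Funding: Key Research and Development Program of Hebei Province (20310103D); Hebei Province Funding Project for Cultivating the Innovation Ability of Current Graduate Students (CXZZBS2023153).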

Abstract: Existing single-stage 3D object detection algorithms use point cloud downsampling features in only one way, and their features do not aggregate long-range contextual information well enough to support further performance gains. To address these problems, this paper proposes a single-stage 3D object detection algorithm based on multi-channel cross attention fusion. First, a channel-wise cross attention module is designed to fuse the downsampled features; using the cross attention mechanism, it strengthens, at the channel level, the ability of multi-scale features to represent long-range spatial information under different receptive fields. Then, a cascade feature excitation module is proposed, which uses the original downsampled features to apply cascaded excitation to the channel-wise cross attention weighted features, improving the algorithm's ability to learn key spatial features. Extensive experiments were conducted on the public autonomous driving dataset KITTI, with comparisons against mainstream algorithms. As a single-stage object detection algorithm, the proposed method achieves detection accuracies of 91.34%, 79.85%, and 75.98% on the three difficulty levels of the car category, improvements of 4.83%, 3.26%, and 3.32% over the baseline algorithm, respectively. The experimental results demonstrate the effectiveness and advancement of the proposed algorithm and modules on the 3D object detection task.
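The abstract names a channel-wise cross attention module but gives no equations, so the following is only a minimal sketch of the general idea of attention computed across channels between two feature scales, not the authors' design. It assumes both feature maps have already been resampled to the same number of spatial positions and flattened to channels x positions; the function names `channel_cross_attention` and `residual_fuse`, the scaling by the square root of the position count, and the residual addition used as a stand-in for the excitation step are all hypothetical choices made for illustration.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def channel_cross_attention(query_feats, context_feats):
    """Hypothetical channel-wise cross attention (illustrative only).

    query_feats:   C_q x N list of lists (channels x spatial positions)
    context_feats: C_k x N list of lists from another scale, assumed to be
                   resampled to the same N positions beforehand
    Returns (attended, weights): attended is C_q x N, weights is C_q x C_k.
    """
    n = len(query_feats[0])
    if any(len(row) != n for row in query_feats + context_feats):
        raise ValueError("all channels must have the same number of positions")
    scale = math.sqrt(n)  # assumed scaling, by analogy with dot-product attention
    weights = []
    for q in query_feats:
        # Similarity between one query channel and every context channel.
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in context_feats]
        weights.append(softmax(scores))
    # Each output channel is a weighted mix of the context channels.
    attended = [
        [sum(w[j] * context_feats[j][p] for j in range(len(context_feats)))
         for p in range(n)]
        for w in weights
    ]
    return attended, weights


def residual_fuse(original, attended):
    """Assumed stand-in for the excitation step: add attended features back."""
    return [[o + a for o, a in zip(ro, ra)] for ro, ra in zip(original, attended)]


# Toy inputs: 2 query channels and 2 context channels over 3 positions.
query = [[1.0, 0.0, 2.0], [0.5, 1.5, 0.0]]
context = [[1.0, 2.0, 3.0], [1.0, 2.0, 3.0]]
attended, weights = channel_cross_attention(query, context)
fused = residual_fuse(query, attended)
```

Here the attention matrix has shape C_q x C_k rather than positions x positions, which is what distinguishes attention across channels from the more common spatial attention. A real implementation would use learned projections and a tensor library; both are omitted to keep the sketch dependency-free.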

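Keywords: 3D point cloud; autonomous driving; LiDAR; deep learning; 3D object detection; pillar voxel; cross attention; single-stage algorithm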

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]
