检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:鲁斌[1,2] 杨振宇[1,2] 孙洋 刘亚伟[1,2] 王明晗 LU Bin;YANG Zhenyu;SUN Yang;LIU Yawei;WANG Minghan(School of Control and Compute Engineering,North China Electric Power University,Baoding 071000 China;Hebei Key Laboratory of Knowledge Computing for Energy&Power,North China Electric Power University,Baoding 071000,China)
机构地区:[1]华北电力大学控制与计算机工程学院,河北保定071000 [2]华北电力大学河北省能源电力知识计算重点实验室,河北保定071000
出 处:《智能系统学报》2024年第4期885-897,共13页CAAI Transactions on Intelligent Systems
基 金:河北省重点研发计划项目(20310103D);河北省在读研究生创新能力培养资助项目(CXZZBS2023153).
摘 要:针对现有单阶段三维目标检测算法对点云下采样特征利用方式单一、特征对长程上下文信息的聚合程度无法满足算法性能提升需求的问题,本文提出了基于多通道交叉注意力融合的单阶段三维目标检测算法。首先,设计通道交叉注意力模块用于融合下采样特征,可基于交叉注意力机制在通道层面上增强多尺度特征对不同感受野下长程空间信息的表达能力;然后,提出级联特征激励模块,结合原始下采样特征对通道交叉注意力加权特征进行级联激励,提升算法对关键空间特征的学习能力。在公共自动驾驶数据集KITTI上进行了大量实验并与主流算法对比,本文算法作为单阶段目标检测算法,在车辆类别3个难度级别上的检测准确率分别为91.34%、79.85%和75.98%,较基线算法分别提升了4.83%、3.26%和3.32%。实验结果证明了本文算法及所提模块在三维目标检测任务上的有效性和先进性。To solve the problems that the existing single-stage 3D object detection algorithm utilizes point cloud downsampling features in a single way and the degree of aggregation of features for the long-range contextual information cannot meet the requirement of enhancing the algorithm performance,we propose a single-stage 3D object detection algorithm based on multi-channel cross attention fusion.First,the channel-wise cross attention module is designed to fuse the down sampled features,which can enhance the expression ability of multi-scale features for the long-range spatial information under different receptive field based on the cross attention mechanism.Then,a cascade feature excitation module is proposed to combine the original downsampling features to cascade channel-wise cross attention weighted features to enhance the algorithm's learning ability for key spatial features.Extensive experiments were conducted on the public autonomous driving dataset KITTI and compared with mainstream algorithms.As a single-stage algorithm,the detection accuracy was 91.34%,79.85%and 75.98%for the three difficulty levels of car categories,which were 4.83%,3.26%and 3.32%better than the baseline algorithm.The experimental results demonstrate the effectiveness and advancement of the algorithm and the proposed modules for 3D object detection task.
关 键 词:三维点云 自动驾驶 激光雷达 深度学习 三维目标检测 柱体素 交叉注意力 单阶段算法
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.7