具有双层路由注意力的YOLOv8道路场景目标检测方法被引量：21

YOLOv8 with bi-level routing attention for road scene object detection

作　　者：魏陈浩杨睿刘振丙[1] 蓝如师孙希延[2] 罗笑南 WEI Chen-hao;YANG Rui;LIU Zhen-bing;LAN Ru-shi;SUN Xi-yan;LUO Xiao-nan(Guangxi Key Laboratory of Image and Graphic Intelligent Processing(Guilin University of Electronic Technology),Guilin Guangxi 541004,China;National Local Joint Engineering Research Center of Satellite Navigation and Location Service(Guilin University of Electronic Technology),Guilin Guangxi,541004,China)

机构地区：[1]广西图像图形与智能处理重点实验室(桂林电子科技大学),广西桂林541004 [2]卫星导航定位与位置服务国家地方联合工程研究中心(桂林电子科技大学),广西桂林541004

出　　处：《图学学报》2023年第6期1104-1111,共8页Journal of Graphics

摘　　要：随着机动车的数量不断增加,道路交通环境变得更加复杂,尤其是光照变化以及复杂背景都会干扰目标检测算法的准确性和精度,同时道路场景下多变形态的目标也会给检测任务造成干扰。针对这一系列问题,提出了一种YOLOv8n_T方法,在YOLOv8的基础上首先针对骨干网络构建了基于可变形卷积的D_C2f块,强化了特征提取网络对复杂背景下目标的特征学习,更好地适应道路目标复杂多变的情形;其次增加了双层路由注意力模块,以查询自适应的方式去除不相关的区域,留下相关度最高的区域;最后针对道路上行人、交通灯等小目标增加小目标检测层。实验表明,本文提出的YOLOv8n_T有效提高了模型在道路场景下的目标检测精度,在BDD100K数据集上的平均精度比原始YOLOv8n提升了6.8个百分点,比YOLOv5n提升了11.2个百分点。With the continuous increase of motor vehicles,the road traffic environment has become increasingly complex,particularly due to changes in light conditions and complex backgrounds that can interfere with the accuracy and precision of target detection algorithms.Meanwhile,the diverse shapes of targets in road scenes can pose challenges to the detection task.In response to these challenges,a method named YOLOv8n_T was proposed.Building on the YOLOv8 skeleton network,it incorporated a D_C2f block utilizing deformable convolution to enhance feature learning for targets under complex backgrounds,making it more adaptable to the diverse and complex scenarios of road targets.Furthermore,the model incorporated a dual routing attention module to query adaptively and remove irrelevant regions,retaining only the most relevant regions.For small targets such as pedestrians and traffic lights on the road,a small target detection layer was added.Experimental results demonstrated that the proposed YOLOv8n_T could significantly enhance the precision of target detection in road scenarios,with an average precision increase of 6.8 percentage points compared to the original YOLOv8n and 11.2 percentage points compared to YOLOv5n on the BDD100K dataset.

关键词：可变形卷积道路场景目标检测 YOLO 注意力机制

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

具有双层路由注意力的YOLOv8道路场景目标检测方法被引量：21

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

具有双层路由注意力的YOLOv8道路场景目标检测方法 被引量：21

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

具有双层路由注意力的YOLOv8道路场景目标检测方法被引量：21