机构地区:[1]智能机器人湖北省重点实验室(武汉工程大学),武汉430205
出 处:《计算机应用》2022年第9期2900-2908,共9页journal of Computer Applications
基 金:湖北省教育厅科学研究计划指导性项目(B2017051);智能机器人湖北省重点实验室开放基金资助项目(HBIRL202002)。
摘 要:智慧工地中的设备内存和计算能力有限,在现场的设备上通过目标检测对钢筋进行实时检测具有很大的难度,而且其钢筋检测速度慢、模型部署成本高。针对这些问题,在YOLOv3网络的基础上,提出了一个嵌入注意力机制的轻量级钢筋检测网络RebarNet。首先,利用残差块作为网络的基本单元来构建特征提取结构,并用其提取局部和上下文信息;其次,在残差块中添加通道注意力(CA)模块和空间注意力(SA)模块,以调整特征图的注意力权重,并提升网络提取特征的能力;然后,采用特征金字塔融合模块,以增大网络的感受野,并优化中等钢筋图像的提取效果;最后,输出经过8倍下采样后的52×52通道的特征图用于后处理和钢筋检测。实验结果表明,所提网络的参数量仅为Darknet53网络的5%,在钢筋测试集上以106.8 FPS的速度达到了92.7%的mAP。与现有的EfficientDet、SSD、CenterNet、RetinaNet、Faster RCNN、YOLOv3、YOLOv4和YOLOv5m等8个目标检测网络相比,RebarNet具有更短的训练时间(24.5 s)、最低的显存占用(1956 MB)、最小的模型权重文件(13 MB)。与目前效果最好的YOLOv5m网络相比,RebarNet的mAP略低0.4个百分点,然而其检测速度上升了48 FPS,是YOLOv5m网络的1.8倍。以上结果表明,所提出的网络有助于完成智慧工地中要求实现的高效、准确的钢筋检测任务。There are limited memory and computing power of the equipment in smart construction sites,making it very difficult to detect rebar in real time through object detection on the on-site equipment. The slow speed of rebar detection and the high cost of model deployment of this equipment also bring great challenges. In order to solve the problems,RebarNet,a lightweight network for rebar detection with attention mechanism was proposed on the basis of YOLOv3(You Only Look Once version 3). Firstly,the residual block was used as the basic unit of the network to construct a feature extraction structure to extract local and contextual information. Secondly,Channel Attention(CA)module and Spatial Attention(SA)module were added to the residual block to adjust the attention weight of the feature map and improve the ability of the network to extract features. Thirdly,the feature pyramid fusion module was used to increase the receptive field of the network and optimize the extraction effect of the medium-sized rebar images. Finally,the feature map of 52×52 channel was output for post-processing and rebar detection after 8 times downsampling. Experimental results show that the parameter amount of the proposed network is only 5% of that of Darknet53 network,and mAP(mean Average Precision)of the proposed network achieves 92. 7% at the speed of 106. 8 FPS(Frames Per Second)on the rebar test dataset. Compared with the existing 8object detection networks including EfficientDet(Scalable and Efficient Object Detection),SSD(Single Shot MultiBox Detector),CenterNet,RetinaNet,Faster RCNN(Faster Region-CNN),YOLOv3,YOLOv4 and YOLOv5m(YOLOv5medium),RebarNet has a shorter training time(24. 5 seconds),the lowest memory usage(1 956 MB),and the smallest model weight file(13 MB). Compared with the current best-performing YOLOv5m network,RebarNet has the mAP slightly lower by 0. 4 percentage points with the detection speed increased by 48 FPS,which is 1. 8 times of that of YOLOv5m network. The above indicates that the proposed network helps to co
关 键 词:钢筋检测 YOLOv3 注意力机制 特征金字塔 轻量级网络
分 类 号:TP399[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...