融合多尺度特征和多重注意力的水下目标检测  被引量:7

Detecting underwater objects using multi-scale features fusion and multiple attention

在线阅读下载全文

作  者:李辉[1] 王晓宇 刘云[1] 陶冶[1] 付诗佳 吴依凡 Li Hui;Wang Xiaoyu;Liu Yun;Tao Ye;Fu Shijia;Wu Yifan(College of Information Science and Technology,Qingdao University of Science and Technology,Qingdao 266061,China)

机构地区:[1]青岛科技大学信息科学技术学院,青岛266061

出  处:《农业工程学报》2022年第20期129-139,共11页Transactions of the Chinese Society of Agricultural Engineering

基  金:国家自然科学基金项目(61702295);山东省高等学校青创科技支持计划项目(2019KJN047)。

摘  要:探明海洋生物资源的分布情况,对渔业捕捞和海洋牧场管理具有重要意义。该研究针对水下环境复杂、水下目标存在多尺度、多类别及小目标较多等复杂情况,提出水下目标两阶段网络检测方法。首先通过改进多尺度特征提取和融合,获取水下目标多尺度信息和增强目标特征,得到更加丰富的目标特征信息,然后构建多重注意力,利用空间和通道维度中的全局特征依赖关系,进一步挖掘深层特征信息和隐藏信息,突出背景和目标的差异性,最后在模型训练中采用样本均衡方法,自适应均衡正负样本比例,减少无效样本,实现模型快速收敛。在国际水下机器人大赛公开数据集UPRC2019、WildFish及自建数据集上对所提方法进行试验,其mAP(mean Average Precision)分别达到85.3%、96.9%和97.8%,召回率分别达到90.6%、98.7%和98.9%,相较于Libra RCNN(CVPR2019)、Double head RCNN(ECCV2020)和STransFuse(2021)等检测方法,该文方法mAP要比上述方法分别高9.58、12.2和4.1个百分点。研究结果可为海洋渔业生物监测、水下机器人精准捕捞作业提供技术支撑。The distribution of biological resource is of great significance in fisheries and marine ranching.The underwater robots can be expected to combine with the underwater object detection,due to the cost saving and the less risk of fishing operations,compared with the commonly-used manual detection.However,many difficulties and challenges are still remained under the complex and special underwater conditions susceptible to environmental interference,particularly for the underwater objects with the multiple scales,types,and small size.Most of the existing detection for the objects on the ground cannot fully meet the requirement of underwater marine objects,leading to the false and missing detection.Therefore,it is necessary to redesign the network structure suitable for the underwater objects.In this study,a two-stage network detection was proposed to integrate the multi-scale features and multiple attention networks for the underwater objects.Firstly,the multi-scale feature fusion was enhanced to expand the high-level receptive field for the more feature information of objects in the high-level feature maps using hybrid dilated convolution.As such,the network was better adapted to the multi-scale changes without losing the small objects.Then,the up sampling was performed on the effective jump fusion for the high-level channel information without the loss,in order to fully extract the object features.Secondly,a multi-attention network was constructed for the more spatial location and channel features,in view of the complex underwater background,blurred images,small size,and insignificant differences of underwater objects.The global feature dependencies in the space and channel dimensions were more fully utilized to further excavate the hidden feature of difficult samples,with emphasis on the location and feature information of the objects.Finally,the sample equalization was adopted to adaptively balance the proportion of positive and negative samples in training.The fast convergence and optimal training were achieved

关 键 词:目标检测 特征融合 注意力 自适应均衡采样 水下小目标 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象