面向无人机航摄图像语义分割的双路特征融合网络  被引量:3

Dual-Stream Feature Aggregation Network for Unmanned Aerial Vehicle Aerial Images Semantic Segmentation

在线阅读下载全文

作  者:李润增 史再峰 孔凡宁 赵向阳 罗韬[2] Li Runzeng;Shi Zaifeng;Kong Fanning;Zhao Xiangyang;Luo Tao(School of Microelectronics,Tianjin University,Tianjin 300072,China;College of Intelligence and Computing,Tianjin University,Tianjin 300072,China;Tianjin Key Laboratory of Imaging and Sensing Microelectronic Technology,Tianjin 300072,China)

机构地区:[1]天津大学微电子学院,天津300072 [2]天津大学智能与计算学部,天津300072 [3]天津市成像与感知微电子技术重点实验室,天津300072

出  处:《激光与光电子学进展》2023年第24期283-291,共9页Laser & Optoelectronics Progress

基  金:国家自然科学基金(62071326);天津市自然科学基金(22JCYBJC00140)。

摘  要:针对无人机航摄图像中目标尺寸差异大导致的感受野难以同时兼顾不同尺寸物体分割效果的问题,提出了利用两路分支分别提取浅层和深层信息的双路特征融合网络(DSFA-Net)。在编码器中,浅层分支利用三个串行ConvNeXt模块提取高通道数的浅层特征以保留更多空间细节;深层分支利用坐标注意力空洞空间金字塔池化(CA-ASPP)模块为特征图重新分配权重,使网络更加关注尺寸各异的分割目标,获得深层多尺度特征。在解码过程中,网络利用双边引导融合模块为两层特征建立通信以进行分辨率融合,提高层级特征的利用率。所提方法在AeroScapes和Semantic Drone航摄图像数据集上进行了实验,其平均交并比分别达到83.16%和72.09%、平均像素准确率分别达到90.75%和80.34%。与主流的语义分割方法相比,所提方法对于具有较大尺寸差异的目标,分割能力更强,更适用于无人机航摄图像场景下的语义分割任务。Large object size difference in unmanned aerial vehicle(UAV)aerial photography makes it difficult to take into account the segmentation effect of objects of different sizes in the receptive field.A dual-stream feature aggregation network(DSFA-Net)with two branches to extract low-level and high-level features separately,is proposed for such problems.In the encoder,a lowlevel information extraction branch with three serial ConvNeXt modules is used to preserve more low-level features by generating more channels of features.In the deep feature branch,the coordinate attention atrous spatial pyramid pooling(CA-ASPP)module reassigns weights to feature maps in the channel dimension.It makes the module focus on segmentation objects of different sizes and deep-level multi-scale features are obtained.During the decoding process,the bilateral guided aggregation module performs resolution aggregation between the low-level and deep-level features.Our method is evaluated on the AeroScapes and Semantic Drone datasets,the mean intersection over union is 83.16%and 72.09%respectively,and the mean pixel accuracy is 90.75%and 80.34%respectively.The proposed method is more capable of segmenting objects with large difference sizes compared to mainstream methods.It is suitable for semantic segmentation tasks for UAV aerial images.

关 键 词:语义分割 特征融合 双路网络 坐标注意力空洞空间金字塔池化 多尺度特征提取 

分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象