MFVNet:a deep adaptive fusion network with multiple field-of-views for remote sensing image semantic segmentation  被引量:7

在线阅读下载全文

作  者:Yansheng LI Wei CHEN Xin HUANG Zhi GAO Siwei LI Tao HE Yongjun ZHANG 

机构地区:[1]School of Remote Sensing and Information Engineering,Wuhan University,Wuhan 430079,China

出  处:《Science China(Information Sciences)》2023年第4期89-102,共14页中国科学(信息科学)(英文版)

基  金:supported in part by State Key Program of the National Natural Science Foundation of China(Grant No.42030102);Foundation for Innovative Research Groups of the Natural Science Foundation of Hubei Province(Grant No.2020CFA003);National Natural Science Foundation of China(Grant No.41971284);Fundamental Research Funds for the Central Universities(Grant No.2042022kf1201);Special Fund of Hubei Luojia Laboratory。

摘  要:In recent years,the remote sensing image(RSI)semantic segmentation attracts increasing research interest due to its wide application.RSIs are difficult to be processed holistically on current GPU cards on account of their large field-of-views(FOVs).However,the prevailing practices such as downsampling and cropping will inevitably decrease the quality of semantic segmentation.To address this conflict,this paper proposes a new deep adaptive fusion network with multiple FOVs(MFVNet),which is specially designed for RSI semantic segmentation.Different from existing methods,MFVNet takes into consideration the differences among multiple FOVs.By pyramid sampling the RSI,we first obtain images on different scales with multiple FOVs.Images on the high scale with a large FOV can capture larger spatial contexts and complete object contours,while images on the low scale with a small FOV can keep the higher spatial resolution and more detailed information.Then scale-specific models are chosen to make the best predictions for all scales.Next,the output feature maps and score maps are aligned through the scale alignment module to overcome spatial misregistration among scales.Finally,the aligned score maps are fused with the help of adaptive weight maps generated by the adaptive fusion module,producing the fused prediction.The performance of MFVNet surpasses the previous state-of-the-art semantic segmentation models on three typical RSI datasets,demonstrating the effectiveness of the proposed MFVNet.

关 键 词:semantic segmentation remote sensing image(RSI) field-of-view(FOV) adaptive fusion convolutional neural network 

分 类 号:TP751[自动化与计算机技术—检测技术与自动化装置]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象