基于CNN-GCN双流网络的高分辨率遥感影像场景分类  被引量:11

CNN-GCN-based dual-stream network for scene classification of remote sensing images

在线阅读下载全文

作  者:邓培芳 徐科杰 黄鸿[1,2] DENG Peifang;XU Kejie;HUANG Hong(Key Laboratory of Optoelectronic Technology and Systems of the Education Ministry of China,Chongqing University,Chongqing 400044,China;State Key Laboratory of Coal Mine Disaster Dynamics and Control,Chongqing University,Chongqing 400044,China)

机构地区:[1]重庆大学,光电技术及系统教育部重点实验室,重庆400044 [2]重庆大学,煤矿灾害动力学与控制国家重点实验室,重庆400044

出  处:《遥感学报》2021年第11期2270-2282,共13页NATIONAL REMOTE SENSING BULLETIN

基  金:国家自然科学基金(编号:42071302);中央高校基本科研业务费(编号:2020CDCGTM002);重庆市基础研究与前沿探索项目(编号:cstc2018jcyjAX0093);重庆市留学人员回国创业创新支持计划(编号:cx2019144);重庆市研究生科研创新项目(编号:CYB19039)。

摘  要:高分辨率遥感影像具有复杂的几何结构和空间布局,传统的卷积神经网络的方法仅能提取场景图像中的全局特征,忽略了上下文的关系,导致特征的表达能力受限,制约了分类精度提高。针对此问题,本文提出一个面向高分辨率遥感影像场景分类的CNN-GCN双流网络,该算法包含CNN流和GCN流两个模块。CNN流基于预训练DenseNet-121网络提取高分影像的全局特征;而GCN流采用由预训练VGGNet-16网络得到的卷积特征图构建邻接图,再通过GCN模型提取高分影像的上下文特征。最后,通过加权级联的方式有效地融合全局特征和上下文特征并利用线性分类器实现分类。本文选取AID、RSSCN7和NWPU-RESISC45共3个具有挑战性的数据集进行实验,得到的最高分类精度分别是97.14%、95.46%和94.12%,结果表明本文算法能够有效地表征场景并取得具有竞争力的分类结果。Scene classification is an important research topic,which aims at assigning a semantic label to a given image.High-SpatialResolution(HSR)images contain abundant information of ground objects,such as geometric structure and spatial layout.Complex HSR images are difficult to interpret effectively.Extracting discriminative features is the key step to improve classification accuracy.Various methods for constructing discriminative representations,including handcrafted feature-based methods and deep learning-based methods,have been proposed.The former methods focus on designing different handcrafted features via professional knowledge and describing a scene through single feature or multifeature fusion.However,for a complex scene,handcrafted features show limited discriminative and generalization capabilities.Deep learning-based methods,due to the powerful capability of feature extraction,have made incredible progress in the field of scene classification.Compared with the former methods,Convolutional Neural Networks(CNNs)can automatically extract deep features from massive HSR images.Nevertheless,CNNs merely focus on global information,which makes it fail to explore the context relationship of HSR images.Recently,Graph Convolutional Networks(GCNs)have become an important branch of deep learning,and they have been adopted to model spatial relations hidden in HSR images via graph structure.In this paper,a novel architecture termed CNNGCN-based Dual-Stream Network(CGDSN)is proposed for scene classification.The CGDSN method contains two modules:CNN and GCN streams.For the CNN stream,the pretrained DenseNet-121 is employed as the backbone to extract the global features of HSR images.In the GCN stream,VGGNet-16 that is pretrained well on ImageNet is introduced to generate feature maps of the last convolutional layer.Then,an average pooling is organized for downsampling before the construction of an adjacency matrix.Given that every image is represented by a graph,a GCN model is developed to demonstrate context relationships.

关 键 词:高分辨率影像 遥感场景分类 图神经网络 卷积神经网络 特征融合 

分 类 号:TP751[自动化与计算机技术—检测技术与自动化装置] TP183[自动化与计算机技术—控制科学与工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象