机构地区:[1]中国科学院东北地理与农业生态研究所,长春130102 [2]中国科学院大学,北京100049
出 处:《农业工程学报》2021年第13期142-151,共10页Transactions of the Chinese Society of Agricultural Engineering
基 金:中科院战略性先导科技专项项目课题(XDA28010500);国家重点研发计划项目(2017YFB0503602)。
摘 要:准确的农作物分类图是农业监测和粮食安全评估的重要数据来源,针对传统的深度学习模型在多时相农作物遥感分类方面精度较低的问题,该研究将卷积维度单一的卷积神经网络(Convolutional Neural Networks,CNN)进行改进,提出了一种混合三维和二维卷积的神经网络识别模型(Hybrid Three Dimensional and Two Dimensional Convolutional Neural Networks,3D-2D CNN)。该模型首先通过多个三维卷积层提取时空特征,其次将输出的特征降维压缩后通过二维卷积层执行空域特征分析,最后将高层特征图展平后通过全连接层进行类别预测。试验以Landsat8多时相影像为数据源,将美国加利福尼亚州北部研究区的地块按照2:2:6分层随机划分为训练集、验证集和测试集。试验结果表明3D-2DCNN对13种农作物分类的总体精度(89.38%)、宏平均F1值(84.21%)和Kappa系数(0.881)均优于三维卷积神经网络(Three Dimensional Convolutional Neural Networks,3D-CNN)、二维卷积神经网络(Two Dimensional Convolutional Neural Networks,2D-CNN)、支持向量机(Support Vector Machines,SVM)和随机森林(Random Forest,RF)等方法,并在参数量和收敛时间方面比3D CNN大幅度减小。同时,在较小样本训练集下3D-2D CNN仍表现最优。该模型综合利用空间-光谱-时间特征并具有较高的分类精度和较强的鲁棒性,这为解决多时相遥感农作物分类问题提供了一个有效且可行的方案。Reliable and accurate classification of crop types can greatly contribute to data sources in agricultural monitoring and food security.Remote sensing can be used to rapidly and accurately extract the planting areas and distribution of main crops,thereby optimizing the spatial pattern of crops,grain production,and management.However,it is extremely difficult to identify and then map different types of crops with high accuracy and efficiency,especially for traditional machine learning.The reason is that there are highly complex and heterogeneous spectral data in crop space on time-series remote sensing images.Fortunately,three-dimensional convolution neural networks(3D CNN)are suitable for the spatio-temporal information in the time-series remote sensing imagery.Nevertheless,the high complexity of the 3D CNN model often requires a large number of training samples.In this study,a novel hybrid classification model(called 3D-2D CNN)was proposed to integrate 3D CNN and two-dimensional convolution neural networks(2D CNN)in the trade-off among accuracy,efficiency,and ground sample acquisition.The specific procedure was as follows.The spatio-temporal features were first extracted from the multiple 3D convolutional layers,then the output features were compressed for the spatial feature analysis in the 2D convolutional layer,and finally the high-level maps of features were flattened to predict the category in the fully connected layer.Batch normalization was performed on the input data of each layer to accelerate the network convergence.As such,the complex structure of the original 3D CNN was reduced,while the capacity of 3D-2D CNN remained in spatio-temporal feature extraction.Taking northern California,USA,as the study area,Landsat8 multi-temporal images were utilized as the remote sensing data source in the test to verify the model.Landsat images presented specific characteristics,compared with the natural.The spectral and texture features of the same type varied greatly along with the imaging time and conditions.Califor
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...