基于生成对抗网络的烟田土壤有机质含量高光谱估测  

Hyperspectral Estimation of Soil Organic Matter Content in Tobacco Fields Based on Generated Adversarial Network

在线阅读下载全文

作  者:夏雨 武洪艳 高加明[3] 徐锐 郭利 程雪莹 王志坤 张继光[5] 胡晓[1] 王勉[3] XIA Yu;WU Hongyan;GAO Jiaming;XU Rui;GUO Li;CHENG Xueying;WANG Zhikun;ZHANG Jiguang;HU Xiao;WANG Mian(College of Information Science and Engineering,Shandong Agricultural University,Tai’an 271018,Shandong,China;Tianjin Zhongyuan Industrial Co.,Ltd.,Tianjin 300467,China;Hubei Tobacco Company,Wuhan 430030,China;Xiangyang Tobacco Company of Hubei Province,Xiangyang 441003,Hubei,China;Tobacco Research Institute of Chinese Academy of Agricultural Sciences,Qingdao 266101,China)

机构地区:[1]山东农业大学信息科学与工程学院,山东泰安271018 [2]天津众远实业有限公司,天津300467 [3]湖北省烟草公司,武汉430030 [4]湖北省烟草公司襄阳市公司,湖北襄阳441003 [5]中国农业科学院烟草研究所,青岛266101

出  处:《中国烟草科学》2025年第1期106-115,共10页Chinese Tobacco Science

基  金:湖北省烟草公司科技项目(027Y2022-004);中国农业科学院科技创新工程(ASTIP-TRIC06)。

摘  要:土壤有机质(soil organic matter,SOM)是评价土壤肥力高低的一项重要指标,在烟草生长过程中发挥了重要的作用。本研究在采集湖北省烟田土壤样本基础上,借助生成式对抗网络(generative adversarial networks,GAN)生成伪样本扩充建模集。使用标准正态变换(standard normal variable,SNV)、多元散射校正(multiplicative scatter correction,MSC)组合一阶微分(FD)、倒数对数(LR)以及倒数对数一阶微分(LRFD)进行预处理,结合皮尔逊相关系数(Pearson correlation coefficient,PCC)筛选敏感特征波段。使用偏最小二乘回归(partial least squares regression,PLSR)、随机森林(random forest,RF)和反向传播神经网络(back propagation neural networks,BPNN)3种机器学习方法,构建烟田SOM含量估测模型。结果表明:(1)25000次训练后的GAN模型,生成的伪样本具有与真实样本相似的特征和规律;(2)经过MSC+LRFD预处理后,全波段反射率与SOM含量的相关性得到了提高,相关系数最高可达到0.66;(3)伪样本数量占比为150%时,经过特征波段筛选后,MSC+BPNN模型验证精度最优,其决定系数(coefficient of determination,R^(2))、相对分析误差(relative percent difference,RPD)和均方根误差(root mean square error,RMSE)分别为0.80、2.22和3.18。相比较原始数据集构建的最优模型,其模型精度提升了9.59%。研究证实,将GAN模型生成的伪样本添加进建模集中,可有效提高模型的估测性能,为复杂山区烟田SOM估测提供一种新的途径。Soil organic matter(SOM)is a crucial indicator for evaluating soil fertility and plays an important role in tobacco growth.In this study,soil samples from tobacco fields in Hubei Province were collected,and generative adversarial networks(GAN)were used to generate pseudo-samples to expand the modeling set.Reflectance data were preprocessed by using standard normal variate(SNV),multiplicative scatter correction(MSC),first derivative(FD),logarithm reciprocal(LR),and logarithm reciprocal first derivative(LRFD).Sensitive spectral bands were selected based on pearson correlation coefficients.Partial least squares regression(PLSR),random forest(RF),and back propagation neural networks(BPNN)were then used to construct SOM estimation models for the tobacco fields.Results showed as the follows(1)After the GAN model was trained for 25000 times,the generated pseudo samples showed similar characteristics and rules of real samples.(2)After MSC+LRFD preprocessing,the correlation between full band spectral reflectance and SOM content was increased,with the value of the correlation coefficient reaching up to 0.66.(3)When the pseudo-sample quantity reached 150%,after feature band selection,the MSC+BPNN model showed the best validation accuracy with a coefficient of determination(R^(2)),relative percent difference(RPD),and root mean square error(RMSE)of 0.80,2.22,and 3.18,respectively.Compared to the optimal model constructed from the original dataset,the model accuracy improved by 9.59%.The results from this study confirmed that adding GAN-generated pseudo-samples to the modeling set effectively enhanced model estimation performance,providing a new approach for SOM estimation in complex mountainous tobacco fields.

关 键 词:土壤有机质 高光谱 生成式对抗网络 反向传播神经网络 

分 类 号:S572[农业科学—烟草工业] S126[农业科学—作物学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象