检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:田尤 高波[1,2] 殷红 李元灵 张佳佳 陈龙 李洪梁[1,2] TIAN You;GAO Bo;YIN Hong;LI Yuanling;ZHANG Jiajia;CHEN Long;LI Hongliang(Institute of Exploration Technology,Chinese Academy of Geological Sciences,Chengdu,Sichuan 611734,China;Technology Innovation Center for Risk Prevention and Mitigation of Geohazard,Ministry of Natural Resources,Chengdu,Sichuan 611734,China;Sichuan Province Engineering Technology Research Center of Geohazard Prevention,Chengdu,Sichuan 610081,China;Sichuan Geological Environment Survey and Research Center,Chengdu,Sichuan 610081,China)
机构地区:[1]中国地质科学院探矿工艺研究所,四川成都611734 [2]自然资源部地质灾害风险防控工程技术创新中心,四川成都611734 [3]四川省地质灾害防治工程技术研究中心,四川成都610081 [4]四川省地质环境调查研究中心,四川成都610081
出 处:《水文地质工程地质》2024年第6期171-181,共11页Hydrogeology & Engineering Geology
基 金:中国地质调查局地质调查项目(DD20230449,DD20190644);第二次青藏高原综合科学考察研究项目(2019QZKK0902)。
摘 要:滑坡易发性评价中,样本不均衡问题的不同处理方案通常会带来评价结果的大量不确定性。针对这一问题,以藏东昌都市部分县(区)为研究区,构建滑坡/非滑坡样本不均衡数据集,采用不处理、下采样和合成少数类过采样(synthetic minority oversampling technique,SMOTE)3种处置方案,运用逻辑回归方法分别构建滑坡易发性评价模型。基于ROC曲线、准确度、精确率、召回率、漏检率等评价指标,采用综合评价指标F_(1)′同数对模型分类的精度进行验证。结果表明:数据处理成均衡数据集(过采样/下采样)建立的模型效果较不处理数据建立的模型效果有了大幅提升,F_(1)′同数的值最大提高了53.17%;在下采样、过采样两种数据处理方案中,过采样方法比下采样方法F_(1)′分数的值提高了16.30%,表明过采样方法对处理样本不均衡数据问题方面具有较好效果。研究成果可为滑坡预测和地质灾害预测前的数据集处理提供参考,为进一步提高区域防灾减灾水平提供理论与技术支持。In landslide susceptibility assessment,different approaches to handling sample imbalance can introduce significant uncertainty in evaluation outcomes.To address this issue,this study focused on the Changdu area of eastern Tibet and constructed the landslide susceptibility evaluation model using a dataset with imbalanced landslide and non-landslide samples.Three disposal schemes were applied:no treatment,downsampling,and SMOTE oversampling.The logistic regression method was used to construct the landslide susceptibility evaluation model.Based on ROC curve,accuracy,precision,recall,missed detection rate,and other evaluation indicators,the comprehensive evaluation index of F1′score was used to verify the accuracy of model classification.The results show that the modeling effect of landslide susceptibility obtained by data processing into equilibrium data(downsampling/oversampling)is greatly improved compared with that obtained without processing data.Specifically,the value of the F_(1)′score of the comprehensive index was increased by 53.17%.In the two schemes for processing data(downsampling and oversampling),the oversampling method increased the value of the composite index F_(1)′score by 16.30%compared with the downsampling method,indicating that the oversampling method has effectiveness in handling unbalanced data.This study can provide basic information for processing of data sets before landslide prediction and geological disaster prediction,and provide theoretical and technical support for further improving regional disaster prevention and mitigation.
关 键 词:滑坡易发性 合成少数类过采样技术 评价模型 昌都市 样本不均衡数据
分 类 号:P642.22[天文地球—工程地质学]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.7