检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:郭学兵[1,2] 朱小杰 唐新斋[1] 杨刚[3] 侯艳飞 何洪林 GUO Xuebing;ZHU Xiaojie;TANG Xinzhai;YANG Gang;HOU Yanfei;HE Honglin(Key Laboratory of Ecosystem Network Observation and Modeling,Institute of Geographic Sciences and Natural Resources Research,Chinese Academy of Sciences,Beijing 100101,China;National Ecosystem Science Data Center,Beijing 100101,China;Computer Network Information Center,Chinese Academy of Sciences,Beijing 100083,China)
机构地区:[1]中国科学院地理科学与资源研究所,生态系统网络观测与模拟重点实验室,北京100101 [2]国家生态科学数据中心,北京100101 [3]中国科学院计算机网络信息中心,北京100083
出 处:《数据与计算发展前沿(中英文)》2024年第4期96-105,共10页Frontiers of Data & Computing
基 金:国家重点研发计划(2022YFF1300100)。
摘 要:【背景】激光雷达(LiDAR)数据在森林资源分析利用方面有着广泛应用,科研人员研制了很多涉及大数据管理和人工智能的专业算法模型,这些算法模型目前多数散落在研究人员手里,尚缺乏新型信息化平台对其进行整合。【方法】大数据流水线系统πFlow软件具有大数据管理能力和大数据算法集成能力,并可以所见即所得方式构建流水线并调度运行流水线,适合于LiDAR数据复杂算法模型的整合,且流水线可定制、可复用。【内容】本文介绍了πFlow的特点和功能,并以基于LiDAR冠层高度模型(CHM)数据的树冠解析及利用机器学习方法估测树木生物量为例,介绍了将算法整合到πFlow并构建LiDAR数据分析处理流水线的方法和技术,且对流水线进行了测试运行。【结果】利用πFlow构建的可重复信息化平台可支撑野外站观测网络的LiDAR数据生物量快速反演,为数据密集型的专业数据处理算法模型的整合提供了创新方法技术。[Background]Light Detection and Ranging(LiDAR)data are widely used in the analysis and utilization of forest resources.Researchers have developed many professional algorithm models involving big data management and artificial intelligence.Currently,most of these algorithm models are scattered in the hands of researchers,and there is still a lack of new information platforms to integrate them.[Methods]The big data pipeline system such asπFlow has the capability of big data management and big data algorithm integration,and can build and schedule the pipeline in the way of WYSIWYG(what you see is what you get).It is suitable for integration of complex algorithm models for LiDAR data,and the pipeline can be customized and reused.[Contents]This paper introduces the characteristics and functions ofπFlow,taking tree crown segmentation and estimation of tree biomass using machine learning methods based on LiDAR tree canopy height model(CHM)data as an example.The paper presents the method and technology of integrating algorithms intoπFlow,constructs a LiDAR data analysis and processing pipeline,and conducts test operations to the pipeline.[Results]The reproducible information platform constructed usingπFlow could support fast biomass inversion of LiDAR data for multiple networked observational field sites,which can also provide an innovative technological method for the integration of data-intensive processing algorithm models.
关 键 词:大数据流水线 算法模型集成 激光雷达 机器学习 随机森林 πFlow
分 类 号:S718.5[农业科学—林学] TP181[自动化与计算机技术—控制理论与控制工程] TN958.98[自动化与计算机技术—控制科学与工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.7