检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]中国科学院计算机网络信息中心科学数据中心,北京100190
出 处:《科研信息化技术与应用》2012年第3期29-37,共9页E-science Technology & Application
基 金:中国科学院计算机网络信息中心青年基金项目(CNIC_QN_09007)
摘 要:在分析现有数据质量校验方法与校验工具的基础上,借鉴科研领域的数据质量校验经验和规则引擎的相关技术,实现了基于知识规则的Excel数据质量校验工具,进而解决科研观测数据中异常记录判别、异常原因标识、数据可视化分析等关键技术问题。中国生态系统研究网络综合中心以及土壤分中心的应用表明,在不影响原有数据填报流程的前提下,该工具能很好地代替数据质量校验人员的手工查错工作,有效地提高数据质量校验的效率及准确性。Reviewing the existing methods and tools for data quality validation, this paper presents the development of an Excel data quality validation tool based on the customized knowledge rules database, learned from the experiences of data quality validation in scientific research. A number of key technical issues were solved in the research and observational data such as the discrimination of exception record, the identity of the reason for the exception, data visualization analysis and so on. The applications in Institute of Geographical Sciences and Natural Resources Research and Nanjing Institute of Soil, Chinese Academy of Sciences, showed that the tool could take the place of manual troubleshooting work and improve the efficiency and accuracy greatly in the data quality validation under the premise that the existing data reporting process was not affected.
分 类 号:N94[自然科学总论—系统科学]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.145