检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:张书瑜 张定祥[2] 王荣彬[2] 季宏伟[2] ZHANG Shuyu;ZHANG Dingxiang;WANG Rongbin;JI Hongwei(School of Earth Sciences,Zhejiang University,Hangzhou 310027,China;China Land Surveying and Planning Institute,Beijing 100035,China)
机构地区:[1]浙江大学地球科学学院,浙江杭州310027 [2]中国土地勘测规划院,北京100035
出 处:《浙江大学学报(理学版)》2018年第5期589-594,共6页Journal of Zhejiang University(Science Edition)
基 金:"十二五"国土资源调查评价--土地基础数据库整合集成与共享平台建设项目(DCPJ131707-01)
摘 要:为了从多源异构的复杂土地基础数据中快速准确地提取用户所需信息,提出了基于元数据的一体化管理检索方法.在元数据信息提取、元数据加权索引、实体同义词扩展检索3个环节中,结合土地领域专业知识和用户实际需求,设计和开发了共享元数据表结构、加权元数据中字段相对重要性和信息熵因子,构建地名实体和专题数据层实体同义词库,并集成到包括中文分词、实体识别、同义词扩展、索引检索和相似度计算的一体化管理检索框架中,解决了多源异构土地基础数据统一管理和精确检索的问题.实践表明,该方法较传统的通用信息检索方法具有更好的适用性和更高的准确率.In order to obtain the required information quickly and accurately from the complex multi-source heterogeneous land basic data,an integrated management and retrieval method based on metadata is proposed.More concretely,during the process of metadata information extraction,metadata weighted indexing and entity synonyms extended retrieval,three optimized methods are performed combined with the field expertise of land and the actual needs of users,which are design and development of sharing metadata structure,construction of weighted index based on relative importance of metadata columns and information entropy factor,and building synonym database of geographic name entities and thematic data layer entities.An integrated management and retrieval method is proposed,including features of word segmentation,entity recognition,synonym extension,index retrieval and similarity computation.And,the optimized methods mentioned above are integrated into the framework for unified management and precise retrieval for multi-source and heterogeneous land basic data.Experimentation and practical application show that the proposed method presents higher accuracy and better applicability than the traditional general information retrieval method.
关 键 词:多源异构土地基础数据 管理检索一体化 元数据信息提取 元数据加权索引 实体同义词扩展检索
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.28