检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:张云玲 罗婷婷[1,2] 赵瑞雪 鲜国建[1,3] ZHANG YunLing;LUO TingTing;ZHAO RuiXue;XIAN GuoJian(Agricultural Information Institute of CAAS,Beijing 100081,P.R.China;Key Laboratory of Knowledge Mining and Knowledge Services in Agricultural Converging Publishing,National Press and Publication Administration,Beijing 100081,P.R.China;Key Laboratory of Agricultural Big Data,Ministry of Agriculture and Rural Affairs,Beijing 100081,P.R.China)
机构地区:[1]中国农业科学院农业信息研究所,北京100081 [2]国家新闻出版署农业融合出版知识挖掘与知识服务重点实验室,北京100081 [3]农业农村部农业大数据重点实验室,北京100081
出 处:《数字图书馆论坛》2022年第1期26-36,共11页Digital Library Forum
基 金:中国农业科学院科技创新工程项目(编号:CAAS-ASTIP-2016-AII)资助。
摘 要:开放仓储目录是对开放仓储的描述说明和索引,是开放学术资源利用、发现、共享的基础。本文首先通过对OpenDOAR、ROAR、BASE等5个国际主流开放仓储目录的建设现状进行调研分析,发现在国际开放仓储目录建设方面,还存在仓储目录收录不够完整、目录元数据项不够丰富、目录更新时效性有待提高、揭示系统功能相对单一等不足。在此基础上,本文提出开放仓储目录元数据整合研究,包括元数据描述规范设计、基于OAI协议和ETL工具收割元数据,使用数据清洗工具OpenRefine对元数据进行“形式去重”和OAI-Identify获取结果的“内容去重”,并建立对多源异构仓储目录进行匹配融合的方法路径,形成数据内容更丰富、数量更加全面的全球开放仓储目录GOAR核心集和扩展集。最后从建立动态更新融合机制、常态化监控机制和目录发布系统三方面提出下一步研究方向。The directory of open access repository is an instruction and index of open access repository,which is the basis for the utilization,discovery,and sharing of open academic resources.By discussing current situation and development of five international dominant open access repository registry construction,such as OpenDOAR,ROAR,BASE.We find that there are still deficiencies in the international open access repository registry revealing,such as the repository registry coverage is not complete,the registry metadata items are not abundant enough,the registry update timelines needs to be improved,and the revealing system function is relatively simple.Therefore,this paper proposes the integration of open access repository metadata based on OAI-PMH,including the design of metadata description pattern,the use of ETL tools to harvest metadata,the data cleaning tool OpenRefine to“form de-duplication”and OAI-Identify to obtain the results for“content de-duplication”.Finally,this paper established a path to match and integrate metadata items of multi-source heterogeneous repository directory,formed a more richer and comprehensive global open access repository directory,and suggested the future research direction in three aspects:establishing a dynamic update and integration mechanism,a regular monitoring mechanism and a directory issuing system.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.222.188.103