Authors: XU Xin (徐欣); DU Junping (杜军平) [1]; XUE Zhe (薛哲) (Beijing Key Laboratory of Intelligent Telecommunication Software and Multimedia, School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, China)
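Affiliation: [1] Beijing Key Laboratory of Intelligent Telecommunication Software and Multimedia, School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876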
Source: Computer Engineering and Applications (《计算机工程与应用》), 2022, No. 22, pp. 116-122 (7 pages)
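Funding: National Key Research and Development Program of China (2018YFB1402600); National Natural Science Foundation of China (61772083, 61802028); Guangxi Science and Technology Major Project (桂科AA18118054).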
Abstract: Since the data of scientific and technological achievements present cross-domain and interdisciplinary characteristics, traditional information retrieval technology is unable to meet users' increasing need for intelligent and precise acquisition of information on scientific and technological achievements. This paper analyzes the research status in the fields of knowledge graphs and information retrieval. Scientific and technological achievement data are crawled efficiently from the Internet with web crawlers. Entity recognition and relation extraction techniques are adopted to identify and discover scientific and technological entities in these data, and a knowledge graph of scientific and technological achievements is constructed, so as to realize the structured storage of the data. This paper builds an efficient index of scientific and technological entities based on the ElasticSearch search engine, studies a method for calculating the semantic similarity of scientific and technological achievements, and realizes an intelligent query system for scientific and technological achievements based on the knowledge graph. The experimental results demonstrate that the constructed system realizes the efficient query of scientific and technological achievements and the association discovery of related topic contents.
Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]