基于知识图谱的中文科技文献问答系统构建研究  

Research on the Construction of Question Answering System of Chinese Scientific and Technical Literature Based on Knowledge Graph

在线阅读下载全文

作  者:李琳娜[1,2] 丁楷 韩红旗 王力[1,2] 李艾丹 LI Linna;DING Kai;HAN Hongqi;WANG Li;LI Aidan(Institute of Scientific and Technical Information of China,Beijing 100038;Key Laboratory of Richmedia Knowledge Organization and Service of Digital Publishing Content,Beijing 100038;Information Research Center,Sixth Academy of China Aerospace Science and Industry Group,Information Research Center,Huhhot 010000)

机构地区:[1]中国科学技术信息研究所,北京100038 [2]富媒体数字出版内容组织与知识服务重点实验室,北京100038 [3]中国航天科工集团六院情报信息研究中心,内蒙古呼和浩特010000

出  处:《中国科技资源导刊》2024年第4期51-62,共12页China Science & Technology Resources Review

基  金:中国科学技术信息研究所重点工作项目“智能情报融合创新体系建设研究与应用”(ZD2023-11);国家重点研发计划项目“颠覆性技术识别理论、方法与专家预判系统”(2019YFA0707201)。

摘  要:科技文献问答系统能以自然语言对话的方式为用户提供高水平的知识服务。针对语义解析型知识图谱问答系统存在跨领域适应性弱及现有基于深度学习、大模型的问答系统存在结果可解释性差且难以溯源的问题,提出基于句式特点的中文问题分类方法,并设计基于Pipeline方法的中文科技文献问答系统框架。实验结果表明,基于句式特点的问题分类具有不依赖于特定领域的特点且在效果上与基于意图的问题分类基本相当,基于Pipeline的问题解析方法能有效地将问题转化为知识图谱查询语句,从而满足用户对自动问答结果可解释、可溯源的基本需求。The Q&A system of scientific and technical literature can provide high-level knowledge services for researchers with natural language.But the current semantic parsing-based knowledge graph Q&A system has poor cross-domain adaptability and Q&A systems based on deep learning or large language model suffer from poor interpretability and traceability of results.Aiming to address these issues,this article proposed a Chinese question categorical method based on sentence patterns and designed a Pipeline based framework for the Q&A system of Chinese scientific and technical literature.The experimental results show that question classification based on sentence patterns does not rely on specific domains and its effectiveness is basically comparable to the question classification based on intentions.The Pipeline-based question parsing method can effectively transform questions into knowledge graph query statements and effectively meets users’need for Q&A answers with interpretability and traceability of results.

关 键 词:中文科技文献问答系统 知识图谱 问题分类体系 集成学习 

分 类 号:G250[文化科学—图书馆学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象