科技文献数据化及组织呈现路径研究  被引量:11

Datafication,Organization and Manifestation of Scientific Literature

在线阅读下载全文

作  者:徐雷[1] 秦翠玉 李娇 XU Lei;QIN Cuiyu;LI Jiao

机构地区:[1]武汉大学信息管理学院,湖北武汉430072 [2]武汉大学语义出版与知识服务重点实验室,湖北武汉4300720

出  处:《中国图书馆学报》2022年第3期25-42,共18页Journal of Library Science in China

基  金:教育部规划基金项目“科学出版物语义组织模式及其实现路径研究”(编号:20A10486066);武汉大学自主科研项目“图情档领域科学本体构建与应用研究”的研究成果之一。

摘  要:文本型科技文献是当前科学知识表达以及科学交流的主要形态。为了促进科学交流,对日益增长的科技文献中的科学知识进行数据化及组织呈现的研究和实践逐渐增多。本文对科技文献数据化及组织呈现方法、应用场景、实现技术进行了系统梳理,包括科技文献的元数据化、科学词汇抽取、领域实体及其关系识别、篇章功能结构识别、科技文献语义组织以及科技文献呈现与智能化应用六个维度,总结目前该研究领域存在的主要问题;在此基础上设计了科技文献数据化及组织呈现的整体框架,阐述了该框架实现的四个核心技术:识别抽取技术、语义组织技术、分析推理技术以及展陈交互技术;最后归纳总结了该领域面临的挑战,如科学知识自动获取、科学数据质量及信任性、科学知识交互体验等。未来需要加强各方合作,以高质量的科学数据为基础,实现科学知识的叙事生产和转化。图4。表3。参考文献69。In order to facilitate scientific communication, there is an increasing number of research practices that digitize and reorganize scientific knowledge from the growing body of scientific literature. This paper summarizes the main approaches of current scientific knowledge acquisition, including all kinds of academic databases and academic search engines, social network platforms of science, and open access academic platforms, etc. With the convenience of scientific knowledge access, we also face new kinds of scientific communication difficulties, such as quite low efficiency of reading comprehension of large scale of literature. In order to break this new scientific communication dilemma, datafication, organization and manifestation of scientific literature as the mainstream practices are carried out widely. This paper makes a survey about this research field systematically from aspects of related concepts, application scenarios and implementation technologies. It includes six dimensions, namely meta-datamation of scientific literature, extraction of scientific vocabularies, recognition of scientific entities and their relations, recognition of discourse function structure, semantic organization of scientific literature, presentation and intelligent application of scientific literature. Recognition and extraction of scientific vocabularies, named entities and their relations, discourse structure such as knowledge units and summary of scientific literature, are the mainstream practices in this research field. A large number of data models and scientific knowledge graphs have been constructed based on scientific literature content, and studies on distributed representation of scientific literature have also been increased gradually. However, the degree of automatic indexing of literature by using current data models needs to be strengthened, and the granularity of scientific knowledge graphs needs to be deepened. In the aspect of intelligent applications about scientific literature, higher intelligent form of liter

关 键 词:科技文献 科学知识 数据化 组织呈现 科学交流 

分 类 号:G255.51[文化科学—图书馆学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象