检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:化柏林[1,2] 何鸿魏 HUA Bolin;HE Hongwei(Department of Information Management,Peking University,Beijing 100871,China;Key Laboratory of Rich-media Knowledge Organization and Service of Digital Publishing Content,Beijing 100038,China;Fudan University Library,Shanghai 200433,China)
机构地区:[1]北京大学信息管理系,北京100871 [2]富媒体数字出版内容组织与知识服务重点实验室,北京100038 [3]复旦大学图书馆,上海200433
出 处:《情报科学》2023年第7期90-99,共10页Information Science
基 金:中国科学技术信息研究所富媒体数字出版内容组织与知识服务重点实验室开放基金“人工智能领域知识图谱构建研究”(ZD2021-11/07)。
摘 要:【目的/意义】为了明晰科学实体在学术文本中承担的语义角色,进而建立特定领域的术语的知识结构,本文提出一种以科学实体词间关系为特征工程的术语分类方法,从学术研究的语义属性角度,将学术文本中出现的科学实体分为“研究领域”“研究问题”“研究方法”“研究工具”“其他”五类。【方法/过程】采用依存句法分析的方法,对于学术文本中存在两个及两个以上科学实体的句子,挖掘它们之间的最短依存路径,将最短依存路径上的谓词成分作为实体之间的关系进行提取,构造2D矩阵作为卷积神经网络的输入,完成实体的分类研究。【结果/结论】该模型在Web of Science上获取的“人工智能”领域的学术文献进行验证,精确率为89.38%,召回率为92.46%,F1值为0.9089。【创新/局限】由科学实体关系构成的矩阵是稀疏矩阵,在计算过程中会对计算速度产生不利影响;在关系抽取的环节比较依赖依存句法分析分析工具的处理效果。【Purpose/significance】In order to clarify the semantic role of scientific entities in academic texts,so as to establish the knowledge structure of terminologies in specific fields,this paper proposes a classification method of scientific entities that takes the relationship between entities as a feature engineering.From the perspective of semantic attribute of academic research,the keywords of academic texts are divided into"research field","research problem","research method","research tool"and"other".【Method/pro⁃cess】With the method of dependency syntax analysis,for sentences with two or more entities in the academic text,the shortest depen⁃dency path between entities is mined,the predicate components on the shortest dependency path are extracted as the relationship be⁃tween entities,the relationship between words is taken as the feature,and a 2D matrix is constructed as the input of the convolutional neural network to complete the classification of entities.【Result/conclusion】The model was verified in the academic literature in the field of"artificial intelligence"downloaded from Web of Science,with an accuracy rate of 89.38%,a recall rate of 92.46%,and an F1 value of 0.9089.【Innovation/limitation】The matrix formed by the relationship between entities is a sparse matrix,which will ad⁃versely affect the calculation speed in the calculation process.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.138.37.16