检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:马月坤 冯烨琛 MA Yuekun;FENG Yechen(College of Artificial Intelligence,North China University of Science and Technology,Tangshan 063210,China;Hebei Provincial Key Laboratory of Industrial Intelligent Perception,Tangshan 063210,China;School of Computer&Communication Engineering,University of Science&Technology Beijing,Beijing 100083,China;Beijing Key Laboratory of Knowledge Engineering for Materials Science,Beijing 100083,China)
机构地区:[1]华北理工大学人工智能学院,河北唐山063210 [2]河北省工业智能感知重点实验室,河北唐山063210 [3]北京科技大学计算机与通信工程学院,北京100083 [4]材料领域知识工程北京市重点实验室,北京100083
出 处:《郑州大学学报(理学版)》2024年第2期34-42,共9页Journal of Zhengzhou University:Natural Science Edition
基 金:河北省三三三人才项目(A201803082)。
摘 要:自然语言种类丰富、形式灵活多变的特征使得隐式关系抽取成为目前关系抽取领域中富有难度和挑战性的任务之一。通过引入构式语法理论和依存句法分析两种认知语言学范畴的理论技术,构建了一种面向中医古籍中隐式关系的抽取方法。首先利用构式语法理论制定文本构式化策略、分析并定义出8种构式特征与5种构式类型,并使用CART(classification and regression tree,CART)分类模型完成文本分类;其次对其中4类构式使用依存句法分析技术构建句法树,通过分析句法树中的特定结构,制定医学类实体间的关系三元组抽取规则,实现隐式关系抽取;最后在经典中医古籍《黄帝内经》数据集上进行测试,实验结果表明了方法的有效性。The variety of natural languages and their flexible forms are very rich,which makes implicit relation extraction one of the difficult and challenging tasks in the field of relation extraction.Two theoretical techniques were introduced in the field of cognitive linguistics,namely constructive grammar theory and dependent syntactic analysis,to construct a method for extracting implicit relations in traditional Chinese medical texts.Firstly,the constructive grammar theory was used to formulate a text structuring strategy,analyze and define eight constructive features and five constructive types,and the CART classification model was used to classify the text.Secondly,the dependent syntactic analysis technique was used to construct a syntactic tree for four constructs,and by analyzing the specific structure of the syntactic tree,the extraction rules of the relational triad between medical entities were formulated to realize the implicit relation extraction.Finally,the tests were conducted on the dataset of the classic traditional Chinese medical text,i.e.,Huangdi Neijing,and the experimental results showed the effectiveness of the method.
关 键 词:关系抽取 中医古籍 隐式关系 构式语法理论 依存句法分析
分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.15