检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:郑晓芳 杨娜 傅军栋 Zheng Xiaofang;Yang Na;Fu Jundong(School of Electrical and Automation Engineering,East China Jiaotong University,Nanchang 330013,Jiangxi,China)
机构地区:[1]华东交通大学电气与自动化工程学院,江西南昌330013
出 处:《计算机应用与软件》2024年第2期117-122,129,共7页Computer Applications and Software
摘 要:针对电气照明设计领域规范条文繁杂,存在设计人员查询困难及对同一规范条文的理解偏差等问题,提出运用信息抽取技术建立该领域的知识图谱,并将专家经验融入其中。在预处理阶段引入互信息和边界熵两个参数对分词进行改进,避免了对专业名词的切分;通过语义角色标注与依存句法分析相结合的方法对数据三元组进行抽取,弥补单纯用语义角色标注方法不能抽取出多宾语的缺陷;用图数据库Neo4j存储,完成该领域知识图谱的构建。In view of the complexity of regulations in electrical lighting design,the difficulty for designers to query and the deviation of understanding of the same regulations,this paper proposes to use information extraction technology to establish the knowledge graph in this field,and integrate the expert experience into it.In the preprocessing stage,the two parameters of mutual information and boundary entropy were introduced to improve word segmentation,avoiding the segmentation of professional nouns.The method of combining semantic role labeling and dependency parsing was used to extract data triples,to make up the defects that simple method of semantic role labeling could not extract multiple objects.The graph database Neo4j was used for storage to complete the construction of the domain knowledge graph.
分 类 号:TP399[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.249