Address matching based on full-text indexed knowledge graph for hazardous materials transportation  (Cited by: 2)


Authors: Liu Fei [1,2], He Xiangyang, Zou Zhiyun

Affiliations: [1] School of Civil & Hydraulic Engineering, Huazhong University of Science & Technology, Wuhan 430074, China; [2] Ningbo Transport Development Research Center, Ningbo, Zhejiang 315042, China

Source: Application Research of Computers (《计算机应用研究》), 2022, Issue 2, pp. 407-410, 431 (5 pages)

Abstract: Address matching is one of the key technologies for origin-destination (OD) survey and analysis in hazardous materials transportation. To address the low matching accuracy for complex, non-standard addresses in hazardous materials road transportation, this paper builds a self-expanding Chinese word segmentation method and a self-expanding knowledge graph of address data, and matches Chinese addresses against the full-text indexed knowledge graph. A weighted Pinyin full-text search mechanism is incorporated to improve matching accuracy for misspelled addresses. An online geocoding interface is combined with these steps to form a multi-stage matching mechanism, and semi-supervised matching is applied to the small number of remaining difficult addresses, yielding a complete address matching method for hazardous materials transportation. Experiments on address data from electronic waybills for hazardous materials transportation show that the algorithm achieves high accuracy and high precision on complex Chinese addresses: accuracy reaches 94.6% on a random address test set and 67.5% on a hard-to-classify address test set. On the hard-to-classify set, both accuracy and precision improve substantially over general matching methods and geographic search engines.
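The weighted Pinyin search idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the toy `PINYIN` table, the bigram Dice similarity, the weights `w_char` and `w_pinyin`, the `threshold`, and the sample addresses are all assumptions chosen for illustration. It only shows why adding a Pinyin-level score helps when a character is replaced by a homophone.

```python
from collections import Counter

# Hypothetical toy Pinyin table; a real system would need a full dictionary.
PINYIN = {
    "宁": "ning", "波": "bo", "镇": "zhen", "振": "zhen", "海": "hai",
    "化": "hua", "工": "gong", "区": "qu", "北": "bei", "仑": "lun", "港": "gang",
}

def char_bigrams(text):
    """Multiset of adjacent character pairs."""
    return Counter(text[i:i + 2] for i in range(len(text) - 1))

def pinyin_bigrams(text):
    """Multiset of adjacent syllable pairs; unknown characters map to themselves."""
    syllables = [PINYIN.get(ch, ch) for ch in text]
    return Counter(zip(syllables, syllables[1:]))

def dice(a, b):
    """Dice coefficient between two multisets (0.0 when both are empty)."""
    total = sum(a.values()) + sum(b.values())
    if total == 0:
        return 0.0
    return 2 * sum((a & b).values()) / total

def score(query, candidate, w_char=0.6, w_pinyin=0.4):
    """Weighted sum of character-level and Pinyin-level similarity (assumed weights)."""
    return (w_char * dice(char_bigrams(query), char_bigrams(candidate))
            + w_pinyin * dice(pinyin_bigrams(query), pinyin_bigrams(candidate)))

def best_match(query, gazetteer, threshold=0.5):
    """Return (best candidate, score), or (None, score) if below the threshold."""
    best = max(gazetteer, key=lambda c: score(query, c))
    s = score(query, best)
    return (best, s) if s >= threshold else (None, s)

# Made-up gazetteer entries; the query misspells 镇 as its homophone 振.
gazetteer = ["宁波镇海化工区", "宁波北仑港区"]
match, s = best_match("宁波振海化工区", gazetteer)
print(match, s)
```

In this toy case the character-level score alone drops because two bigrams differ, while the Pinyin-level score stays at its maximum, so the weighted sum still clears the threshold. In a multi-stage setup like the one the abstract describes, a query that falls below the threshold would be passed on to a later stage instead of being matched here.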

Keywords: hazardous materials transportation address matching; Chinese word segmentation; full-text search; knowledge graph

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

 
