Overview of the research progress in entity recognition technology

Cited by: 1

Authors: MA Yijie [1]; LAI Haiguang [2]; LIU Ziwei [1]; YANG Nan [2]; ZHANG Gengxin [1]

Affiliations: [1] Institute of Satellite Communication, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu 210003, China; [2] Cowave Satellite Communication Technology Co., Ltd, Nanjing, Jiangsu 211135, China

Source: Journal of Terahertz Science and Electronic Information Technology, 2024, No. 5, pp. 503-515 (13 pages)

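Funding: National Natural Science Foundation of China (U21A20450, 62271266); Frontier Leading Technology Basic Research Special Project of Jiangsu Province (BK20192002, BK20212001).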

Abstract: Entity recognition is an important step in knowledge graph construction and is widely used in natural language processing tasks such as semantic networks, machine translation, and question answering, where it plays a key role in bringing natural language processing technology into practical use. Following the development history of the field, this paper surveys existing entity recognition methods: early rule- and dictionary-based methods, machine learning-based methods, and deep learning-based named entity recognition methods. For each class of methods, it summarizes the key ideas, advantages and disadvantages, and representative models, with particular attention to the currently widely used methods based on bi-directional long short-term memory (BiLSTM) and Transformer models. It also introduces the current mainstream datasets and evaluation criteria. Finally, with a view to the semantic requirements of future machine-type communication, it summarizes the challenges facing entity recognition technology and discusses its prospects for Internet of Things (IoT) business data.

Keywords: entity recognition; semantic extraction; deep learning; knowledge graph

Classification number: TN927.2 [Electronics and Telecommunications: Communication and Information Systems]

 
