检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:Zhichen Hu Huali Ren Jielin Jiang Yan Cui Xiumian Hu Xiaolong Xu
机构地区:[1]School of Computer and Software,Nanjing University of Information Science and Technology,Nanjing,210044,China [2]Institution of Artificial Intelligence and Blockchain,Guangzhou University,Guangzhou,515021,China [3]School of Earth Sciences and Engineering,Nanjing University,Nanjing,210023,China [4]College of Mathematics and Information Science,Nanjing Normal University of Special Education,Nanjing,210023,China
出 处:《Computer Modeling in Engineering & Sciences》2023年第4期91-108,共18页工程与科学中的计算机建模(英文)
基 金:supported by the National Natural Science Foundation of China under Grant No.42050102;the National Science Foundation of China(Grant No.62001236);the Natural Science Foundation of the Jiangsu Higher Education Institutions of China(Grant No.20KJA520003).
摘 要:An obviously challenging problem in named entity recognition is the construction of the kind data set of entities.Although some research has been conducted on entity database construction,the majority of them are directed at Wikipedia or the minority at structured entities such as people,locations and organizational nouns in the news.This paper focuses on the identification of scientific entities in carbonate platforms in English literature,using the example of carbonate platforms in sedimentology.Firstly,based on the fact that the reasons for writing literature in key disciplines are likely to be provided by multidisciplinary experts,this paper designs a literature content extraction method that allows dealing with complex text structures.Secondly,based on the literature extraction content,we formalize the entity extraction task(lexicon and lexical-based entity extraction)for entity extraction.Furthermore,for testing the accuracy of entity extraction,three currently popular recognition methods are chosen to perform entity detection in this paper.Experiments show that the entity data set provided by the lexicon and lexical-based entity extraction method is of significant assistance for the named entity recognition task.This study presents a pilot study of entity extraction,which involves the use of a complex structure and specialized literature on carbonate platforms in English.
关 键 词:Named entity recognition carbonate platform corpus entity extraction english literature detection
分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.7