Authors: YANG Rongying (杨荣莹), HE Qing (何庆), DU Nisuo (杜逆索) [2,3]
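Affiliations: [1] College of Big Data & Information Engineering, Guizhou University, Guiyang 550025, China; [2] Guizhou Provincial Key Laboratory of Public Big Data, Guizhou University, Guiyang 550025, China; [3] Guizhou Province Big Data Industry Development and Application Research Institute, Guizhou University, Guiyang 550025, China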
Source: Computer Engineering and Applications (《计算机工程与应用》), 2022, No. 8, pp. 117-124 (8 pages)
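Funding: Major Special Project of the Guizhou Provincial Science and Technology Program (黔科合重大专项字[2018]3002, 黔科合重大专项字[2016]3022); Open Project of the Guizhou Provincial Key Laboratory of Public Big Data (2017BDKFJJ004); Young Science and Technology Talent Development Project of the Guizhou Provincial Department of Education (黔科合KY字[2016]124).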
Abstract: The method introduces no auxiliary features and relies only on the text itself, building several feature extractors to mine abstract, deep, high-dimensional features of the text sequence. A pre-trained BERT model supplies information-rich word embeddings. The embeddings are fed into a BiLSTM and an IDCNN separately for a first round of feature extraction; a gating mechanism is introduced into the IDCNN to obtain higher-dimensional features and to provide multi-channel information transmission with flow control. A multi-head self-attention mechanism is added to improve the efficiency of feature extraction. A shared BiLSTM enables interactive flow of feature information and strengthens the feature representation. Two CRF models enrich the feature distribution and pass feature information across layers, improving the accuracy of label sequence prediction. In tests on two datasets against four NER models, the F1 score improves to some extent.
Keywords: feature extraction; word embedding; gating mechanism; shared BiLSTM; multi-head self-attention
Classification No.: TP391 [Automation and Computer Technology: Computer Application Technology]
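The abstract mentions a gating mechanism inside the IDCNN but does not give its exact form. As a rough illustration only, the sketch below assumes a GLU-style gate (a feature convolution multiplied elementwise by the sigmoid of a second convolution) on a single-channel sequence. The function names, kernels, and the dilation schedule (1, 1, 2) are hypothetical choices for this example and are not taken from the paper.

```python
import math


def dilated_conv1d(seq, kernel, dilation):
    """Same-length single-channel 1-D dilated convolution with zero padding."""
    center = len(kernel) // 2
    out = []
    for t in range(len(seq)):
        acc = 0.0
        for j, w in enumerate(kernel):
            # Taps are spaced `dilation` positions apart around position t.
            idx = t + (j - center) * dilation
            if 0 <= idx < len(seq):
                acc += w * seq[idx]
        out.append(acc)
    return out


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def gated_dilated_conv1d(seq, feature_kernel, gate_kernel, dilation):
    """Assumed GLU-style gate: features * sigmoid(gates), elementwise."""
    features = dilated_conv1d(seq, feature_kernel, dilation)
    gates = dilated_conv1d(seq, gate_kernel, dilation)
    return [f * sigmoid(g) for f, g in zip(features, gates)]


def iterated_block(seq, feature_kernel, gate_kernel, dilations=(1, 1, 2)):
    """Apply the gated layer repeatedly with a (hypothetical) dilation schedule."""
    out = list(seq)
    for d in dilations:
        out = gated_dilated_conv1d(out, feature_kernel, gate_kernel, d)
    return out


if __name__ == "__main__":
    tokens = [0.2, -0.1, 0.4, 0.0, 0.3, -0.2, 0.1]  # toy scalar "embeddings"
    print(iterated_block(tokens, [0.5, 1.0, 0.5], [0.0, 1.0, 0.0]))
```

In a real model each position would carry a vector (for example a BERT embedding) and the kernels would be learned; the scalar version only shows how dilation widens the receptive field while the gate scales how much of each feature passes through.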