检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:李龚林 范一晨 米宇舰 李明[1] LI Gonglin;FAN Yichen;MI Yujian;LI Ming(School of Economics and Management,Xi’an University of Technology,Xi’an Shaanxi 710054,China)
机构地区:[1]西安理工大学经济与管理学院,西安710054
出 处:《计算机应用》2023年第S02期28-33,共6页journal of Computer Applications
基 金:陕西省大学生创新创业训练计划项目(S202210700148)。
摘 要:针对单一模型用于文本分类存在的模型体量大,难以适用于舆情信息文本的多元化非规范的表达等问题,提出基于Bagging训练思想的、动态微调和二次加权的模型集成算法(Bagging-DyFAS)。首先,使用自助采样构建的数据集训练弱分类器,使该分类器具有一定的先验知识;其次,依据该分类器在开发集的表现,进行一次动态加权和一次静态加权,并使用得到的一系列权重将模型泛化到无标注的数据上,进一步提升模型在文本分类任务的性能。在所构建的数据集上的实验结果表明,在训练一轮的情况下,相较于基线模型MiniBRT、BRT3和LERT(Linguisticallymotivated bidirectional Encoder Representation from Transformer),所提算法的准确率、精确率、召回率和F1值分别至少提升3.6、3.8、1.3和3.2个百分点,实验结果验证了所提算法的有效性。In view of the problems of large model size and difficulty in applying a single model for text classification to diverse and non-normative representations of public opinion information,a model ensemble algorithm based on Bagging-Dynamic Fine-tuning And Secondary weighting(Bagging-DyFAS)was proposed.First,weak classifiers were trained with a dataset constructed by self-sampling,so that some priori knowledge was in the classifiers.Then,they were dynamically weighted once and statically weighted once based on their performance in the development set.Using the obtained series of weights,the models were generalized to unlabeled data,which could further improve the performance of the models in text classification tasks.Experimental results on the constructed test dataset show that,after training for on round,compared to the baseline models MiniBRT,BRT3 and LERT(Linguistically-motivated bidirectional Encoder Representation from Transformer),the proposed algorithm improved the accuracy,precision,recall and F1 value by at least 3.6,3.8,1.3 and 3.2 percentage points respectively,validating the effectiveness of the proposed algorithm.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.166