基于不平衡数据与集成学习的图像情感分类研究  

Research on Image Sentiment Classification Based on Unbalanced Data and Ensemble Learning

在线阅读下载全文

作  者:杨松 吴桐 苏博 YANG Song;WU Tong;SU Bo(School of Software,Dalian University of Foreign Languages;Research Center for Networks Space Multi-Languages Big Data Intelligence Analysis,Dalian University of Foreign Languages,Dalian 116044,China;Dalian Customs Logistics Management Center,Dalian 116007,China)

机构地区:[1]大连外国语大学软件学院 [2]大连外国语大学网络空间多语言大数据智能分析研究中心,辽宁大连116044 [3]大连海关后勤管理中心,辽宁大连116007

出  处:《软件导刊》2023年第7期118-124,共7页Software Guide

基  金:国家自然科学基金项目(61806038);辽宁省社会科学规划基金项目(L18BTQ005);辽宁省教育厅科学研究项目(2019JYT07)。

摘  要:目前,基于图像的情感分析已成为情感计算领域的研究热点。针对图像情感分析常用的开放数据集通常表现为多分类的不均衡数据,单一模型存在抽取特征单一、泛化能力不强等问题。首先,改进Focal Loss损失函数,使模型跟随训练进度动态调整聚焦参数。然后,设定概率阈值参数确定困难样本,通过舍弃困难样本避免模型学习错误特征。接下来,选取3个分类性能良好的卷积神经网络模型作为基分类器,分别关注图像的局部、颜色及深度语义特征。最后,采取加权投票法策略,引入信息熵更新多分类器决策的权值。实验表明,所提方法能提升图像情感多分类的准确率,可为基于不平衡数据与集成学习的图像情感分类研究提供参考与借鉴。At present,image based affective analysis has become a research hotspot in the field of affective computing.The open data set com⁃monly used in image emotion analysis is usually presented as multi category unbalanced data,and the single model has the problems of single feature extraction and weak generalization ability.First,the Focal Loss loss function is improved to make the model dynamically adjust the fo⁃cus parameters following the training progress.Then,probability threshold parameters are set to determine the difficult samples,and the mod⁃el avoids learning incorrect features by discarding the difficult samples.Next,select three convolutional neural network models with good clas⁃sification performance as base classifiers,focusing on the local,color,and depth semantic features of the image.Finally,the weighted voting strategy is adopted,and information entropy is introduced to update the weights of multi classifier decisions.The experiment shows that the pro⁃posed method can improve the accuracy of image sentiment multi classification,and can provide reference for the research of image sentiment classification based on imbalanced data and ensemble learning.

关 键 词:图像情感分析 不均衡数据 Focal loss损失函数 困难样本 加权投票法 

分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象