基于样本分布损失的图像多标签分类研究

Study on Multi-label Image Classification Based on Sample Distribution Loss

作　　者：朱旭东熊贇 ZHU Xu-dong;XIONG Yun(School of Computer Science and Technology,Fudan University,Shanghai 200433,China;Research Center of Dataology and Data Science,Fudan University,Shanghai 200433,China)

机构地区：[1]复旦大学计算机科学与技术学院,上海200433 [2]上海市数据科学重点实验室(复旦大学),上海200433

出　　处：《计算机科学》2022年第6期210-216,共7页Computer Science

基　　金：国家自然科学基金(U1636207)。

摘　　要：与一般图像分类场景下的数据分布情况不同,在图像多标签分类问题的场景下,不同标签类别之间存在样本数量分布不均衡,少量头部类别通常占据大多数样本数量的情况。而由于多个标签间同时标记的相关性,再加上多标签下困难样本的分布还与数据分布和类别分布相关,使得单标签问题中解决数据不平衡的重采样等方法在多标签场景下无法有效适用。文中提出了一种基于图像多标签场景下样本分布损失和深度学习的分类方法。首先对多标签数据不均衡分布设置类别相关重采用损失,并通过动态学习方式防止分布过度异化,然后设计非对称样本学习损失,设置对正负样本和困难样本的不同学习能力,同时通过软化样本学习权重减少信息丢失。相关数据集的实验显示,所提算法在解决多标签数据分布不均衡场景下的样本学习问题时取得了很好的效果。Different from the data distribution in general image classification scenarios,in the scenario of multi label image classification,the sample number distribution among different label categories is unbalanced,and a small number of head categories often account for the majority of sample size.However,due to the correlation between multiple labels,and the distribution of diffi-cult samples under multiple labels is also related to the data distribution and category distribution,the re-sampling and other methods for solving the data imbalance in the single label problem cannot be effectively applied in the multi label scenario.This paper proposes a classification method based on the loss of sample distribution in multi label image scene and deep learning.Firs-tly,the unbalanced distribution of multi label data is set with category correlation,and the loss is re-used,and the dynamic lear-ning method is used to prevent the excessive alienation of distribution.Then,the asymmetric sample learning loss is designed,and different learning abilities for positive and negative samples and difficult samples are set.At the same time,the information loss is reduced by softening the sample learning weight.Experiments on related data sets show that the algorithm has achieved good results in solving the sample learning problem in the scene of uneven distribution of multi-label data.

关键词：多标签标签关系重采样深度学习图像分类

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于样本分布损失的图像多标签分类研究

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于样本分布损失的图像多标签分类研究

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索