Authors: Huang Yiwang; Huang Yuxin [2]; Liu Sheng
Affiliations: [1] School of Data Science, Tongren University, Tongren, Guizhou 554300; [2] School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou 350001; [3] Guizhou Provincial Key Laboratory of Public Big Data (Guizhou University), Guiyang 550025
Source: Journal of Computer Research and Development, 2024, No. 12, pp. 3121-3133 (13 pages)
Funding: National Natural Science Foundation of China (62066040, 62261047); Open Fund of the Guizhou Provincial Key Laboratory of Public Big Data (2018BDKFJJ011); Tongren Science and Technology Bureau Project (Tongren Scientific Research [2022] No. 5).
Abstract: Training deep learning models on noisy data with corrupted labels is an active research topic in machine learning. Studies have shown that deep learning models are prone to overfitting when trained on noisy data. Recently, a method combining meta-learning with label correction has enabled models to better adapt to noisy data and mitigate overfitting; however, this meta-label correction approach depends on the model's own performance, and lightweight models do not generalize well under label noise. To address this problem, we combine meta-learning with online distillation and propose KDMLC (knowledge distillation-based meta-label correction learning), a lightweight noisy-label learning method. KDMLC treats the meta label correction (MLC) model, composed of a deep neural network and a multilayer perceptron, as a teacher that corrects noisy labels and guides the training of a lightweight student model, and it adopts a bi-level optimization strategy to train the teacher and strengthen its generalization ability, so that higher-quality pseudo-labels are generated for training the lightweight model. Experiments show that at high noise levels KDMLC improves accuracy by 5.50 percentage points over MLC; with Cutout data augmentation on the CIFAR10 dataset, KDMLC improves accuracy by 9.11 percentage points over MLC at high noise levels; and on the real-world noisy dataset Clothing1M, KDMLC also outperforms other methods, verifying its feasibility and effectiveness.
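The abstract describes a teacher-student setup: an MLC teacher (a deep network plus an MLP label corrector) produces corrected pseudo-labels, and a lightweight student is trained on them with online distillation under a bi-level optimization scheme. The PyTorch sketch below illustrates only the student-side distillation step with corrected soft labels; the LabelCorrector module, the loss weighting (alpha, temperature), and the stand-in linear models are hypothetical assumptions for illustration, and the bi-level (meta) update of the teacher on clean meta-data is omitted. It is a minimal sketch, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LabelCorrector(nn.Module):
    """Hypothetical MLP mapping teacher logits + noisy one-hot label to a corrected soft label."""
    def __init__(self, num_classes, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(2 * num_classes, hidden),
            nn.ReLU(),
            nn.Linear(hidden, num_classes),
        )

    def forward(self, teacher_logits, noisy_onehot):
        return F.softmax(self.net(torch.cat([teacher_logits, noisy_onehot], dim=1)), dim=1)

def student_distillation_step(student, teacher, corrector, x, noisy_y, num_classes,
                              optimizer, temperature=2.0, alpha=0.5):
    """One update of the lightweight student: cross-entropy on corrected pseudo-labels
    plus KL distillation from the teacher's temperature-softened logits."""
    with torch.no_grad():
        t_logits = teacher(x)                              # teacher predictions
        onehot = F.one_hot(noisy_y, num_classes).float()   # noisy labels as one-hot
        pseudo = corrector(t_logits, onehot)               # corrected (pseudo) soft labels
    s_logits = student(x)
    ce = -(pseudo * F.log_softmax(s_logits, dim=1)).sum(dim=1).mean()
    kd = F.kl_div(F.log_softmax(s_logits / temperature, dim=1),
                  F.softmax(t_logits / temperature, dim=1),
                  reduction="batchmean") * temperature ** 2
    loss = alpha * ce + (1 - alpha) * kd
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

# Toy usage with stand-in models (illustrative only).
num_classes = 10
teacher = nn.Linear(32, num_classes)    # stand-in for the DNN teacher
student = nn.Linear(32, num_classes)    # stand-in for the lightweight student
corrector = LabelCorrector(num_classes)
opt = torch.optim.SGD(student.parameters(), lr=0.1)
x = torch.randn(8, 32)
noisy_y = torch.randint(0, num_classes, (8,))
print(student_distillation_step(student, teacher, corrector, x, noisy_y, num_classes, opt))
```

The combined loss mirrors the general idea in the abstract: the corrected pseudo-labels carry the label-correction signal, while the KL term transfers the teacher's softened predictions to the student; in the paper's method the teacher and corrector would additionally be updated by the outer (meta) level of the bi-level optimization.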
Classification code (CLC): TP181 [Automation and Computer Technology / Control Theory and Control Engineering]