检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:职为梅[1] 常智 卢俊华 耿正乾 ZHI Weimei;CHANG Zhi;LU Junhua;GENG Zhengqian(School of Computer and Artificial Intelligence,Zhengzhou University,Zhengzhou 450001,China)
机构地区:[1]郑州大学计算机与人工智能学院,郑州450001
出 处:《电子与信息学报》2024年第11期4208-4218,共11页Journal of Electronics & Information Technology
基 金:国家重点研发计划(2023YFC2206404)。
摘 要:许多适用于低维数据的传统不平衡学习算法在图像数据上的效果并不理想。基于生成对抗网络(GAN)的过采样算法虽然可以生成高质量图像,但在类不平衡情况下容易产生模式崩溃问题。基于自编码器(AE)的过采样算法容易训练,但生成的图像质量较低。为进一步提高过采样算法在不平衡图像中生成样本的质量和训练的稳定性,该文基于生成对抗网络和自编码器的思想提出一种融合自编码器和生成对抗网络的过采样算法(BAEGAN)。首先在自编码器中引入一个条件嵌入层,使用预训练的条件自编码器初始化GAN以稳定模型训练;然后改进判别器的输出结构,引入一种融合焦点损失和梯度惩罚的损失函数以减轻类不平衡的影响;最后从潜在向量的分布映射中使用合成少数类过采样技术(SMOTE)来生成高质量的图像。在4个图像数据集上的实验结果表明该算法在生成图像质量和过采样后的分类性能上优于具有辅助分类器的条件生成对抗网络(ACGAN)、平衡生成对抗网络(BAGAN)等过采样算法,能有效解决图像数据中的类不平衡问题。Many traditional imbalanced learning algorithms suitable for low-dimensional data do not perform well on image data.Although the oversampling algorithm based on Generative Adversarial Networks(GAN)can generate high-quality images,it is prone to mode collapse in the case of class imbalance.Oversampling algorithms based on AutoEncoders(AE)are easy to train,but the generated images are of lower quality.In order to improve the quality of samples generated by the oversampling algorithm in imbalanced images and the stability of training,a Balanced oversampling method with AutoEncoders and Generative Adversarial Networks(BAEGAN)is proposed in this paper,which is based on the idea of GAN and AE.First,a conditional embedding layer is introduced in the Autoencoder,and the pre-trained conditional Autoencoder is used to initialize the GAN to stabilize the model training;then the output structure of the discriminator is improved,and a loss function that combines Focal Loss and gradient penalty is proposed to alleviate the impact of class imbalance;and finally the Synthetic Minority Oversampling TEchnique(SMOTE)is used to generate highquality images from the distribution map of latent vectors.Experimental results on four image data sets show that the proposed algorithm is superior to oversampling methods such as Auxiliary Classifier Generative Adversarial Networks(ACGAN)and BAlancing Generative Adversarial Networks(BAGAN)in terms of image quality and classification performance after oversampling and can effectively solve the class imbalance problem in image data.
关 键 词:不平衡图像数据 过采样 生成对抗网络 对抗自编码器 合成少数类过采样技术
分 类 号:TN911.73[电子电信—通信与信息系统] TP181[电子电信—信息与通信工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222