Authors: SHEN Jing'ao [1]; LI Guangming [1]; WANG Huaiji [1]; WU Jing [2]
Affiliations: [1] School of Computer Science and Technology, Dongguan University of Technology, Dongguan 523808, Guangdong, China; [2] School of Computer Science and Technology, Hainan University, Haikou 570228, Hainan, China
Source: Journal of Dongguan University of Technology (《东莞理工学院学报》), 2024, No. 5, pp. 9-17 (9 pages)
Funding: National Natural Science Foundation of China, Young Scientists Fund (62106046); Guangdong Special Fund for the Cultivation of University Students' Science and Technology Innovation (Pdjh2002a0505).
Abstract: Current methods for generating X-ray images suffer from overfitting of the main subject and underfitting of the background. To address these problems, the paper proposes a new image generation model, Masked DDPM (MDDPM), based on the Denoising Diffusion Probabilistic Model (DDPM). An unsupervised image segmentation method is designed to segment X-ray images, and the binary image obtained from segmentation is used as a mask to weight the loss function, strengthening the diffusion model. In addition, a convolution block with an enhanced SE attention block, the ESE Block (Enhanced Squeeze-and-Excitation Block), is designed and combined with attention mechanisms and up- and down-sampling modules to build a U-Net-structured neural network, further improving the learning, representation, and generalization ability of the network. The feasibility of augmenting X-ray contraband images with MDDPM is verified on the OPIXray dataset. For the five contraband categories, the experimental results show that, compared with DDPM, MDDPM improves FID (a measure of the difference between the distributions of generated and real images) by 18.3%, 24.82%, 32.85%, 29.12%, and 33.62%, respectively. When images generated by the model are mixed with the original images, classification accuracy increases by 3.2% compared with image classification experiments that use only the original images. This result indicates that the generated images not only retain the characteristics of the original data but also increase the diversity of the data's high-dimensional features.
Classification number: TP391.4 [Automation and Computer Technology: Computer Application Technology]
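
The abstract describes weighting the diffusion loss with a binary segmentation mask but does not give the formula. The following is a minimal sketch of one way such a mask-weighted noise-prediction loss could look, not the authors' implementation. The function name `masked_noise_loss`, the parameters `fg_weight` and `bg_weight`, and the choice to normalize by the sum of the weights are all assumptions made for illustration.

```python
import numpy as np


def masked_noise_loss(eps_true, eps_pred, mask, fg_weight=2.0, bg_weight=1.0):
    """Mask-weighted mean squared error between true and predicted noise.

    mask is a binary array (1 = segmented object, 0 = background) with the
    same shape as the noise arrays. fg_weight and bg_weight are hypothetical
    values; the abstract does not state how the mask enters the loss.
    """
    eps_true = np.asarray(eps_true, dtype=np.float64)
    eps_pred = np.asarray(eps_pred, dtype=np.float64)
    mask = np.asarray(mask, dtype=np.float64)

    # Per-pixel weight: fg_weight inside the mask, bg_weight outside it.
    weights = fg_weight * mask + bg_weight * (1.0 - mask)
    squared_error = (eps_true - eps_pred) ** 2

    # Weighted average, so equal weights reduce to the plain MSE.
    return float(np.sum(weights * squared_error) / np.sum(weights))


# Toy example: a 4x4 "image" whose central 2x2 region is the object.
mask = np.zeros((4, 4))
mask[1:3, 1:3] = 1.0
eps_true = mask.copy()          # error occurs only inside the object
eps_pred = np.zeros((4, 4))

plain = masked_noise_loss(eps_true, eps_pred, mask, fg_weight=1.0, bg_weight=1.0)
weighted = masked_noise_loss(eps_true, eps_pred, mask, fg_weight=2.0, bg_weight=1.0)
```

With equal weights the function reduces to the ordinary DDPM mean squared error on the noise. Which region should receive the larger weight depends on whether the subject or the background needs more emphasis, which the abstract leaves unspecified.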