面向X射线图像生成的遮罩增强扩散模型  

Mask-Enhanced Diffusion Model For X-Ray Image Generation

在线阅读下载全文

作  者:申京傲 李广明 王怀济 吴京 SHEN Jing’ao;LI Guangming;WANG Huaiji;WU Jing(School of Computer Science and Technology,Dongguan University of Technology,Dongguan 523808,China;School of Computer Science and Technology,Hainan University,Haikou 570228,China)

机构地区:[1]东莞理工学院计算机科学与技术学院,广东东莞523808 [2]海南大学计算机科学与技术学院,海南海口570228

出  处:《东莞理工学院学报》2024年第5期9-17,共9页Journal of Dongguan University of Technology

基  金:国家自然科学基金青年科学基金资助项目(62106046);广东大学生科技创新培育专项资金项目(Pdjh2002a0505)。

摘  要:目前用于生成X射线图像的方法中存在主体过拟合和背景欠拟合等问题,针对此类问题,基于去噪扩散概率模型DDPM(Denoising Diffusion Probability Model)提出了一种新型图像生成模型MDDPM(Masked DDPM),设计一种无监督图像分割方法对X射线图像进行分割,将分割后得到的二值图像作为遮罩加权到损失函数,增强扩散模型;设计一种含有增强型SE注意力块的卷积块ESE Block(Enhanced Squeeze-and-Excitation Block),结合注意力机制和上、下采样模块等搭建U-Net结构的神经网络,进一步提高网络的学习、表征和泛化能力。使用MDDPM在OPIXray数据集上验证了对X射线违禁品图像进行增广的可行性,针对五个类别的违禁品,实验结果表明,相比于DDPM,MDDPM的生成图像质量分布差异指标FID分别提升了18.3%、24.82%、32.85%、29.12%和33.62%。将使用本模型生成的图像与原始图像进行混合,与只使用原始图像进行图像分类实验相比,分类精确度提高了3.2%,此结果表明,生成的图像不仅保留了原始数据的特征,而且提高了数据高维特征的多样性。Current methods used to generate X-ray images have problems such as subject overfitting and background underfitting.To address the above problems,the paper proposes a new image generation model Masked DDPM(MDDPM)based on the Denoising Diffusion Probability Model(DDPM),designs an unsupervised image segmentation method to segment X-ray images,and uses the binary image obtained after segmentation as a mask to weight the loss function to enhance the diffusion model;in addition,a convolution block ESE Block(Enhanced Squeeze-and-Excitation Block)with enhanced SE attention block is designed,combining attention mechanism and up-and-down sampling modules to build a U-Net structured neural network to further improve the learning,representation and generalization ability of the network.The feasibility of augmenting X-ray contraband images was verified using MDDPM on the OPIXray data set.For five categories of contraband,the experimental results show that compared with DDPM,the generated image quality distribution difference index FID of MDDPM is improved respectively 18.3%,24.82%,32.85%,29.12%and 33.62%.The images generated by using this model are mixed with the original images.Compared with using only the original images for image classification experiments,the classification accuracy is increased by 3.2%.This result shows that the generated images not only retain the characteristics of the original data,but also improve the diversity of high-dimensional features of data.

关 键 词:扩散模型 数据增广 X射线图像 图像生成 

分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象