一种检测C&W对抗样本图像的盲取证算法被引量：1

Blind forensics of adversarial images generated by C&W algorithm

作　　者：邓康罗盛海彭安杰[1,2] 曾辉黄晓芳[1] DENG Kang;LUO Shenghai;PENG Anjie;ZENG Hui;HUANG Xiaofang(School of Computer Science and Technology,Southwest University of Science and Technology,Mianyang 621010,China;Guangdong Key Laboratory of Information Security Technology,Sun Yat-Sen University,Guangzhou 510275,China)

机构地区：[1]西南科技大学计算机科学与技术学院,中国绵阳621010 [2]中山大学广东省信息安全重点实验室,中国广州510275

出　　处：《信息安全学报》2020年第6期1-10,共10页Journal of Cyber Security

基　　金：国家自然科学基金(No.61702429);四川省科技厅基金(No.19yyjc1656);四川省教育厅基金(No.17ZB0450)资助。

摘　　要：对抗样本图像能欺骗深度学习网络,亟待对抗样本防御机制以增强深度学习模型的安全性。C&W攻击是目前较热门的一种白盒攻击算法,它产生的对抗样本具有图像质量高、可转移、攻击性强、难防御等特点。本文以C&W攻击生成的对抗样本为研究对象,采用数字图像取证的思路,力图实现C&W对抗样本的检测,拒绝对抗样本输入深度学习网络。基于对抗样本中的对抗扰动易被破坏的假设,我们设计了基于FFDNet滤波器的检测算法。具体来说,FFDNet是一种基于深度卷积网络CNN的平滑滤波器,它能破坏对抗扰动,导致深度学习模型对对抗样本滤波前后的输出不一致。我们判断输出不一致的待测图像为C&W对抗样本。我们在ImageNet-1000图像库上针对经典的ResNet深度网络生成了6种C&W对抗样本。实验结果表明本文方法能较好地检测C&W对抗样本。相较于已有工作,本文方法不仅极大地降低了虚警率,而且提升了C&W对抗样本的检测准确率。Adversarial images which can fool Deep neural networks have attracted researchers to focus on how to harden DNNs against adversarial attacks.Among typical attack algorithms,the C&W attack is one of the strongest attacks,which ensures the attack success rates yet causes less adversarial perturbations on the original image,and is taken as a benchmark in defense attempts.In this paper,we employ the blind forensic methodology to detect C&W adversarial images,which aims to avoid adversarial inputs for deep neural networks.Supposing that the adversarial perturbations are easily damaged by some image processing operations,we proposed a detecting method by using the fast and flexible de-noising convolution neural network called FFDNet.Specially,we compare the model’s prediction on the test image and its filtered version.If the original and filtered inputs produce substantially different outputs from the model,the test image is likely to be adversarial.We employ ResNet as the targeted network,and generate 6 kinds of C&W adversarial images on ImageNet-1000 database.Experimental results show that the proposed method is effective in the detection of C&W adversarial images,and outperforms state-of-the-arts in terms of false positive rates and true positive rates.

关键词：深度学习对抗样本数字图像取证图像滤波

分类号：TP309.2[自动化与计算机技术—计算机系统结构]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一种检测C&W对抗样本图像的盲取证算法被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一种检测C&W对抗样本图像的盲取证算法 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

一种检测C&W对抗样本图像的盲取证算法被引量：1