检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:肖景博 殷琪林 卢伟[1,2,3,4] 罗向阳 郭世泽[6] Jingbo XIAO;Qilin YIN;Wei LU;Xiangyang LUO;Shize GUO(School of Computer Science and Engineering,Sun Yat-sen University,Guangzhou 510006,China;Institute of Artificial Intelligence,Sun Yat-sen University,Zhuhai 519082,China;Key Laboratory of Information Technology,Ministry of Education,Guangzhou 510006,China;Guangdong Province Key Laboratory of Information Security Technology,Guangzhou 510006,China;State Key Laboratory of Mathematical Engineering and Advanced Computing,Zhengzhou 450002,China;School of Cyberspace Security,Beijing University of Posts and Telecommunications,Beijing 100876,China)
机构地区:[1]中山大学计算机学院,广州510006 [2]中山大学人工智能研究院,珠海519082 [3]信息技术教育部重点实验室,广州510006 [4]广东省信息安全技术重点实验室,广州510006 [5]数学工程与先进计算国家重点实验室,郑州450002 [6]北京邮电大学网络空间安全学院,北京100876
出 处:《中国科学:信息科学》2024年第11期2572-2588,共17页Scientia Sinica(Informationis)
基 金:国家自然科学基金(批准号:U2001202,U23A20305,62072480,62172435);广东省信息安全技术重点实验室(批准号:2023B1212060026)资助项目。
摘 要:随着深度伪造技术的快速发展,深度伪造视频在每一帧上表现得极为真实,现有检测方法难以有效识别出深度伪造视频.针对这一问题,本文首次提出了一种基于视频流谱特征空间的深度伪造检测方法.该方法基于流谱理论构建了一个视频流谱特征空间,通过视频流谱基底模型将视频流从视频特征隐空间映射到视频流谱特征空间,精准刻画视频流中不一致性信息,获取可分离度更高的视频流谱不一致性特征,从而实现深度伪造视频的检测.具体而言,首先提出了一种视频流谱特征空间的构建方法,通过对视频特征隐空间进行基底映射,得到一个近似同构的视频流谱特征描述空间,在视频流谱特征空间中融合视频流不同视角的高维表征,实现对视频流的精准刻画与分析;然后设计了一个视频不一致性流谱映射模型,通过视频流谱变换算子,从时序角度将视频流的空域信息聚合映射到视频流谱特征空间,建模深度伪造视频的不一致性信息,构建数据可分离度更高的视频表征.实验结果表明,所提方法在Celeb-DF数据集上达到99.23%的准确率,在DFDC数据集上达到95.24%的准确率.The rapid advancement of deepfake technology has led to the creation of deepfake videos that appear extremely realistic on each frame.Existing detection methods have struggled to effectively identify deepfake videos.To tackle this issue,a deepfake detection method based on video flow spectrum feature space is proposed for the first time in this paper.The video flow spectrum feature space is constructed using flow spectrum theory,which maps the spatio-temporal information in the video to the video flow spectrum feature space through the video flow spectrum basis model.This approach better captures the motion inconsistency of the video and obtains a more discriminative video representation,enabling the detection of deepfake videos.Specifically,the paper proposes a method for constructing the video flow spectrum feature space,which obtains an approximately isomorphic video flow spectrum feature description space by basis-mapping the video feature hidden space.It also fuses the high-dimensional representations of different perspectives of the video stream in the video flow spectrum feature space to achieve accurate portrayal and analysis of video streams.Furthermore,a video inconsistency flow spectrum map model is designed to map the spatial information of the video stream into the video flow spectrum feature space from the temporal perspective using the video flow spectrum transform operator.This model effectively captures the inconsistency information of deepfake videos and constructs the video representation with higher data separability.Experimental results demonstrate that the proposed method achieves an accuracy of 99.23%on the Celeb-DF dataset and 95.24%on the DFDC dataset.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.142.200.134