检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:文津 蒋凯元 韩禹洋 王志强[1] 罗乐琦 田文亮 Wen Jin;Jiang Kaiyuan;Han Yuyang;Wang Zhiqiang;Luo Leqi;Tian Wenliang(Beijing Electronic Science and Technology Institute,Beijing 100070;National Center for Public Credit Information,Beijing 100045)
机构地区:[1]北京电子科技学院,北京100070 [2]国家公共信用信息中心,北京100045
出 处:《信息安全研究》2024年第8期729-737,共9页Journal of Information Security Research
基 金:中国博士后科学基金面上项目(2019M650606);中央高校基本科研业务费专项资金项目(328202267,328202203,20230045Z0114);北京电子科技学院一流学科建设项目(3201012)。
摘 要:近年来,随着监控摄像头的不断增多和互联网的迅速发展,监控视频与网络视频越来越多,对视频进行自动行为冲突检测对降低人为审核导致的隐私信息泄露风险及维护社会治安、净化网络环境等具有重要意义.为了充分提取视频中的行为冲突特征,并获得有较好泛化能力与检测效果的模型,采用I3D(inflated 3D convolutional network)与VGGish,基于XD-Violence进行多模态特征的提取,并提出了基于Transformer和图卷积网络的行为冲突检测模型TG-BCDM(behavior conflict detection model based on Transformer and graph convolution networks).该模型包含Transformer编码器模块和图卷积模块,可以在有效捕捉视频中长距离依赖关系的同时,关注视频特征的全局信息和局部信息.经过实验证明,该模型优于现有的8种方法.In recent years,with the increasing number of surveillance cameras and the rapid development of the Internet,there are more and more surveillance and online videos.The automatic detection of behavior conflict in videos is of great significance to reduce the risk of privacy information leakage caused by human auditing,maintain social order and purify the environment online.To fully extract features of behavior conflict from videos and obtain models with good generalization ability and detection performance,we use I3D(inflated 3D convolutional network)and VGGish to extract multimodal features based on the XD-Violence dataset,and propose the behavior conflict detection model based on transformer and graph convolution networks(TG-BCDM)for behavior conflict detection.The model contains a Transformer encoder module and a graph convolution module,which can effectively capture the long-range dependencies in videos while paying attention to global and local information of video features.After experimental verification,the model outperforms eight existing methods.
关 键 词:突检测 动作识别 多模态特征融合 TRANSFORMER 图卷积网络
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.15