引入注意力机制的自监督光流计算被引量：3

Self-supervised optical flow estimation with attention module

作　　者：安峰[1] 戴军[1,2] 韩振[2] 严仲兴[1] AN Feng;DAI Jun;HAN Zhen;YAN Zhong-xing(School of Artificial Intelligence,Suzhou Industrial Park Institute of Services Outsourcing,Suzhou Jiangsu 215123,China;School of Economics&Management,Tongji University,Shanghai 210092,China)

机构地区：[1]苏州工业园区服务外包职业学院人工智能学院,江苏苏州215123 [2]同济大学经济与管理学院,上海210092

出　　处：《图学学报》2022年第5期841-848,共8页Journal of Graphics

基　　金：国家自然科学基金项目(71272048);江苏省高校“青蓝工程”优秀教学团队项目(苏教师函[2020]10号)。

摘　　要：光流计算是诸多计算机视觉系统的关键模块,广泛应用于动作识别、机器人定位与导航等领域。但目前端到端的光流计算仍受限于数据源的缺少,尤其是真实场景下的光流数据难以获取。人工合成的光流数据占绝大多数,且合成数据不能完全反应真实场景(如树叶晃动、行人倒影等),难以避免过拟合等情况。无监督或自监督方法可以利用海量的视频数据进行训练,摆脱了对数据集的依赖,是解决数据集缺少的有效途径。基于此搭建了一个自监督学习光流计算网络,其中的“Teacher”模块和“Student”模块集成了最新光流计算网络:稀疏相关体网络(SCV),减少了计算冗余量;同时引入注意力模型作为网络的一个节点,以提高图像特征在通道和空间上的维度属性。将SCV与注意力机制集成在自监督学习光流计算网络之中,在KITTI 2015数据集上的测试结果达到或超过了常见的有监督训练网络。Optical flow estimation is the key module of many computer vision systems,which is widely utilized in motion recognition,robot positioning,and navigation.However,due to the absence of labeled optical flow datasets of real scenes,synthetic datasets were used as the main training data sources,and synthetic data could not fully represent real scenes(such as leaf movement and pedestrian reflection).Unsupervised or self-supervised methods could employ a large amount of video data for training,and at the same time facilitate fine-tuning of supervised training,which was an effective way to solve the lack of datasets.In this paper,a self-supervised learning optical flow calculation network was constructed,in which the“Teacher”module and the“Student”module adopted sparse correlation volume(SCV)network to reduce the redundancy of correlation computation,and the attention model was introduced as a node of the network,in order to enhance the dimension attribute of image feature in terms of channel and space.This paper marks the first endeavor to implement a self-supervised optical flow computing network based on SCV.The test results on the KITTI 2015 dataset could reach or outperform those of the common supervised training networks such as FlowNet and LightFlowNet.

关键词：光流计算自监督学习卷积注意力模块空间/通道注意力稀疏相关体

分类号：TP242[自动化与计算机技术—检测技术与自动化装置]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

引入注意力机制的自监督光流计算被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

引入注意力机制的自监督光流计算 被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

引入注意力机制的自监督光流计算被引量：3