基于事件的端到端视觉位置识别弱监督网络架构被引量：2

An End-to-End Weakly Supervised Network Architecture for Event-based Visual Place Recognition

作　　者：孔德磊方正李昊佳侯宽旭姜俊杰 KONG Delei;FANG Zheng;LI Haojia;HOU Kuanxu;JIANG Junjie(Faculty of Robot Science and Engineering,Northeastern University,Shenyang 110169,China;Science and Technology on Near-Surface Detection Laboratory,Wuxi 214000,China)

机构地区：[1]东北大学机器人科学与工程学院,辽宁沈阳110169 [2]近地面探测技术重点实验室,江苏无锡214000

出　　处：《机器人》2022年第5期613-625,共13页Robot

基　　金：国家自然科学基金(62073066,U20A20197);近地面探测技术重点实验室项目(6142414200208);中央高校基本科研业务专项资金(N2226001);辽宁省科技重大专项计划(2019JH1/10100026)。

摘　　要：传统的视觉位置识别(VPR)方法通常使用基于图像帧的相机,存在剧烈光照变化、快速运动等易导致VPR失败的问题。针对上述问题,本文提出了一种使用事件相机的端到端VPR网络,可以在具有挑战性的环境中实现良好的VPR性能。所提出算法的核心思想是,首先采用事件脉冲张量(EST)体素网格对事件流进行表征,然后利用深度残差网络进行特征提取,最后采用改进的局部聚合描述子向量(VLAD)网络进行特征聚合,最终实现基于事件流的端到端VPR。将该方法在基于事件的驾驶数据集(MVSEC、DDD17)和人工合成的事件流数据集(Oxford RobotCar)上与典型的基于图像帧的视觉位置识别方法进行了比较实验。结果表明,在具有挑战性的场景(例如夜晚场景)中,本文方法的性能优于基于图像帧的视觉位置识别方法,其Recall@1指标提升约6.61%。据我们所知,针对视觉位置识别任务,这是首个直接处理事件流数据的端到端弱监督深度网络架构。Frame-based cameras are generally used in traditional visual place recognition(VPR) methods, which often causes failure of VPR in the cases of dramatic illumination changes or fast motion. To overcome this, an end-to-end VPR network using event cameras is proposed, which can achieve good VPR performance in challenging environments. The key idea of the proposed algorithm is to firstly characterize the event streams with the event spike tensor(EST) voxel grid,then extract features using a deep residual network, and finally aggregate features using an improved VLAD(vector of locally aggregated descriptor) network to realize end-to-end VPR using event streams. Comparison experiments among the proposed method and classical VPR methods are carried out on the event-based driving datasets(MVSEC, DDD17) and the synthetic event stream datasets(Oxford RobotCar). As results, the performance of the proposed method is better than that of framebased VPR methods in challenging scenarios(such as night scenes), with an approximately 6.61% improvement in Recall@1index. To our knowledge, for visual place recognition task, this is the first end-to-end weakly supervised deep network architecture that directly processes event stream data.

关键词：视觉位置识别(VPR) 事件相机事件脉冲张量(EST) 深度残差网络三元组排序损失

分类号：TP24[自动化与计算机技术—检测技术与自动化装置]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于事件的端到端视觉位置识别弱监督网络架构被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于事件的端到端视觉位置识别弱监督网络架构 被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于事件的端到端视觉位置识别弱监督网络架构被引量：2