Multi-modal visual tracking:Review and experimental comparison  被引量:2

在线阅读下载全文

作  者:Pengyu Zhang Dong Wang Huchuan Lu 

机构地区:[1]Faculty of Electronic Information and Electrical Engineering,Dalian University of Technology,Dalian 116024,Chin

出  处:《Computational Visual Media》2024年第2期193-214,共22页计算可视媒体(英文版)

基  金:supported in part by National Natural Science Foundation of China(Nos.U23A20384 and 62022021);in part by Joint Fund of Ministry of Education for Equipment Pre-research(No.8091B032155);in part by the National Defense Basic Scientific Research Program(No.WDZC20215250205);in part by Central Guidance on Local Science and Technology Development Fund of Liaoning Province(No.2022JH6/100100026).

摘  要:Visual object tracking has been drawing increasing attention in recent years,as a fundamental task in computer vision.To extend the range of tracking applications,researchers have been introducing information from multiple modalities to handle specific scenes,with promising research prospects for emerging methods and benchmarks.To provide a thorough review of multi-modal tracking,different aspects of multi-modal tracking algorithms are summarized under a unified taxonomy,with specific focus on visibledepth(RGB-D)and visible-thermal(RGB-T)tracking.Subsequently,a detailed description of the related benchmarks and challenges is provided.Extensive experiments were conducted to analyze the effectiveness of trackers on five datasets:PTB,VOT19-RGBD,GTOT,RGBT234,and VOT19-RGBT.Finally,various future directions,including model design and dataset construction,are discussed from different perspectives for further research.

关 键 词:visual tracking object tracking multi-modal fusion RGB-T tracking RGB-D trackin 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象