级联特征融合孪生网络目标跟踪算法研究  被引量:7

Research on Object Tracking Algorithm Based on Cascading Feature Fusion of Siamese Network

在线阅读下载全文

作  者:韩明 王景芹[2] 王敬涛[1] 孟军英 HAN Ming;WANG Jingqin;WANG Jingtao;MENG Junying(School of Computer Science and Engineering,Shijiazhuang University,Shijiazhuang 050035,China;State Key Laboratory of Reliability and Intelligence of Electrical Equipment,Hebei University of Technology,Tianjin 300130,China)

机构地区:[1]石家庄学院计算机科学与工程学院,石家庄050035 [2]河北工业大学省部共建电工装备可靠性和智能化国家重点实验室,天津300130

出  处:《计算机工程与应用》2022年第6期208-218,共11页Computer Engineering and Applications

基  金:河北省高等学校科学技术研究重点项目(ZD2020405);河北省“三三三人才工程”(A202101102);石家庄市科学技术研究与发展计划(201130181A)。

摘  要:在光照变化、遮挡、背景相似、变形等复杂情况下,目标跟踪过程中难以精确地提取丰富的特征信息,容易导致目标跟踪出现漂移或者跟踪丢失。由于多层神经网络的浅层特征具有高分辨率,适合于目标定位;深层特征具有丰富的语义信息,适合于目标分类。充分利用这一优势,提出了一种级联特征融合的孪生网络目标跟踪算法。对ResNet-50网络进行改进,在减少模型参数和计算量的同时提高跟踪速度;采用级联特征融合策略将ResNet-50最后一阶段的3层特征进行逐级级联融合,进行目标深层语义信息和浅层空间信息的有效提取,实现目标的多特征准确表示。针对目标跟踪过程中大多数算法仅利用第一帧作为目标模板导致跟踪过程中目标模板退化问题,引入模板更新机制,利用相似度阈值法进行模板的实时更新。在OBT2015、VOT2016和VOT2018标准数据集上进行对比实验,实验结果表明,该算法的跟踪精度较高,复杂场景下鲁棒性较强,相对于其他算法有较强的竞争优势。It is difficult to accurately extract rich feature information in the process of target tracking under complex environments such as illumination variation, occlusion, background clutters and deformation, which is easy to lead to the object shift or tracking loss. Because the low-level features have high resolution of multilayer neural network, which is suitable for positioning the object. While the high-level features have rich semantic information and are suitable for object classification. To take full use of the advantage of the multilayer neural network, the siamese network algorithm of cascading feature fusion for object tracking is proposed. The ResNet-50 network is improved, which is reduced the model parameters and computation, and the tracking speed is improved. The cascade feature fusion strategy is adopted to cascade the three layers of features in the last stage of ResNet-50, and to effectively extract the high-level semantic information and low-level spatial information of the object, so as to achieve the accurate multi-feature representation of the object. In the process of object tracking, only the first frame is used as the object template most of the algorithm, which leads to the object template degradation. The template update mechanism is introduced, and the similarity threshold method is used to update the template in real time. The extensive comparative experiments are conducted on the OBT2015, VOT2016 and VOT2018.The experimental results show that the proposed algorithm has higher tracking accuracy and stronger robustness in complex scenes, and has a stronger competitive advantage compared with other algorithms.

关 键 词:计算机视觉 目标跟踪 孪生网络 特征融合 模板更新 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象