检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:代睿 徐鹏越 李洁[1] 何立火[1] DAI Rui;XU Pengyue;LI Jie;HE Lihuo(School of Electronic Engineering,Xidian University,Xi'an 710071,China)
机构地区:[1]西安电子科技大学电子工程学院,陕西西安710071
出 处:《西北工业大学学报》2024年第5期959-968,共10页Journal of Northwestern Polytechnical University
基 金:国家自然科学基金(U21A20514,62276203,62036007);陕西省重点产业创新链(2022ZDLGY01-14)资助。
摘 要:特征金字塔广泛应用于基于多尺度特征学习的图像理解任务中,最新多尺度特征学习侧重于特征在语义特征和细节特征的交互融合,特征金字塔通过相邻层特征插值和求和来补充多尺度信息语义特征和细节特征,由于非线性运算的存在和不同输出维数的卷积层,不同能级之间关系复杂,逐像素求和并不是最有效的方法。因此,提出了基于语义一致性监督金字塔网络的目标检测方法。该网络模型由多语义特征增强模块和非对称卷积侧接模块组成,其中非对称卷积侧接模块通过学习不同感受野的特征图,提升特征对各种姿态目标泛化性,多语义特征增强模块通过为高层特征图补全底层信息,提升高层特征的细节表达能力,同时在准确性和检测性能之间实现更好的权衡。在基准测试集MSCOCO上进行的实验结果表明,所提出的目标检测方法在不增加FLOPs的基础上,将检测平均精确度提高了2.6%,显著提高了目标检测的性能。Feature pyramid network is widely used in image understanding tasks based on multi-scale feature learning.The latest multi-scale feature learning focuses on the interactive integration of features in semantic features and detail features.Feature pyramid network complements multi-scale information semantic features and detail features through feature interpolation and summation of adjacent layers.Due to the existence of nonlinear operation and convolution layers with different output dimensions,the relationship among different levels is much more complex,and pixel by pixel summation is suboptimal method.A supervised feature pyramid network based on semantic consistency for object detection is proposed.The present method is composed of asymmetric convolution lateral connection and multi-scale semantic features augmentation.The asymmetric convolution lateral connection improves the generalization of features to various pose objects by learning the feature maps of different receptive fields.The multi-scale semantic features augmentation network improves the detail expression ability of high-level features by supplementing the low-level information for the high-level feature map.Moreover,the present method can provide a better trade-off between accuracy and detection performance.Experiments conduct on the MSCOCO dataset,and the results show that the proposed object detection method's accuracy is improved by 2.6% without increasing extra FLOPs.
分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.145.51.214