Authors: Ren Jingjing (任晶晶) [1]; Zhang Xiaoyong (张小勇) [1]; Jia Weikuan (贾伟宽) (Department of Intelligence and Information Engineering, Taiyuan University, Taiyuan, 030032, China; School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, China)
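Affiliations: [1] Department of Intelligence and Information Engineering, Taiyuan University, Taiyuan 030032; [2] School of Information Science and Engineering, Shandong Normal University, Jinan 250358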
Source: Journal of Chinese Agricultural Mechanization (《中国农机化学报》), 2025, No. 3, pp. 182-187 (6 pages)
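Funding: National Natural Science Foundation of China, General Program (62372278); Shanxi Province Higher Education Science and Technology Innovation Project (2024L386); Natural Science Foundation of Shandong Province (ZR2020MF076).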
Abstract: Fruit detection is a crucial sub-task in smart agriculture, and its accuracy directly affects the efficiency of intelligent orchard operations. Current feature extraction networks, typified by convolutional neural networks, extract features only from local receptive fields. This limits the detection of fruits that are occluded by branches and leaves or that overlap one another, and leads to low detection accuracy. To improve the detection accuracy of occluded fruits, this study proposes an occlusion-resistant, optimized FoveaBox fruit detection model. First, the Swin Transformer is adopted as the backbone network. It computes the similarity between patches and so extracts multi-granularity hierarchical features from a global receptive field, which removes the restriction of traditional convolution to local regions and strengthens the representational capacity of the feature maps. Second, a Feature Pyramid Network aggregates shallow, high-resolution features with high-level semantic information through lateral connections and a top-down structure, and outputs pyramidal feature maps. Third, the pyramidal feature maps are fed into the Fovea head network, where a classification sub-network and a bounding box sub-network perform the detection. Finally, the model is iteratively optimized with Focal Loss and the Smooth L1 loss until it converges. Experimental results show that the optimized model reaches an average precision of 86.3% at an IoU threshold of 0.5, outperforming advanced models such as FCOS, TOOD, and LAD. The proposed occlusion-resistant FoveaBox improves the detection accuracy of occluded targets to a certain extent.
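As a rough illustration of the quantities named in the abstract, the sketch below gives textbook forms of Focal Loss, Smooth L1, and box IoU in plain Python. It is not the authors' implementation: the function names, the `alpha`, `gamma`, and `beta` defaults, and the `(x1, y1, x2, y2)` box layout are assumptions chosen for illustration, since the record does not state the settings used in the paper.

```python
import math


def focal_loss(p, y, alpha=0.25, gamma=2.0, eps=1e-12):
    """Binary focal loss for one predicted probability p and label y in {0, 1}.

    alpha and gamma are common defaults, assumed here (not from the record).
    """
    p_t = p if y == 1 else 1.0 - p              # probability given to the true class
    alpha_t = alpha if y == 1 else 1.0 - alpha  # class-balancing weight
    p_t = min(max(p_t, eps), 1.0)               # clamp to avoid log(0)
    # (1 - p_t) ** gamma down-weights examples that are already well classified
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


def smooth_l1(x, beta=1.0):
    """Smooth L1 on one regression residual x: quadratic near 0, linear beyond beta."""
    ax = abs(x)
    return 0.5 * ax * ax / beta if ax < beta else ax - 0.5 * beta


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred_box, gt_box, threshold=0.5):
    """A prediction counts as a match when its IoU with the ground truth meets the threshold."""
    return iou(pred_box, gt_box) >= threshold
```

With these definitions, a confident correct prediction contributes far less classification loss than a confident wrong one, and the 0.5 threshold in `is_true_positive` corresponds to the IoU setting at which the abstract reports average precision.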
Keywords: occluded apple detection; multi-granularity feature perception; FoveaBox; Swin Transformer; region similarity computation
Classification code: S126 [Agricultural Science: Basic Agricultural Science]