Authors: DONG Shi; ZHAO Guorui; GOU Hao; WEN Jian [1,2]; LIN Chen (School of Technology, Beijing Forestry University, Beijing 100083, China; Key Lab of State Forestry and Grassland Administration on Forestry Equipment and Automation, Beijing 100083, China)
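Affiliations: [1] School of Technology, Beijing Forestry University, Beijing 100083, China; [2] Key Lab of State Forestry and Grassland Administration on Forestry Equipment and Automation, Beijing 100083, China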
Source: Transactions of the Chinese Society of Agricultural Engineering (《农业工程学报》), 2025, No. 1, pp. 212-220 (9 pages)
Funding: National Natural Science Foundation of China (32071679).
Abstract: Cucumbers grow in unstructured environments, and harvesting them by hand is costly and labor-intensive. Cucumber-picking robots are expected to reduce the manpower required in modern agriculture, and the picking efficiency of such a robot depends largely on how accurately and quickly its vision system recognizes the fruit. This study aims to achieve selective picking of fruit in complex environments, such as changing light conditions. Taking cucumber as the research object and RT-Detr as the baseline network, it proposes the RT-Detr-EV model. First, a RepVGG (re-parameterization VGG) module is added to the backbone network. The module uses a multi-branch structure during training to strengthen feature extraction and improve recognition accuracy, and the branches are merged during inference, which reduces network complexity and the amount of computation. Second, a lightweight cascaded group self-attention module is added to the neck network, which reduces computational overhead, raises detection speed, and increases the depth of the network. Finally, MPDIoU (minimum point distance based intersection over union) replaces the loss function of the original model for the bounding-box regression loss, which accelerates convergence and improves detection accuracy. The results show that the improved RT-Detr-EV reaches a mean average precision (mAP50) of 95.8% and a detection speed of 61.3 frames/s, which are 3.2 percentage points and 17.4 frames/s higher than the original model, respectively. Compared with YOLOv7-X and YOLOv8-l, the accuracy of identifying cucumbers that are not suitable for picking increases by 4.6 and 6.5 percentage points, the detection speed increases by 40.6 and 25 frames/s, and the number of parameters decreases by 55.5% and 27.3%, respectively. The experiments also show that the model has a degree of robustness and generalization ability in picking scenes with varied lighting conditions. The proposed RT-Detr-EV model can meet the real-time detection requirements for cucumber fruit in complex growing environments and can provide technical support for subsequent research on mobile selective picking.
Keywords: image recognition; object detection; cucumber; selective picking; RT-Detr; cascaded group self-attention mechanism
Classification code: S126 [Agricultural Sciences: Fundamentals of Agricultural Science]
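Illustrative note: the abstract names MPDIoU as the replacement bounding-box regression loss but does not give its form. The sketch below follows the commonly cited definition of MPDIoU (IoU minus the squared distances between the two boxes' top-left corners and bottom-right corners, each normalized by the squared diagonal of the input image). It is a minimal sketch and not the authors' implementation; the function names `mpdiou` and `mpdiou_loss`, the `(x1, y1, x2, y2)` box format, and the example image size are assumptions made for illustration.

```python
def mpdiou(box_a, box_b, img_w, img_h):
    """MPDIoU between two axis-aligned boxes.

    Boxes are (x1, y1, x2, y2) with x1 < x2 and y1 < y2 (assumed format).
    img_w and img_h are the input image width and height used for normalization.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Intersection area (zero when the boxes do not overlap).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    # Union area and plain IoU.
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    iou = inter / union if union > 0 else 0.0

    # Squared distances between matching corners.
    d1_sq = (ax1 - bx1) ** 2 + (ay1 - by1) ** 2  # top-left corners
    d2_sq = (ax2 - bx2) ** 2 + (ay2 - by2) ** 2  # bottom-right corners

    # Normalize by the squared image diagonal.
    norm = img_w ** 2 + img_h ** 2
    return iou - d1_sq / norm - d2_sq / norm


def mpdiou_loss(box_a, box_b, img_w, img_h):
    """Regression loss: 0 for identical boxes, larger as the boxes diverge."""
    return 1.0 - mpdiou(box_a, box_b, img_w, img_h)


if __name__ == "__main__":
    predicted = (0, 0, 10, 10)
    target = (5, 5, 15, 15)
    print(mpdiou_loss(predicted, target, img_w=100, img_h=100))
```

Unlike plain IoU, the corner-distance terms keep the loss informative when the predicted and target boxes do not overlap at all, which is one plausible reason such a loss can speed up convergence as the abstract reports.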