基于半监督CST的湿地场景下细粒度鸟类检测  

Detecting fine-grained bird images in wetland using semi-supervised learning with CST module

在线阅读下载全文

作  者:赵玥[1,2,3,4] 徐钐钐 韩巧玲[1,2,3,4] 刘卫平 郑一力[1,2,3,4] 赵燕东 唐延龄[1] ZHAO Yue;XU Shanshan;HAN Qiaoling;LIU Weiping;ZHENG Yili;ZHAO Yandong;TANG Yanling(School of Technology,Beijing Forestry University,Beijing 100083,China;Beijing Laboratory of Urban and Rural Ecological Environment,Beijing Municipal Education Commission,Beijing100083,China;Key Lab of State Forestry Administration for Forestry Equipment and Automation,Beijing 100083,China;Research Center for Intelligent Forestry,Beijing Forestry University,Beijing 100083,China)

机构地区:[1]北京林业大学工学院,北京100083 [2]北京市教育委员会城乡生态环境北京实验室,北京100083 [3]国家林业局林业装备与自动化重点实验室,北京100083 [4]北京林业大学智慧林业研究中心,北京100083

出  处:《农业工程学报》2025年第6期185-194,共10页Transactions of the Chinese Society of Agricultural Engineering

基  金:国家自然科学基金面上项目(32071838);国家自然科学基金青年科学基金项目(32101590);北京林业大学“5·5工程”科研创新团队项目(BLRC2023C05)。

摘  要:针对细粒度鸟类检测的数据标注成本高,以及湿地地区鸟类种类繁多、现实场景复杂化等引起的湿地鸟类检测精度低的问题,该研究提出一种基于半监督CST的湿地场景下的细粒度鸟类检测算法(semi-supervised bird detection with CNN and swin transformer,SSBY-CST),首先基于北京14处监测站在不同湿地场景下采集到的图像,构建了涵盖17种鸟类图像数据集,为模型鲁棒性提供可靠数据支撑。其次提出基于伪标签学习法的单阶段半监督学习框架,基于Yolov5主干网络构建教师学生模型,高效利用无标签数据提升检测性能;训练阶段使用双阈值伪标签分配策略替代传统单一阈值伪标签分配,以优化无监督损失函数。然后设计了结合CNN和Swin Transformer的双通道卷积模块CST,以提高不同类别鸟类与湿地背景的区分能力。试验结果表明,仅在100张标注图像下,该文SSBY-CST算法对17种复杂环境下鸟类的检测精准率和mAP@0.5分别为77.5%和58.2%,相比同时期较先进的YOLO模型提升了17.4个百分点和15.5个百分点,在少量标注的前提下实现了较高的检测性能提升,其中黑鹳、西伯利亚银鸥的m AP@0.5分别达到了95.7%和94.5%,相比基线提升了24.9个百分点和14.3个百分点。此外,消融试验分析了双阈值伪标签分配的作用及CST模块的效果,验证了双阈值伪标签分配与CST模块设计的有效性。该框架利用无标注样本在极少量标注量下提升复杂环境下细粒度鸟类检测性能,以加强农林生态的智能数字化管理。该文将半监督扩展到细粒度鸟类检测,为处理农林生态环境下的鸟类检测提供了技术路径。Fine-grained detection of bird species has been confined to the high cost of data annotation during imaging.The low accuracy of detection can also be attributed to the diversity of bird species under the complex environments in wetlands.In this study,a semi-supervised CST-based algorithm was proposed to detect the fine-grained images of the birds in the wetland scenes,termed SSBY-CST(semi-supervised bird detection with CNN and Swin Transformer).The unlabeled samples were also utilized to enhance the fine-grained performance during bird detection in complex environments with minimal labeled data.This framework also facilitated the intelligent digital management of agroforestry ecosystems.The core contributions of this research were as follows.1)Data Collection and Dataset Construction.The images were first collected from 14 monitoring stations in Beijing,China.A dataset was then created under different wetland environments.The dataset included 17 species of birds,thus offering reliable data support for the robustness of the model.The variability and diversity were obtained to train a robust detection model under real-world conditions,particularly in wetland habitats where environmental conditions were challenging and variable.A single-stage Semi-Supervised Learning Framework was also proposed using Pseudo-Labeling.A teacher-student model was constructed using the Yolov5 backbone network.The unlabeled data was efficiently utilized to improve the performance of detection.A dual-threshold pseudo-label assignment was introduced to replace the traditional single-threshold during training.The unsupervised loss function was optimized to effectively reduce the impact of low-quality pseudo-labels.The overall accuracy of the model was improved to minimize the reliance on the large amounts of annotated data.The labeled and unlabeled data were combined for the generalization and robustness of the improved model,particularly for the condition with scarce labeled data.2)Dual-Channel Convolution Module(CST).A dual-channel convol

关 键 词:鸟类检测 深度学习 半监督学习 目标检测 注意力模块 

分 类 号:S718.5[农业科学—林学] S718.6

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象