检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]武汉大学测绘遥感信息工程国家重点实验室,湖北 武汉 [2]武汉大学遥感信息工程学院,湖北 武汉 [3]国家海洋局南海规划与环境研究院,广东 广州 [4]自然资源部海洋环境探测技术与应用重点实验室,广东 广州
出 处:《测绘科学技术》2023年第2期122-134,共13页Geomatics Science and Technology
摘 要:本文针对街景图像立面元素检测问题,提出了融合空间结构权重优化注意力机制的立面元素目标检测网络。在主干网络部分使用嵌入基于空间结构优化坐标注意力机制的C3模块,增加横纵坐标权重分支,有效利用空间结构编码信息,提升立面元素定位精度;其次针对立面最主要组成元素窗户、阳台的小目标特性,使用改进的递归门控卷积模块替换原始卷积模块,融合丰富的多尺度上下文信息,并增加小目标检测分支,提升检测精度;最后设计了ECIOU损失同时对检测框的长宽比以及定位中心进行监督,增强网络对立面元素的感知能力,提升网络收敛速度。在FacadeWHU数据集上实验结果表明,本文模型的平均精度比相较于基线网络Yolov5s而言整体平均精度提升了16.4%,窗户目标的平均精度提升了22.4%,阳台目标的平均精度提升了25.5%,可以有效检测立面元素,更好的服务于病害检测、能耗分析等下游任务。Aiming at the problem of facade element detection in street view image, this paper proposes a fa-cade element object detection network integrating spatial structure weight optimization mecha-nism. C3 module embedded in the coordinate attention mechanism based on spatial structure op-timization is used in the backbone network to increase the weight branches of horizontal and verti-cal coordinates, effectively use the spatial structure coding information, and improve the position-ing accuracy of elevation elements. Secondly, in view of the small target characteristics of Windows and balconies, which are the main components of the facade, an improved recursive gated convolu-tional module is used to replace the original convolutional module, integrate rich multi-scale con-text information, and add small target detection branches to improve detection accuracy. Finally, ECIOU loss is designed to supervise the aspect ratio of the detection frame and the positioning cen-ter, which enhances the perception ability of the opposite elements of the network and improves the convergence speed of the network. Experimental results on Facade WHU data set show that compared with baseline network yolov5s, the average accuracy of the proposed model is improved by 16.4% overall, 22.4% for window target and 25.5% for balcony target, which can effectively de-tect facade elements. Better service for disease analysis, energy consumption analysis and other downstream tasks.
关 键 词:注意力机制 主干网络 检测网络 上下文信息 精度提升 元素检测 编码信息 病害检测
分 类 号:TP3[自动化与计算机技术—计算机科学与技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222