Authors: YI Qingming (易清明)[1,2], WANG Yu (王渝), SHI Min (石敏), LUO Aiwen (骆爱文)[1]
Affiliations: [1] School of Information Science and Technology, Jinan University, Guangzhou 510632, China; [2] Taidou Microelectronic Science and Technology Co., Ltd., Guangzhou 510663, China
Source: Journal of University of Electronic Science and Technology of China (《电子科技大学学报》), 2024, No. 3, pp. 366-375 (10 pages)
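Funding: National Natural Science Foundation of China (62002134); Guangdong Basic and Applied Basic Research Foundation (2020A1515110645, 2023A1515010834); Guangdong Provincial Key Laboratory of New Semiconductors and Devices for Regular Higher Education Institutions (2021KSY001); Yangcheng Innovation and Entrepreneurship Leading Talent Support Program (2019019); Guangdong Special Fund for Science and Technology Innovation Strategy (Cultivation of University Students' Science and Technology Innovation) (pdjh2023b0061).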
Abstract: Semantic segmentation is one of the basic technologies in the field of scene understanding. Existing semantic segmentation networks usually have complex structures, a large number of parameters, excessive loss of image feature information, and low computational efficiency. To address these problems, this work proposes a lightweight semantic segmentation network, MLWP-Net (Multi-Link Wavelet-Pooled Network), which combines multi-link feature encoding and decoding with wavelet pooling, based on the encoder-decoder framework and the Discrete Wavelet Transform (DWT). In the encoding stage, a lightweight feature extraction bottleneck is designed by combining a multi-link strategy with depthwise separable convolution, dilated convolution, and channel compression; in addition, a low-frequency-mixed wavelet pooling operation replaces the traditional downsampling operation, which effectively reduces the information loss caused by encoding. In the decoding stage, a multi-branch parallel dilated convolutional decoder fuses multi-level features from the encoder and recovers the image resolution in parallel. Experimental results show that MLWP-Net achieves 74.1% and 68.2% mIoU on the Cityscapes and CamVid datasets, respectively, with only 0.74M parameters, which demonstrates its effectiveness for semantic segmentation.
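Keywords: real-time semantic segmentation; lightweight neural network; multi-link feature fusion; wavelet pooling; multi-branch dilated convolution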
Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]
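
The abstract mentions replacing conventional downsampling with a wavelet pooling operation built on the DWT. The record does not specify how the "low-frequency-mixed" variant is constructed, so the following is only a minimal sketch of the general idea, assuming a single-level orthonormal Haar transform on a single-channel feature map and keeping just the low-frequency (LL) subband. The function names `haar_dwt2` and `haar_ll_pool` are hypothetical and are not taken from the paper.

```python
import numpy as np


def haar_dwt2(x: np.ndarray):
    """Single-level 2D orthonormal Haar DWT of an (H, W) array with even H and W.

    Returns four (H/2, W/2) subbands: the low-frequency average and three
    detail bands. Illustrative only; not the paper's implementation.
    """
    h, w = x.shape
    if h % 2 or w % 2:
        raise ValueError("height and width must both be even")
    x = x.astype(np.float64)
    a = x[0::2, 0::2]  # top-left sample of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0        # low-frequency approximation
    row_detail = (a + b - c - d) / 2.0  # difference between the two rows
    col_detail = (a - b + c - d) / 2.0  # difference between the two columns
    diag_detail = (a - b - c + d) / 2.0  # diagonal difference
    return ll, row_detail, col_detail, diag_detail


def haar_ll_pool(x: np.ndarray) -> np.ndarray:
    """Downsample by 2 in each dimension by keeping only the LL subband."""
    return haar_dwt2(x)[0]


if __name__ == "__main__":
    feature_map = np.arange(16).reshape(4, 4)
    print(haar_ll_pool(feature_map))
```

Unlike strided convolution or max pooling, the transform itself is invertible: the four subbands together preserve the energy of the input, so which subbands are kept or mixed determines how much information the downsampling step discards.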