联合多连接特征编解码与小波池化的轻量级语义分割

Lightweight Semantic Segmentation by Combining Multi-Link Feature Codec with Wavelet Pooling

作　　者：易清明[1,2] 王渝石敏骆爱文[1] YI Qingming;WANG Yu;SHI Min;LUO Aiwen(School of Information Science and Technology,Jinan University,Guangzhou 510632,China;Taidou Microelectronic Science and Technology Co.,Ltd.,Guangzhou 510663,China)

机构地区：[1]暨南大学信息科学技术学院,广州510632 [2]泰斗微电子科技有限公司,广州510663

出　　处：《电子科技大学学报》2024年第3期366-375,共10页Journal of University of Electronic Science and Technology of China

基　　金：国家自然科学基金(62002134);广东省基础与应用基础研究基金(2020A1515110645,2023A1515010834);广东省普通高校新型半导体与器件重点实验室项目(2021KSY001);羊城创新创业领军人才支持计划(2019019);广东省科技创新战略专项(大学生科技创新培育)(pdjh2023b0061)。

摘　　要：语义分割是当前场景理解领域的基础技术之一。现存的语义分割网络通常结构复杂、参数量大、图像特征信息损失过多和计算效率低。针对以上问题,基于编-解码器框架和离散小波变换,设计了一个联合多连接特征编解码与小波池化的轻量级语义分割网络MLWP-Net(Multi-Link Wavelet-Pooled Network),在编码阶段利用多连接策略并结合深度可分离卷积、空洞卷积和通道压缩设计了轻量级特征提取瓶颈结构,并设计了低频混合小波池化操作替代传统的下采样操作,有效降低编码过程造成的信息丢失;在解码阶段,设计了多分支并行空洞卷积解码器以融合多级特征并行实现图像分辨率的恢复。实验结果表明,MLWP-Net仅以0.74 MB的参数量在数据集Cityscapes和CamVid上分别达到74.1%和68.2%mIoU的分割精度,验证了该算法的有效性。Semantic segmentation is currently one of the basic technologies in the field of scene understanding.Existing semantic segmentation networks usually result in complex structures,a large number of parameters,excessive loss of image feature information,and low computational efficiency.To address these problems,this work proposes a lightweight semantic segmentation network named MLWP-Net(Multi-Link Wavelet-Pooled Network)which combines features with multiple connections and wavelet pooling based on the encoder-decoder framework and Discrete Wavelet Transform(DWT).In the encoding phase,a lightweight feature extraction bottleneck is designed by combining with the depthwise separable convolution,dilated convolution,and channel compression,using a multi-link strategy to fuse multi-level features;besides,a low-frequency-mixed wavelet pooling operation is employed to replace the traditional downsampling operation for effectively reducing the information loss during the encoding process.In the decoding stage,a multi-branch parallel dilated convolutional decoder is designed to fuse multiple features linked to the different layers in the encoder to recover the image resolution in parallel.The experimental results show that our MLWP-Net achieves 74.1%and 68.2%mIoU segmentation accuracy on the datasets of Cityscapes and Camvid with only 0.74M parameters,which demonstrates its effectiveness for semantic segmentation.

关键词：实时语义分割轻量级神经网络多连接特征融合小波池化多分支空洞卷积

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

联合多连接特征编解码与小波池化的轻量级语义分割

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

联合多连接特征编解码与小波池化的轻量级语义分割

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索