检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:苏常保 龚世才 SU Chang-bao;GONG Shi-cai(School of Science,Zhejiang University of Science and Technology,Hangzhou Zhejiang 310000,China)
出 处:《图学学报》2022年第2期247-253,共7页Journal of Graphics
基 金:浙江省自然科学基金项目(Ly20A010005)。
摘 要:针对抠图任务中人物抠图完整度低、边缘不够精细化等繁琐问题,提出了一种基于深度学习的人物肖像全自动抠图算法。算法采用三分支网络进行学习,语义分割分支(SSB)学习α图的语义信息,细节分支(DB)学习α图的细节信息,混合分支(COM)将2个分支的学习结果汇总。首先算法的编码网络采用轻量级卷积神经网络(CNN) MobileNetV2,以加速算法的特征提取过程;其次在SSB中加入注意力机制对图像特征通道重要性进行加权,在DB加入空洞空间金字塔池化(ASPP)模块,对图像的不同感受野所提取的特征进行多尺度融合;然后解码网络的2个分支通过跳级连接融合不同阶段编码网络提取到的特征进行解码;最后将2个分支学习的特征融合在一起得到图像的α图。实验结果表明,该算法在公开的数据集上抠图效果优于所对比的基于深度学习的半自动和全自动抠图算法,在实时流视频抠图的效果优于Modnet。Aiming at the problems of low completeness of character matting, insufficiently refined edges, and cumbersome matting in matting tasks, an automatic matting algorithm for portraits based on deep learning was proposed. The algorithm employed a three-branch network for learning: the semantic information of the semantic segmentation branch(SSB) learning α graph, and the detailed information of the detail branch(DB)learning α graph. The combination branch(COM) summarized the learning results of the two branches. First, the algorithm’s coding network utilized a lightweight convolutional neural network MobileNetV2, aiming to accelerate the feature extraction process of the algorithm. Second, an attention mechanism was added to the SSB branch to weight the importance of image feature channels, the atrous spatial pyramid pooling module was added to the DB branch, and multi-scale fusion was achieved for the features extracted from the different receptive fields of the image. Then, the two branches of the decoding network merged the features extracted by the encoding network at different stages through the jump connection, thus conducting the decoding. Finally, the features learned by the two branches were fused together to obtain the image α graph. The experimental results show that on the public data set, this algorithm can outperform the semi-automatic and fully automatic matting algorithms based on deep learning, and that the effect of real-time streaming video matting is superior to that of Modnet.
关 键 词:全自动抠图 轻量级卷积神经网络 注意力机制 空洞空间金字塔池化 特征融合
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.128.24.183