基于深度学习的人物肖像全自动抠图算法  被引量:3

Fully automatic matting algorithm for portraits based on deep learning

在线阅读下载全文

作  者:苏常保 龚世才 SU Chang-bao;GONG Shi-cai(School of Science,Zhejiang University of Science and Technology,Hangzhou Zhejiang 310000,China)

机构地区:[1]浙江科技学院理学院,浙江杭州310000

出  处:《图学学报》2022年第2期247-253,共7页Journal of Graphics

基  金:浙江省自然科学基金项目(Ly20A010005)。

摘  要:针对抠图任务中人物抠图完整度低、边缘不够精细化等繁琐问题,提出了一种基于深度学习的人物肖像全自动抠图算法。算法采用三分支网络进行学习,语义分割分支(SSB)学习α图的语义信息,细节分支(DB)学习α图的细节信息,混合分支(COM)将2个分支的学习结果汇总。首先算法的编码网络采用轻量级卷积神经网络(CNN) MobileNetV2,以加速算法的特征提取过程;其次在SSB中加入注意力机制对图像特征通道重要性进行加权,在DB加入空洞空间金字塔池化(ASPP)模块,对图像的不同感受野所提取的特征进行多尺度融合;然后解码网络的2个分支通过跳级连接融合不同阶段编码网络提取到的特征进行解码;最后将2个分支学习的特征融合在一起得到图像的α图。实验结果表明,该算法在公开的数据集上抠图效果优于所对比的基于深度学习的半自动和全自动抠图算法,在实时流视频抠图的效果优于Modnet。Aiming at the problems of low completeness of character matting, insufficiently refined edges, and cumbersome matting in matting tasks, an automatic matting algorithm for portraits based on deep learning was proposed. The algorithm employed a three-branch network for learning: the semantic information of the semantic segmentation branch(SSB) learning α graph, and the detailed information of the detail branch(DB)learning α graph. The combination branch(COM) summarized the learning results of the two branches. First, the algorithm’s coding network utilized a lightweight convolutional neural network MobileNetV2, aiming to accelerate the feature extraction process of the algorithm. Second, an attention mechanism was added to the SSB branch to weight the importance of image feature channels, the atrous spatial pyramid pooling module was added to the DB branch, and multi-scale fusion was achieved for the features extracted from the different receptive fields of the image. Then, the two branches of the decoding network merged the features extracted by the encoding network at different stages through the jump connection, thus conducting the decoding. Finally, the features learned by the two branches were fused together to obtain the image α graph. The experimental results show that on the public data set, this algorithm can outperform the semi-automatic and fully automatic matting algorithms based on deep learning, and that the effect of real-time streaming video matting is superior to that of Modnet.

关 键 词:全自动抠图 轻量级卷积神经网络 注意力机制 空洞空间金字塔池化 特征融合 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象