Automatic portrait matting model based on semantic guidance

Authors: CHENG Yan; YAN Zhihang; LAI Jianming; WANG Guixi; ZHONG Linhui [3,4]

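Affiliations: [1] School of Software, Jiangxi Normal University, Nanchang, Jiangxi 330022, China; [2] School of Digital Industry, Jiangxi Normal University, Shangrao, Jiangxi 334000, China; [3] School of Computer Information and Engineering, Jiangxi Normal University, Nanchang, Jiangxi 330022, China; [4] Key Laboratory of Intelligent Information Processing and Emotional Computing of Jiangxi Province, Nanchang, Jiangxi 330022, China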

Source: Journal of Graphics (《图学学报》), 2024, No. 4, pp. 683-695 (13 pages)

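Funding: National Natural Science Foundation of China (62167006, 61967011); Jiangxi Provincial Science and Technology Innovation Base Program, Provincial Key Laboratory Project (2024SSY03131); Natural Science Foundation of Jiangxi Province (20212BAB202017); Jiangxi Province 03 Special Project and 5G Project (20212ABC03A22); Jiangxi Province Training Program for Academic and Technical Leaders of Major Disciplines, Leading Talent Project (20213BCJL22047).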

Abstract: To address semantic discrimination errors and blurred matting details in existing portrait matting methods, an automatic portrait matting model based on semantic guidance is proposed. First, the hybrid CNN-Transformer architecture EMO is introduced for feature encoding. Next, in the semantic segmentation decoding branch, a multi-scale hybrid attention module processes the highest-level encoded features to strengthen multi-scale representation and pixel-level discrimination. A feature enhancement module then fuses high-level features so that high-level semantic information flows into the shallow layers of the network. Meanwhile, in the detail extraction decoding branch, an aggregation guidance module aggregates features from the different branches and uses the aggregated features to guide the extraction of shallow features, which improves the accuracy of edge detail matting. Experiments on three datasets show that the method achieves the best performance among the compared methods while significantly reducing parameter count and computational complexity, making it highly competitive.
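The abstract names a semantic segmentation branch and a detail extraction branch but does not say how their outputs are merged into the final alpha matte. As an illustration only, the sketch below assumes a fusion rule often used in two-branch matting models: trust the semantic branch where it is confident (foreground or background) and take the detail branch's alpha only in the uncertain transition region. The three class labels, the function name `fuse_alpha`, and the fusion rule itself are assumptions for this example, not details taken from the paper.

```python
from typing import List, Sequence

# Hypothetical class labels for the coarse semantic map (not from the paper).
BACKGROUND, UNKNOWN, FOREGROUND = 0, 1, 2


def fuse_alpha(
    semantic: Sequence[Sequence[int]],
    detail: Sequence[Sequence[float]],
) -> List[List[float]]:
    """Merge a coarse 3-class semantic map with a fine-grained alpha map.

    Pixels labelled FOREGROUND become 1.0, BACKGROUND pixels become 0.0,
    and UNKNOWN pixels take the detail branch's alpha, clamped to [0, 1].
    """
    if len(semantic) != len(detail):
        raise ValueError("semantic and detail maps must have the same height")

    fused: List[List[float]] = []
    for sem_row, det_row in zip(semantic, detail):
        if len(sem_row) != len(det_row):
            raise ValueError("semantic and detail maps must have the same width")
        row: List[float] = []
        for label, alpha in zip(sem_row, det_row):
            if label == FOREGROUND:
                row.append(1.0)
            elif label == BACKGROUND:
                row.append(0.0)
            else:
                # Transition region: keep the detail prediction, clamped.
                row.append(min(1.0, max(0.0, alpha)))
        fused.append(row)
    return fused


if __name__ == "__main__":
    semantic_map = [[0, 1, 2], [1, 1, 2]]
    detail_map = [[0.9, 0.4, 0.2], [1.3, -0.1, 0.5]]
    print(fuse_alpha(semantic_map, detail_map))
```

Plain nested lists keep the example dependency-free; a real model would apply the same per-pixel rule to tensors. The design point is that semantic errors (a wrong foreground or background decision) and detail errors (a blurred edge) are handled by different branches, which matches the two problems the abstract sets out to solve.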

Keywords: portrait matting; semantic guidance; multi-scale; feature enhancement; aggregation guidance

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]
