检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:何平 张恒[2] 刘成林[2,3] HE Ping;ZHANG Heng;LIU Chenglin(School of Computer Science and Technology,Anhui University,Hefei 230601;National Laboratory of Pattern Recognition,Institute of Automation,Chinese Academy of Sciences,Beijing 100190;School of Artificial Intelligence,University of Chinese Academy of Sciences,Beijing 100049)
机构地区:[1]安徽大学计算机科学与技术学院,合肥230601 [2]中国科学院自动化研究所模式识别国家重点实验室,北京100190 [3]中国科学院大学人工智能学院,北京100049
出 处:《模式识别与人工智能》2022年第7期614-624,共11页Pattern Recognition and Artificial Intelligence
基 金:国家自然科学基金项目(No.61936003,61721004)资助。
摘 要:自然场景文本擦除技术可应用在图像通信中的隐私保护、图像编辑等领域,然而现阶段的场景文本擦除在面对背景复杂、文本尺度变化较大的场景图像时,难以提取鲁棒的文本特征,出现文本检测不全、背景修复不完整等问题.针对上述问题,文中提出基于多尺度注意力机制的场景文本擦除框架.该框架主要由背景修复网络和文本检测网络共同组成,它们共享一个主干网络.在背景修复网络中,设计纹理自适应模块,从原始特征的通道和空间两个维度进行特征编码,自适应地集成局部特征与全局特征,有效修复因重构文本区域而导致的阴影部分.在文本检测网络中,设计上下文感知模块,学习图像中文本区域和非文本区域之间的判别关系,有效区分文本区域和非文本区域,提升文本检测的效果.此外,为了增强网络的感受野,改进不同尺度文本的擦除效果,提出多尺度特征损失函数,同时优化背景修复网络和文本检测网络.SCUT-SYN、SCUT-EnsText数据集上的实验表明,文中框架可取得较优的文本擦除性能.Scene text removal is of great significance for privacy protection and image editing in image communication.However,existing scene text removal models are insufficient in extracting robust features for images with complex background and multi-scale texts,resulting in incomplete text detection and background repair.To solve this problem,a scene text removal framework based on multi-scale attention mechanism is proposed for robust background repair and text detection.The proposed framework is mainly composed of background repair network and text detection network,sharing a backbone network.In the background repair network,a texture adaptive module is designed to encode the channel/spatial features and adaptively integrate local/global features,effectively repairing shadow parts in text reconstruction.To improve text detection,a context aware module is designed to learn the discriminative features between texts and non-texts in the image.Besides,to enhance the receptive field of the network and improve the removal of multi-scale texts,a multi-scale feature loss function is designed to optimize the background repair and text detection modules.Experimental results on SCUT-SYN and SCUT-EnsText datasets show that the proposed method can achieve the state-of-the-art performance in text removal.
关 键 词:场景文本擦除 文本分割 注意力机制 多尺度特征 端到端方法
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.127