SemID: Blind Image Inpainting with Semantic Inconsistency Detection  被引量:1

在线阅读下载全文

作  者:Xin Li Zhikuan Wang Chenglizhao Chen Chunfeng Tao Yuanbo Qiu Junde Liu Baile Sun 

机构地区:[1]College of Computer Science and Technology,China University of Petroleum(East China),Qingdao 266580,China [2]CNPC Oriental Geophysical Exploration Co.Ltd.,Baoding 072751,China

出  处:《Tsinghua Science and Technology》2024年第4期1053-1068,共16页清华大学学报自然科学版(英文版)

基  金:supported by the Natural Science Foundation of Shandong Province of China(No.ZR2020MF140);the Major Scientific and Technological Projects of CNPC(No.ZD2019-183-004);the Fundamental Research Funds for the Central Universities(No.20CX05019A).

摘  要:Most existing image inpainting methods aim to fill in the missing content in the inside-hole region of the target image. However, the areas to be restored in realistically degraded images are unspecified. Previous studies have failed to recover the degradations due to the absence of the explicit mask indication. Meanwhile, inconsistent patterns are blended complexly with the image content. Therefore, estimating whether certain pixels are out of distribution and considering whether the object is consistent with the context is necessary. Motivated by these observations, a two-stage blind image inpainting network, which utilizes global semantic features of the image to locate semantically inconsistent regions and then generates reasonable content in the areas, is proposed. Specifically, the representation differences between inconsistent and available content are first amplified, iteratively predicting the region to be restored from coarse to fine. A confidence-driven inpainting network based on prediction masks is then used to estimate the information regarding missing regions. Furthermore, a multiscale contextual aggregation module is introduced for spatial feature transfer to refine the generated contents. Extensive experiments over multiple datasets demonstrate that the proposed method can generate visually plausible and structurally complete results that are particularly effective in recovering diverse degraded images.

关 键 词:blind image inpainting inconsistent pattern representation difference contextual aggregation 

分 类 号:U495[交通运输工程—交通运输规划与管理]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象