检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:张文涛 王园宇 李赛泽 ZHANG Wentao;WANG Yuanyu;LI Saize(College of Information and Computer,Taiyuan University of Technology,Jinzhong Shanxi 030600,China)
机构地区:[1]太原理工大学信息与计算机学院,山西晋中030600
出 处:《计算机应用》2022年第9期2865-2875,共11页journal of Computer Applications
基 金:山西省自然科学基金资助项目(201801D121142);山西省回国留学人员科研资助项目。
摘 要:针对霾环境中图像降质导致的传统深度估计模型退化问题,提出了一种融合双注意力机制的基于条件生成对抗网络(CGAN)的单幅霾图像深度估计模型。首先,对于模型的生成器的网络结构,提出了融合双注意力机制的DenseUnet结构,其中DenseUnet将密集块作为U-net编码和解码过程中的基本模块,并利用密集连接和跳跃连接在加强信息流动的同时,提取直接传输率图的底层结构特征和高级深度信息。然后,通过双注意力模块自适应地调整空间特征和通道特征的全局依赖关系,同时将最小绝对值损失、感知损失、梯度损失和对抗损失融合为新的结构保持损失函数。最后,将霾图像的直接传输率图作为CGAN的条件,通过生成器和鉴别器的对抗学习估计出霾图像的深度图。在室内数据集NYU Depth v2和室外数据集DIODE上进行训练和测试。实验结果表明,该模型具有更精细的几何结构和更丰富的局部细节。在NYU Depth v2上,与全卷积残差网络相比,对数平均误差(LME)和均方根误差(RMSE)分别降低了7%和10%;在DIODE上,与深度有序回归网络相比,精确度(阈值小于1.25)提高了7.6%。可见,所提模型提高了在霾干扰下深度估计的准确性和泛化能力。To address the degradation problem of traditional depth estimation models caused by image quality degradation in haze environment,a model based on Conditional Generative Adversarial Network(CGAN)was proposed to estimate the depth of single haze image by fusing dual attention mechanism. Firstly,for the network structure of the generator of the model,the DenseUnet structure fused with dual attention mechanism was proposed. The dense blocks were used as basic blocks in the encoding and decoding processes of U-net. Dense and jump connections were used to enhance information flow,as well as extract the underlying structural features and high-level depth information of the direct transmission rate map. Then,the global dependencies of spatial features and channel features were adaptively adjusted by the dual attention module. At the same time,a new structure-preserving loss function was proposed by combining the least absolute value function,perceptual loss,gradient loss,and adversarial loss. Finally,using the direct transmission rate map of the haze image as a condition of CGAN,the depth map of the haze image was estimated through the adversarial learning of the generator and the discriminator. Training and testing were performed on the indoor dataset NYU Depth v2 and the outdoor dataset DIODE.Experimental results show that the proposed model has a finer geometric structure and richer local details. Compared with the fully convolutional residual network,on NYU Depth v2,the proposed model has the Logarithmic Mean Error(LME)and Root Mean Square Error(RMSE)error reduced by 7% and 10%,respectively. Compared with the deep ordinal regression network,on DIODE,the proposed model has the accuracy with threshold less than 1. 25 increased by 7. 6%. It can be seen that the proposed model improves the estimation accuracy and generalization ability of depth estimation under the interference of haze.
关 键 词:深度估计 霾图像 注意力机制 梯度损失 条件生成对抗网络 直接传输率图
分 类 号:TP183[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222