Attention Mechanism Image Understanding Algorithm of Ocean Scene (注意力机制海洋场景图像理解算法)  Cited by: 2


Authors: WU Man (邬满); WEN Lili (文莉莉); SUN Miao (孙苗)

Affiliations: [1] Information Department, Guangxi Academy of Oceanography, Nanning 530022, China [2] Technology Innovation Center of Marine Information, Ministry of Natural Resources, Tianjin 300171, China [3] School of Electrical Engineering, Guangxi University, Nanning 530007, China [4] Information Industry Office, Guangxi Botanical Garden of Medicinal Plants, Nanning 530023, China

Source: Computer Engineering and Applications (《计算机工程与应用》), 2022, No. 10, pp. 231-239 (9 pages)

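Funding: Open Fund of the Technology Innovation Center of Marine Information, Ministry of Natural Resources; National Natural Science Foundation of China (61763007, 61866007); Guangxi Science and Technology Major Project (桂科AA18118025).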

Abstract: Complex ocean scenes are characterized by multi-scale targets, diverse objects, large differences in style, strong spatiotemporal correlation, and the presence of uncertain targets. Addressing these characteristics, this paper studies an attention-based method for extracting effective features from complex images and proposes a Chinese description generation model for complex ocean scene images that combines a convolutional neural network (CNN) with a long short-term memory (LSTM) network. Together with the Jieba word segmentation tool, the model achieves automatic translation of complex ocean scene monitoring images into Chinese descriptions. The model is built and the algorithm is verified using data from 91卫图助手 (91 satellite map assistant) and high-definition UAV imagery. The results show that Inception-v4 has a stronger ability to extract complex features than VGG16: on the same dataset, the image classification performance of the Inception-v4 model is about 5.3 percentage points higher. The Chinese image description generation algorithm based on the CNN and LSTM models is basically feasible and can solve the problem of automatically annotating images in batches, but the stability of the algorithm and the accuracy of its descriptions need further improvement.
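The abstract names an attention mechanism but does not state its formulation. As an illustration only, the following minimal sketch shows generic soft (dot-product) attention over image region features, a common choice in CNN + LSTM captioning models. The function name `soft_attention`, the dot-product scoring, and the toy numbers are assumptions made for this example and are not taken from the paper.

```python
import math


def soft_attention(features, query):
    """Return (weights, context) for dot-product soft attention.

    features: list of region feature vectors (e.g. CNN grid cells)
    query:    decoder state vector (e.g. an LSTM hidden state), same length
    """
    # Alignment score of each region with the query.
    # Plain dot product is an assumption; the paper may score differently.
    scores = [sum(f_i * q_i for f_i, q_i in zip(f, query)) for f in features]

    # Numerically stable softmax: weights are positive and sum to 1.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Context vector: attention-weighted average of the region features.
    dim = len(features[0])
    context = [sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)]
    return weights, context


# Toy input: three 2-D "region features" and one 2-D query (made-up numbers).
regions = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
state = [2.0, 0.0]
weights, context = soft_attention(regions, state)
```

In a captioning model of this kind, the context vector would typically be fed to the LSTM at each decoding step, so that each generated word can draw on different image regions.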

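Keywords: image feature extraction; attention mechanism; long short-term memory model; image description generation; Chinese word segmentation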

Classification number: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]

 
