图卷积神经网络及其在图像识别领域的应用综述  被引量:11

Graph Convolutional Neural Network and Its Application in Image Recognition

在线阅读下载全文

作  者:李文静 白静[1,2] 彭斌 杨瞻源 LI Wenjing;BAI Jing;PENG Bin;YANG Zhanyuan(School of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China;The Key Laboratory of Images&Graphics Intelligent Processing of State Ethnic Affairs Commission,North Minzu University,Yinchuan 750021,China)

机构地区:[1]北方民族大学计算机科学与工程学院,银川750021 [2]国家民委图形图像智能处理实验室,银川750021

出  处:《计算机工程与应用》2023年第22期15-35,共21页Computer Engineering and Applications

基  金:国家自然科学基金(62162001,61762003);宁夏自然科学基金(2022AAC02041);宁夏优秀人才支持计划;北方民族大学研究生创新项目(YCX23150)。

摘  要:卷积神经网络被广泛应用于图像识别领域并且展现出强大的特征提取能力,但它只能处理欧氏空间的结构化数据,无法适用于非结构化数据的处理。为应对该限制,图卷积神经网络利用谱域和空域方法,拓展了卷积运算的范围,使其能够在非欧几里德空间中进行特征学习,具备图数据的平移不变性,可以实现对非结构化图数据的表征学习。首先阐述了基于频域和空域的两种类型图卷积神经网络的基本原理,并且介绍了相关的改进工作;然后围绕图像识别领域,重点介绍了图卷积神经网络在多标签图像识别、基于骨架的动作识别和高光谱图像分类中的具体应用,总结其研究的最新进展,并对相关的模型进行了性能对比与分析;最后对全文内容进行总结,并对未来的发展方向进行展望。Convolutional neural network has found widespread application in the field of image recognition,demonstrating remarkable feature extraction capabilities.However,it is inherently designed for processing structured data in Euclidean space,making it less suitable for handling unstructured data.To address this limitation,graph convolutional neural network leverages spectral and spatial methods to extend the scope of convolutional operations,enabling feature learning in nonEuclidean spaces.GCN possesses translational invariance for graph data,facilitating representation learning for unstructured data.Firstly,the basic principles and improvement work of two types of graph convolutional neural networks based on spectral domain and space domain are explained.Then,around the field of image recognition,the application of graph convolutional neural network in multi-label image recognition,skeleton-based action recognition and hyperspectral image classification is introduced,the research progress is summarized,and the performance comparison and analysis of related models are carried out.Finally,the content of the full text is summarized and the future development direction is looked forward.

关 键 词:图像识别 图卷积神经网络 非欧氏空间 深度学习 人工智能 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象