基于时空图卷积网络改进的人体行为识别方法  被引量:1

Improved Human Action Recognition Method Based on Spatial Temporal Graph Convolutional Network

在线阅读下载全文

作  者:王松[1] SongWang(School of Management and Economics,Chuxiong Normal University,Chuxiong,Yunnan Province 675000)

机构地区:[1]楚雄师范学院管理与经济学院,云南楚雄675000

出  处:《楚雄师范学院学报》2022年第3期91-100,共10页Journal of Chuxiong Normal University

摘  要:针对目前利用时空图卷积网络ST-GCN行为识别模型进行人体行为识别准确性有待提高和如何更好地学习骨骼数据中关节点和骨架边所表达的动作特征等问题,改进现有的时空图卷积网络(ST-GCN)行为识别模型。首先,使用有向图来表示关节点和骨骼边的信息以及它们之间的依赖关系,提取相邻帧的关节位置差异作为运动信息;其次,使用双流框架分别学习运动信息和空间信息,进行融合提高识别性能;最后,使用注意力权重矩阵让图的拓扑结构具有自适应性,增大节点的感受野,使网络能够学习到远端关节之间的语义信息,更好的捕捉动作特征。将所提出的方法在NTURGB+D数据集上进行实验。研究结果表明,采用基于时空图卷积网络改进的人体行为识别方法在数据集上达到了96%的准确率,与现有ST-GCN模型相比,准确率提高了。此方法可进一步促进人体行为识别技术在智能家居、智能监控安防、人机交互、基于内容的视频检索、智慧城市发展等领域的广泛应用。In order to improve the accuracy of human action recognition using Spatial Temporal Graph Convolutional Networks(ST-GCN) and to better learn the motion features expressed by joint points and skeleton edges in bone data, the existing Spatial Temporal Graph Convolutional Networks(ST-GCN) human action recognition model is improved. Firstly, the directed graph is used to represent the information of joint points and bone edges and the relationship between them, and the joint position difference of adjacent frames is extracted as the motion information;secondly, the Two-stream Framework is used to learn the motion information and spatial information respectively to improve the recognition performance;finally, the attention weight matrix is used to make the topology of the graph adaptive and increase the recognition efficiency. The receptive field of nodes enables the network to learn the semantic information between the distal joints and better capture the motion features. The proposed method is tested on NTU-RGB+D dataset. The results show that the improved human action recognition method based on spatiotemporal convolution neural network achieves96% accuracy on the data set, and the accuracy is improved compared with that of the existing ST-GCN model. This method can further promote the wide application of human action recognition technology in smart home, intelligent monitoring, security, human-computer interaction, content-based video retrieval, smart city development and other fields.

关 键 词:人体行为识别 时空图卷积神经网络 有向图网络 注意力机制 双流框架 

分 类 号:TP183[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象