检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:贺子泽 战荫伟[1] HE Zize;ZHAN Yinwei(School of Computer Science and Technology,Guangdong University of Technology,Guangzhou 510006,Guangdong,China)
机构地区:[1]广东工业大学计算机学院,广东广州510006
出 处:《计算机工程》2024年第11期276-283,共8页Computer Engineering
基 金:国家自然科学基金(62272108);广东省重点领域研发计划(2019B010150002;2020B0101130019)。
摘 要:动作识别是计算机视觉领域一个重要研究方向。目前,主流方法在局部动作特征上的关注度不足。部分动作识别方法为关注局部动作特征,将预定义的人体骨架拆分成左右手、左右腿等多个部分。但是,这些部分包含的骨架关键点较少,使得动作特征较相似,导致识别准确率降低。此外,已有基于局部动作特征的动作识别方法未充分考虑全局姿态特征,模型识别准确率不稳定。为此,提出基于图卷积的局部特征细化动作识别方法。将预定义人体骨骼拓扑图划分为身体、上下肢,加强模型关注局部动作特征的能力。同时,设计局部特征细化器,采用对比学习策略扩大不同种类动作的局部动作特征差异,缩小同类动作之间的差异,解决因划分策略造成动作特征相似的问题。在此基础上,将上下肢与身体的分类结果相结合,充分利用全局姿态特征,提高模型的稳定性。实验结果表明,该方法在NTU RGB+D 602个基准数据集X-Sub、X-View的识别准确率分别为93.0%和98.8%,在NTU RGB+D 1202个基准数据集X-Sub、X-Set的识别准确率分别为88.8%和90.1%,能够有效提高动作识别的准确率。Although action recognition is an important research area in computer vision,current mainstream methods lack a sufficient emphasis on local features.Some action recognition approaches focus on local action features by dividing the predefined human skeleton into various parts,such as the left and right hands,left and right legs.However,these parts contain fewer skeleton keypoints,resulting in similar action features and a lower recognition efficiency.Moreover,existing methods based on local action features often neglect global posture characteristics,leading to unstable model recognition accuracy.To address these issues,this study proposes a method for refining local features in action recognition based on graph convolution.The proposed method divides the predefined human skeleton topology into body and upper/lower limbs,enhancing the model′s capability to focus on local action features.Simultaneously,a local feature refiner uses contrastive learning strategies to expand the differences in the local action features of different types of actions,reduce the differences between similar actions,and solve the problem of similar action features caused by partitioning strategies.Accordingly,the classification results of the upper and lower limbs are combined with those of the body,fully utilizing the global pose features to improve model stability.Experimental results show that the recognition accuracies achieved by this method on two NTU RGB+D 60 benchmark datasets X-Sub and X-View are 93.0%and 98.8%,respectively.Furthermore,the recognition accuracies of X-Sub and X-Set on the NTU RGB+D 120 benchmark datasets are 88.8%and 90.1%,respectively,representing effective improvements in the accuracy of action recognition.
关 键 词:动作识别 对比学习 骨骼关键点 预定义骨骼拓扑 局部特征细化
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222