融合生成模型和判别模型的双层RBM运动捕获数据语义识别算法  被引量:2

Two-Layer Motion Semantic Recognition by Fusing the Restricted Boltzmann Machine Based Generative Model and Discriminative Model

在线阅读下载全文

作  者:周兵[1,2] 彭淑娟[1,2] 柳欣[1,2] Zhou Bing;Peng Shujuan;Liu Xin(College of Computer Science and Technology, Huaqiao University, Xiamen 361021;Key Lab of Pattern Recognition and Computer Vision, Xiamen City, Xiamen 361021)

机构地区:[1]华侨大学计算机科学与技术学院,厦门361021 [2]厦门市模式识别与计算机视觉重点实验室,厦门361021

出  处:《计算机辅助设计与图形学学报》2017年第4期689-698,共10页Journal of Computer-Aided Design & Computer Graphics

基  金:国家自然科学基金(61673185;61673186);福建省自然科学基金(2015J01656);华侨大学科研创新能力培养资助项目(1511414012)

摘  要:对人体运动捕获数据底层特征和高层语义之间常常存在语义鸿沟的问题,结合深度学习思想,提出一种融合受限玻尔兹曼机生成模型和判别模型的运动捕获数据语义识别算法.该算法采用双层受限玻尔兹曼机,分别对运动捕获数据进行判别性特征提取(特征提取层)和风格识别(语义判别层),首先考虑到自回归模型对时序信息具有出色的表达能力,构建一种基于单通道三元因子交互的条件限制玻尔兹曼机生成模型,用于提取运动捕捉数据的时空特征信息;然后将提取出的特征与对应的风格标签相耦合,作为语义判别层中受限玻尔兹曼机判别模型的当前帧数据层输入,进行单帧风格识别的训练;最后在获得各帧参数的基础上,在模型顶部加入投票空间实现对运动捕捉序列的风格语义的有效识别.实验结果表明,文中算法具有良好的鲁棒性和可扩展性,能够满足多样化运动序列识别的需求,便于数据的有效重用.The semantic gap problem between the low-level features and high-level semantics often existswithin the motion capture data.To tackle this problem,we refer to the deep learning theory and propose atwo-layer motion recognition approach by fusing the Restricted Boltzmann Machine(RBM)based generativemodel and discriminative model,in which the generative layer is utilized for feature representation andthe discriminative layer is selected for semantic discrimination.Within the proposed approach,we first utilizethe autoregressive model to establish an one-way three-factored conditional RBM,whereby the spatiotemporalfeatures of the captured motions can be well obtained.Then,these features are coupled with theircorresponding labels and selected as the visible input of the RBM based discriminative model.Finally,byadding a voting space,the motion semantics can be efficiently recognized via this two layer fused model.The experimental results have shown that our proposed approach is able to recognize different kinds of motionposes,featuring robustness and expandability to the motion capture data.It is expected that the proposedapproach would be well utilized for motion capture data reusing in a practical way.

关 键 词:动作捕捉 时空特征 深度学习 受限玻尔兹曼机 判别模型 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象