Affiliation: [1] School of Computer Science, Civil Aviation Flight University of China, Guanghan, Sichuan 618307
Source: Computer Engineering and Applications (《计算机工程与应用》), 2018, No. 4, pp. 174-178, 230 (6 pages)
Funding: Scientific Research Project of the Sichuan Provincial Department of Education (No. 16ZB0032)
Abstract: Existing action recognition algorithms based on 3D convolutional neural networks (3D CNNs) split the input video into clips of a fixed length, so a clip may carry redundant or incomplete information about an action. This paper proposes a solution to that problem. A human atomic action is defined from the characteristics of the particle trajectories of human motion, and the video is split using the length of each atomic action as the clip length, so that each clip contains one complete action. A 3D CNN, however, requires all inputs to have the same dimensions, whereas atomic-action clips differ in length. To resolve this conflict, spatial pyramid pooling is adapted to 3D (3D Spatial Pyramid Pooling, 3D SPP) so that it can handle clips of different lengths. The 3D SPP layer is placed before the fully connected layers, where it converts the variable-length feature maps output by the convolutional layers into feature vectors of the same length. In experiments against several related algorithms, the method places fewer requirements on the input data and, because each clip carries complete information, achieves a significantly higher recognition rate.
Keywords: action recognition; video analysis; 3D spatial pyramid pooling; atomic action; 3D convolutional neural network
Classification: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]
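The abstract's central mechanism, pooling variable-length feature maps into a fixed-length vector before the fully connected layers, can be illustrated with a minimal sketch. This is generic pyramid pooling extended to three axes, not the paper's improved 3D SPP, whose details the abstract does not give. The pyramid levels `(1, 2, 4)`, the use of max pooling, the single-channel nested-list input, and the names `bin_edges`, `spp3d`, and `make_volume` are all assumptions made for illustration.

```python
def bin_edges(size, bins):
    """Split range(size) into `bins` non-empty windows covering every index.

    Window i spans [floor(i*size/bins), ceil((i+1)*size/bins)), so windows
    overlap slightly when size is not a multiple of bins (or size < bins).
    """
    edges = []
    for i in range(bins):
        start = (i * size) // bins
        end = -((-(i + 1) * size) // bins)  # integer ceiling division
        edges.append((start, end))
    return edges


def spp3d(volume, levels=(1, 2, 4)):
    """Max-pool one T x H x W feature map into a fixed-length vector.

    For each pyramid level n the map is divided into n*n*n windows and the
    maximum of each window is kept, so the output length is sum(n**3)
    regardless of T, H, or W. (Hypothetical helper, single channel only.)
    """
    t_len, h_len, w_len = len(volume), len(volume[0]), len(volume[0][0])
    pooled = []
    for n in levels:
        for t0, t1 in bin_edges(t_len, n):
            for h0, h1 in bin_edges(h_len, n):
                for w0, w1 in bin_edges(w_len, n):
                    pooled.append(max(
                        volume[t][h][w]
                        for t in range(t0, t1)
                        for h in range(h0, h1)
                        for w in range(w0, w1)
                    ))
    return pooled


def make_volume(t_len, h_len=4, w_len=4):
    """Build a toy feature map whose values encode their own position."""
    return [[[float(t * 100 + h * 10 + w) for w in range(w_len)]
             for h in range(h_len)]
            for t in range(t_len)]


# Two "clips" of different temporal length yield vectors of equal length.
short_clip = spp3d(make_volume(5))
long_clip = spp3d(make_volume(13))
print(len(short_clip), len(long_clip))  # 73 73
```

Because the number of windows depends only on the pyramid levels, the fully connected layers that follow always see 1 + 8 + 64 = 73 values per channel in this sketch, whatever the clip length.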