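Affiliation: [1] College of Mathematics and Informatics, South China Agricultural University, Guangzhou 510642, Guangdong, China
Source: Computer and Modernization (《计算机与现代化》), 2018, No. 4, pp. 62-67 (6 pages)
Funding: 2016 Provincial College Students' Innovation Training Program (201610564356); Guangzhou Science and Technology Program (201707010031)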
Abstract: Convolutional neural networks have strong feature representation and learning capabilities, but the geometric transformations their modules can model are essentially fixed. To address this, deformable convolution kernels are introduced to improve the VGG-16 architecture, and a convolutional neural network named DC-VGG is built for gesture recognition. On different datasets, the gesture recognition method based on the deformable convolutional network takes RGB image data directly as network input. The final results show an average gesture recognition rate above 97%. The approach improves network performance, increases the network's tolerance to variation and diversity in sample objects, and enriches its feature representation. Compared with the traditional LeNet-5 and VGG-16 structures and with traditional hand-crafted feature extraction algorithms, DC-VGG performs better: it is deeper than the traditional structures, more robust, and achieves a higher recognition rate. It can serve as a reference for effective gesture recognition against complex backgrounds and has some capacity for extension.
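Keywords: gesture recognition; deformable convolution; convolutional neural network; convolution kernel; bilinear interpolation
Classification: TP391.9 [Automation and Computer Technology: Computer Application Technology]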
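
The abstract and keywords pair deformable convolution with bilinear interpolation. As a rough illustration of how the two usually fit together, the sketch below computes one output value of a 3x3 deformable convolution in pure Python. It is not the DC-VGG implementation from the paper: the function names, the zero-padding rule, and the convention that offsets are passed in as (dy, dx) pairs are all assumptions made for the example. In a trained network the offsets would normally be predicted by an additional layer rather than supplied by hand.

```python
import math


def bilinear_sample(img, y, x):
    """Sample a 2D list `img` at a fractional (y, x) position.

    Assumption for this sketch: positions outside the image contribute 0.
    """
    h, w = len(img), len(img[0])
    y0, x0 = math.floor(y), math.floor(x)
    value = 0.0
    # Blend the four integer neighbours, weighted by their distance to (y, x).
    for yy in (y0, y0 + 1):
        for xx in (x0, x0 + 1):
            if 0 <= yy < h and 0 <= xx < w:
                weight = (1 - abs(y - yy)) * (1 - abs(x - xx))
                value += weight * img[yy][xx]
    return value


def deformable_conv_at(img, kernel, offsets, cy, cx):
    """One output value of a 3x3 deformable convolution centred at (cy, cx).

    offsets[i][j] is a hypothetical (dy, dx) pair that shifts the regular
    grid position of kernel tap (i, j); all-zero offsets give an ordinary
    3x3 convolution (cross-correlation form).
    """
    total = 0.0
    for i in range(3):
        for j in range(3):
            dy, dx = offsets[i][j]
            sample_y = cy + (i - 1) + dy
            sample_x = cx + (j - 1) + dx
            total += kernel[i][j] * bilinear_sample(img, sample_y, sample_x)
    return total


if __name__ == "__main__":
    image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    ones = [[1.0] * 3 for _ in range(3)]
    no_shift = [[(0.0, 0.0)] * 3 for _ in range(3)]
    print(deformable_conv_at(image, ones, no_shift, 1, 1))
```

Because the sampling positions are fractional, each kernel tap reads a weighted blend of up to four pixels, which is what lets the sampling grid bend away from the fixed square layout of a standard kernel.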