多阶段特征融合的三支流头部姿态估计算法  被引量:1

Three-Stream Head Pose Estimation Algorithm Based on Multi-Stage Feature Fusion

在线阅读下载全文

作  者:韩雪 张红英[1,2] 卢琇雯 张奇 HAN Xue;ZHANG Hongying;LU Xiuwen;ZHANG Qi(School of Information Engineering,Southwest University of Science and Technology,Mianyang,Sichuan 621010,China;Robot Technology Used for Special Environment Key Laboratory of Sichuan Provincial,Southwest University of Science and Technology,Mianyang,Sichuan 621010,China)

机构地区:[1]西南科技大学信息工程学院,四川绵阳621010 [2]西南科技大学特殊环境机器人技术四川省重点实验室,四川绵阳621010

出  处:《计算机工程与应用》2023年第17期212-222,共11页Computer Engineering and Applications

基  金:国家部委预研项目。

摘  要:针对现有的头部姿态估计算法在复杂场景下实时性较差、识别率较低的问题,提出了一种多阶段特征融合的三支流头部姿态估计算法。该算法具有多级输出的结构,用三条不同类型的网络分别对输入图像进行特征提取,并且每条支流上都有三个阶段,每一阶段只需要细化前一阶段的特征,相同阶段提取出的特征图经过特征融合模块来生成特征映射,有效避免了特征丢失问题;特征提取模块选择Ghost模块作为特征提取网络,利用模型压缩,使之在保证网络精度的同时减少网络参数和计算量;为提取出重要性更强的有效特征,引入高效通道注意力模块ECA-Net,从而提升头部姿态估计的准确性。实验结果表明,所提算法在AFLW2000数据集和BIWI数据集上均取得优异的性能,对比当前诸多头部姿态估计方法,模型大小仅为0.55MB,在AFLW2000和BIWI数据集上的MAE分别降低至4.68和3.59。Aiming at the problems of poor real-time performance and low recognition rate of existing head pose estima-tion algorithms in complex scenes,a three-stream head pose estimation algorithm based on multi-stage feature fusion is proposed.The algorithm has a multi-level output structure.Three different types of networks are used to extract features from the input image,and each branch has three stages.Each stage only needs to refine the features of previous stage.Feature map extracted at the same stage is generated by the feature fusion module,which effectively avoids the problem of feature loss.The feature extraction module selects the Ghost module as the feature extraction network,and uses model compression to reduce network parameters and computation while ensuring network accuracy.In order to extract more important and effective features,an efficient channel attention module ECA-Net is introduced to improve the accuracy of head pose estimation.Experimental results show that the proposed algorithm achieves excellent performance on both the AFLW2000 dataset and the BIWI dataset,with a model size of only 0.55 MB and a reduced MAE of 4.68 and 3.59 on the AFLW2000 and BIWI datasets respectively,compared to many current head pose estimation methods.

关 键 词:头部姿态估计 GhostNet 高效通道注意力 特征提取 特征融合 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象