改进YOLOv5的智慧课堂人脸检测算法  

Intelligent Classroom Face Detection Algorithm with Improved YOLOv5

在线阅读下载全文

作  者:钟源 袁家政 李鸿天 刘宏哲 徐成[1,2] ZHONG Yuan;YUAN Jiazheng;LI Hongtian;LIU Hongzhe;XU Cheng(Beijing Key Laboratory of Information Service Engineering,Beijing Union University,Beijing 100101,China;Institute for Brain&Cognitive Sciences,Beijing Union University,Beijing 100101,China;School of Science and Technology,Beijing Open University,Beijing 100081,China)

机构地区:[1]北京联合大学,北京市信息服务工程重点实验室,北京100101 [2]北京联合大学脑与认知智能北京实验室,北京100101 [3]北京开放大学科学技术学院,北京100081

出  处:《计算机工程与应用》2024年第11期251-257,共7页Computer Engineering and Applications

基  金:国家自然科学基金(62171042,62102033,62006020);北京市重点科技项目(KZ202211417048);北京市属高等学校高水平科研创新团队建设支持计划项目(BPHR20220121);北京市自然科学基金(4232026);协同创新中心(CYXC2203)。

摘  要:智慧课堂是人工智能领域热门的应用场景。针对课堂场景下摄像头位置较远且偏,图像中目标存在人脸过小和遮挡导致漏检或错检等问题,提出了一种改进YOLOv5的智慧课堂人脸检测算法YOLOv5-SASA。该算法主要包括三个部分,在backbone层沿用了CSPDarknet53网络,通过在最后的空间池化层中使用BasicRFB模块来有效增强网络的特征提取能力;采用NWD损失函数来提高模型对小目标检测的鲁棒性,同时在head层中引入了独立自注意力机制模块SASA,以解决人脸遮挡的问题,并降低模型的参数量;通过降低中间层通道神经元的数量、调节学习率等方式,对改进的YOLOv5网络进行了优化,以避免模型过拟合。实验结果表明,所提出的方法在WiderFace验证集的easy、medium和hard难度下的效果均优于原网络,分别达到了97.5%、96.3%和86.5%的准确率,能够有效提升课堂场景下人脸检测的精度。The intelligent classroom is a popular application scenario in the field of artificial intelligence.This paper proposes a face detection algorithm based on improved YOLOv5,named YOLOv5-SASA,to address the issues of missed or false detection caused by small or occluded faces in images captured by cameras located far away or at an angle.The algorithm consists of three parts.Firstly,the CSPDarknet53 network is utilized in the backbone layer,and the BasicRFB module is used in the final spatial pooling layer to enhance the network’s feature extraction ability.Secondly,the NWD loss function is employed to improve the model’s robustness in detecting small targets.Thirdly,the independent self-attention mechanism module SASA is introduced in the head layer to address the issue of face occlusion and reduce the model’s parameter count.Finally,the improved YOLOv5 network is optimized by reducing the number of neurons in the middle layer channels and adjusting the learning rate to avoid overfitting.Experimental results demonstrate that the proposed method outperforms the original network in the easy,medium,and hard levels of the WiderFace validation set,achieving accuracies of 97.5%,96.3%,and 86.5%,respectively,which effectively improves the accuracy of face detection in classroom scenarios.

关 键 词:智慧课堂 人脸检测 YOLOv5 独立自注意力机制 

分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象