SmokerViT: A Transformer-Based Method for Smoker Recognition

作　　者：Ali Khan Somaiya Khan Bilal Hassan Rizwan Khan Zhonglong Zheng

机构地区：[1]College of Mathematics and Computer Science,Zhejiang Normal University,Jinhua,321004,China [2]School of Electronics Engineering,Beijing University of Posts and Telecommunications,Beijing,100876,China [3]Department of Electrical Engineering and Computer Science,Khalifa University of Science and Technology,Abu Dhabi,127788,United Arab Emirates [4]Key Laboratory of Intelligent Education of Zhejiang Province,Zhejiang Normal University,Jinhua,321004,China

出　　处：《Computers, Materials & Continua》2023年第10期403-424,共22页计算机、材料和连续体（英文）

摘　　要：Smoking has an economic and environmental impact on society due to the toxic substances it emits.Convolutional Neural Networks(CNNs)need help describing low-level features and can miss important information.Moreover,accurate smoker detection is vital with minimum false alarms.To answer the issue,the researchers of this paper have turned to a self-attention mechanism inspired by the ViT,which has displayed state-of-the-art performance in the classification task.To effectively enforce the smoking prohibition in non-smoking locations,this work presents a Vision Transformer-inspired model called SmokerViT for detecting smokers.Moreover,this research utilizes a locally curated dataset of 1120 images evenly distributed among the two classes(Smoking and NotSmoking).Further,this research performs augmentations on the smoker detection dataset to have many images with various representations to overcome the dataset size limitation.Unlike convolutional operations used in most existing works,the proposed SmokerViT model employs a self-attention mechanism in the Transformer block,making it suitable for the smoker classification problem.Besides,this work integrates the multi-layer perceptron head block in the SmokerViT model,which contains dense layers with rectified linear activation and linear kernel regularizer with L2 for the recognition task.This work presents an exhaustive analysis to prove the efficiency of the proposed SmokerViT model.The performance of the proposed SmokerViT performance is evaluated and compared with the existing methods,where it achieves an overall classification accuracy of 97.77%,with 98.21%recall and 97.35%precision,outperforming the state-of-the-art deep learning models,including convolutional neural networks(CNNs)and other vision transformer-based models.

关键词：Smoker recognition SmokerViT deep learning transformer for vision

分类号：TP181[自动化与计算机技术—控制理论与控制工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

SmokerViT: A Transformer-Based Method for Smoker Recognition

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

SmokerViT: A Transformer-Based Method for Smoker Recognition

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索