机构地区:[1]Technical Faculty,Singidunum University,Belgrade,11000,Serbia [2]Informatics and Computing,Singidunum University,Belgrade,11000,Serbia [3]Business Economics,Singidunum University,Belgrade,11000,Serbia [4]Computing and Informatics,Sinergija University,Bijeljina,76300,Bosnia and Herzegovina [5]Department for Information Systems and Technologies,University“Union Nikola Tesla”,Cara Dusana,Belgrade,11080,Serbia [6]Department for Computer Science and Informatics,School of Electrical Engineering,University of Belgrade,Belgrade,11000,Serbia [7]Department of Electrical and Electronics Engineering,Kongu Engineering College(Autonomous),Perundurai,Erode,638060,India [8]Department of Mathematics,Saveetha School of Engineering(Deemed to be University),SIMATS Thandalam,Chennai,602105,India [9]MEU Research Unit,Middle East University,Amman,11831,Jordan
出 处:《Computers, Materials & Continua》2024年第9期4997-5027,共31页计算机、材料和连续体(英文)
基 金:supported by the Science Fund of the Republic of Serbia,Grant No.7373;Characterizing Crises-Caused Air Pollution Alternations Using an Artificial Intelligence-Based Framework-crAIRsis and Grant No.7502;Intelligent Multi-Agent Control and Optimization applied to Green Buildings and Environmental Monitoring Drone Swarms-ECOSwarm.
摘 要:Cyberbullying is a form of harassment or bullying that takes place online or through digital devices like smartphones,computers,or tablets.It can occur through various channels,such as social media,text messages,online forums,or gaming platforms.Cyberbullying involves using technology to intentionally harm,harass,or intimidate others and may take different forms,including exclusion,doxing,impersonation,harassment,and cyberstalking.Unfortunately,due to the rapid growth of malicious internet users,this social phenomenon is becoming more frequent,and there is a huge need to address this issue.Therefore,the main goal of the research proposed in this manuscript is to tackle this emerging challenge.A dataset of sexist harassment on Twitter,containing tweets about the harassment of people on a sexual basis,for natural language processing(NLP),is used for this purpose.Two algorithms are used to transform the text into a meaningful representation of numbers for machine learning(ML)input:Term frequency inverse document frequency(TF-IDF)and Bidirectional encoder representations from transformers(BERT).The well-known eXtreme gradient boosting(XGBoost)ML model is employed to classify whether certain tweets fall into the category of sexual-based harassment or not.Additionally,with the goal of reaching better performance,several XGBoost models were devised conducting hyperparameter tuning by metaheuristics.For this purpose,the recently emerging Coyote optimization algorithm(COA)was modified and adjusted to optimize the XGBoost model.Additionally,other cutting-edge metaheuristics approach for this challenge were also implemented,and rigid comparative analysis of the captured classification metrics(accuracy,Cohen kappa score,precision,recall,and F1-score)was performed.Finally,the best-generated model was interpreted by Shapley additive explanations(SHAP),and useful insights were gained about the behavioral patterns of people who perform social harassment.
关 键 词:Coyote optimization algorithm NLP TF-IDF BERT XGBoost online harassment and cyberbullying metaheuristics
分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...