融合LDA和CNN的施工触电事故原因识别和预判被引量：1

Identifying and predicting the causes of construction electric shock accidents through integration of LDA and CNN

作　　者：李珏[1,2] 潘悦吴畅 LI Jue;PAN Yue;WU Chang(School of Traffic and Transportation Engineering,Changsha University of Science&Technology,Changsha 410075,China;Intelligent Construction,Operation&Maintenance Management of Transportation Infrastructure Key College Laboratory of Hunan Province,Changsha University of Science&Technology,Changsha 410075,China)

机构地区：[1]长沙理工大学交通运输工程学院,长沙410075 [2]长沙理工大学交通基础设施智慧建造与运维管理湖南省高等学校重点实验室,长沙410075

出　　处：《安全与环境学报》2024年第10期3973-3981,共9页Journal of Safety and Environment

基　　金：湖南省自然科学基金项目(2021JJ30744);湖南省教育厅项目(20K011)。

摘　　要：鉴于施工触电事故具有突发性强、致死率高的特点,为了有效辅助事故原因的调查,首先,对318份施工触电事故进行预处理,运用词频-逆文档频率(Term Frequency-Inverse Document Frequency,TF-IDF)关键词处理算法和可视化技术提取关键信息。其次,通过狄利克雷主题模型(Latent Dirichlet Allocation,LDA)提取原因主题词条,并根据关键信息构建相应的原因主题标签。随后,运用Word2Vec模型将“事故经过”和“主题标签”转化为词向量矩阵,并输入卷积神经网络(Convolutional Neural Networks,CNN)模型中,利用CNN模型数据预测的特征,实现事故原因的预判。最后,对比分析CNN模型与其他两种经典模型的预判效果。试验结果表明,该方法能够在事故调查完成前,较准确地预判事故可能原因。该模型构建的事故原因库,可以为事故预防提供一定参考,模型可以作为辅助事故实际调查的有效手段。To enhance the investigation of accident causes,this paper proposes a comprehensive model that integrates Latent Dirichlet Allocation(LDA),Word2Vec,and Convolutional Neural Networks(CNN).This combined approach aims to effectively assist in identifying and analyzing accident causes.First,unstructured accident reports undergo preprocessing,including data cleaning,word tokenization,and stop-word removal.Key information is extracted and visualized using Term Frequency-Inverse Document Frequency(TF-IDF).Next,the optimal number of topics(K-value) is determined based on perplexity and coherence measures.LDA is then applied to cluster the accident reports into K distinct topics related to accident causes.Labels for these topics are assigned by referencing both the extracted key information from the reports and summaries of causes found in relevant literature.Finally,an accident cause label repository is established.The accident cause labels are determined based on key information extracted from accident reports and summaries found in references,establishing an accident cause label repository.Following this,"accident histories" are extracted from reports using regular expressions.The 318 accident records are partitioned into 9:1 training and testing sets.The “accident histories” in the training set are paired with their respective cause labels.Afterwards,the data from the 318 accidents are split into a 9:1 ratio for training and testing purposes.The “accident history” along with its corresponding cause label values in the training set are used to construct a word vector matrix using Word2Vec for training the CNN model.Subsequently,the “accident history” from the test set is inputted into the CNN model to predict the corresponding cause label value,thereby determining the predicted cause of the accident.Finally,to evaluate the effectiveness and accuracy of the CNN model,its output was compared with those of the Support Vector Machine(SVM) and Naive Bayes(NB) models using three major metrics.The CNN model achiev

关键词：安全工程施工触电事故事故原因狄利克雷主题模型(LDA) Word2Vec模型卷积神经网络(CNN)

分类号：X947[环境科学与工程—安全科学]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

融合LDA和CNN的施工触电事故原因识别和预判被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

融合LDA和CNN的施工触电事故原因识别和预判 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

融合LDA和CNN的施工触电事故原因识别和预判被引量：1