Discovering latent themes in aviation safety reports using text mining and network analytics  


Authors: Yingying Xing, Yutong Wu, Shiwen Zhang, Ling Wang, Haoyuan Cui, Bo Jia, Hongwei Wang

Affiliations: [1] The Key Laboratory of Road and Traffic Engineering of Ministry of Education, Tongji University, Shanghai 201804, China; [2] Institute of Safety Operation Research Institute, China Eastern Technology Application R&D Center, Shanghai 201707, China; [3] Institute of High Performance Computing (IHPC), Agency for Science, Technology and Research (A*STAR), Singapore 138632, Singapore

Source: International Journal of Transportation Science and Technology (交通科学与技术(英文)), 2024, Issue 4, pp. 292-316 (25 pages)

Abstract: Aviation accidents, referring to unexpected and undesirable events involving aircraft, often cause great damage to property and human life. Learning from historical accidents is pivotal for improving safety in aviation. However, aviation accidents are typically documented and stored as unstructured or semi-structured free text, which makes such data difficult to analyze. This study presents a novel framework that combines text mining and network analytics techniques to analyze aviation accident reports automatically. The framework comprises a four-step modelling approach: (1) transforming unstructured aviation safety report texts into structured numeric matrices using TF-IDF; (2) identifying aviation accident topics using a structural topic model (STM); (3) producing a word co-occurrence network (WCN) to determine the interrelations between aviation safety risk factors; and (4) quantitatively analyzing keywords to pinpoint key causal factors in aviation safety events. The proposed framework is validated by analyzing aviation accident reports collected by the National Transportation Safety Board (NTSB). The results indicate that STM provides a more granular partitioning of topics and better distinguishes between similar events than traditional latent Dirichlet allocation (LDA). Among the identified topics, "Fuel and Power" and "En-route Phase" have the highest occurrence rate according to STM. Additionally, "Aircraft Crash" is the most prevalent topic in aviation accidents that resulted in fatal injuries, whereas the "Landing phase" is the most prevalent topic in accidents with nonfatal injuries. Based on the WCN, three centrality measures highlight "inspection of equipment" and "take off" as the most important risk factors in aviation safety. The proposed framework provides a comprehensive solution for in-depth analysis of aviation safety reports, offering decision support for aviation safety management and accident prevention, thereb…
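As a rough illustration of steps (1) and (3) of the framework, the following minimal sketch computes TF-IDF weights and a document-level word co-occurrence network in plain Python. It is not the authors' implementation: the toy reports are invented (not NTSB data), the function names are hypothetical, the TF-IDF variant (term frequency times log(N/df)) is one common choice, and degree centrality is assumed here as a stand-in, since the abstract does not name its three centrality measures. The STM step is not reproduced.

```python
import math
from collections import Counter
from itertools import combinations

# Toy, invented report snippets (not NTSB data), already lowercased and tokenized.
reports = [
    ["engine", "power", "loss", "fuel"],
    ["fuel", "exhaustion", "engine", "landing"],
    ["landing", "gear", "collapse"],
]

def tf_idf(docs):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights

def cooccurrence_edges(docs):
    """Count in how many documents each unordered word pair appears together."""
    edges = Counter()
    for doc in docs:
        for a, b in combinations(sorted(set(doc)), 2):
            edges[(a, b)] += 1
    return edges

def degree_centrality(edges):
    """Normalized degree: distinct neighbours / (number of nodes - 1)."""
    neighbours = {}
    for a, b in edges:
        neighbours.setdefault(a, set()).add(b)
        neighbours.setdefault(b, set()).add(a)
    n = len(neighbours)
    return {word: len(nb) / (n - 1) for word, nb in neighbours.items()}

weights = tf_idf(reports)
edges = cooccurrence_edges(reports)
centrality = degree_centrality(edges)
```

In this toy corpus, a term that appears in a single report (such as "power") receives a higher TF-IDF weight than one shared across reports (such as "engine"), while words that co-occur with many others (such as "engine", "fuel", and "landing") rank highest on degree centrality.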

Keywords: Aviation safety; Aviation accident report; Text mining; Topic modeling; Network analysis

Classification code: F56 [Economics and Management: Industrial Economics]

 
