Crash-causing information extraction via text-mining techniques: implementation of the Chinese state-related crash narratives


Authors: Guoqing Zou, Helai Huang, Hanchu Zhou, Jipu Li

Affiliation: [1] School of Traffic and Transportation Engineering, Central South University, Changsha, Hunan 410075, China

Source: Transportation Safety and Environment (English-language journal), 2024, Issue 4, pp. 95-104 (10 pages)

Abstract: Crash data is the foundation of traffic safety analysis, helping experts identify the causes of crashes and propose corresponding countermeasures. In China, the accident reporting form allows only one crash cause to be reported for each crash, chosen from prespecified crash cause codes. This designation may lead to inaccurate crash records, especially for state-related crashes. Crash narratives, the responding officer’s written account of what occurred before, during and after a crash, contain considerable free-form information associated with the crash occurrence. This study investigated the directly contributory factors behind state-related crashes by developing natural language processing and deep-learning models based on 1625 state-related crash narratives. According to the directly causative factors described in the narratives, state-related crashes were labelled as speed related, turning related or other causes. The crash narratives were then vectorized for model training and frequency analysis. Text-CNN, LSTM, GRU and SVM models were applied to reclassify the vectorized crash narratives. The text-CNN model achieved the best text classification performance, with an AUC of 0.90 for the micro-average curve. These results can encourage the use of crash narratives and help identify the actual causes hidden behind some inaccurate crash value designations.
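The abstract reports a micro-average AUC of 0.90 for the text-CNN model. As a rough illustration of what that metric measures, and not the authors' implementation, the sketch below flattens the one-vs-rest decisions for the three labels named in the abstract into a single binary problem and computes the AUC as a rank statistic. The function names and the toy scores are hypothetical; only the class labels come from the abstract.

```python
from typing import List, Sequence

# Class labels follow the abstract; everything else here is illustrative.
CLASSES = ["speed related", "turning related", "other causes"]


def binary_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outranks a random negative.

    Ties count as half a win (the Mann-Whitney U formulation).
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def micro_average_auc(true_classes: Sequence[int],
                      class_scores: Sequence[Sequence[float]],
                      n_classes: int) -> float:
    """Pool the one-vs-rest decisions of every class into one binary problem."""
    flat_labels: List[int] = []
    flat_scores: List[float] = []
    for true_class, scores in zip(true_classes, class_scores):
        for k in range(n_classes):
            flat_labels.append(1 if k == true_class else 0)
            flat_scores.append(scores[k])
    return binary_auc(flat_labels, flat_scores)


# Hypothetical scores: one row per narrative, one column per class.
toy_true = [0, 1, 2, 0]
toy_scores = [
    [0.7, 0.2, 0.1],
    [0.3, 0.5, 0.2],
    [0.2, 0.2, 0.6],
    [0.4, 0.5, 0.1],
]
toy_auc = micro_average_auc(toy_true, toy_scores, len(CLASSES))
print(f"micro-average AUC on toy data: {toy_auc:.3f}")
```

Because micro-averaging pools every (narrative, class) pair before ranking, frequent classes weigh more heavily in the result than rare ones. The pairwise loop is quadratic and is meant for readability on small inputs only.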

Keywords: text analysis; state-related crash; variable designation; deep learning; crash narratives

Classification code: U49 [Transportation Engineering: Transportation Planning and Management]

 
