基于改进LDA的水电工程进度管理文本智能分析  被引量:6

Intelligent text analysis of hydropower project progress management based on improved LDA

在线阅读下载全文

作  者:李明超[1] 吕沅庚 田丹 沈扬 LI Mingchao;LV Yuangeng;TIAN Dan;SHEN Yang(State Key Laboratory of Hydraulic Engineering Simulation and Safety,Tianjin University,Tianjin 300350;China Three Gorges Corporation,Beijing 100038)

机构地区:[1]天津大学水利工程仿真与安全国家重点实验室,天津300350 [2]中国长江三峡集团有限公司,北京100038

出  处:《水力发电学报》2022年第3期133-141,共9页Journal of Hydroelectric Engineering

基  金:国家自然科学基金(52179139);湖北省水电工程施工与管理重点实验室开放基金(2020KSD05)。

摘  要:进度控制是水电工程管理的重要任务,及时总结进度管理信息有助于工程进度计划的制定与调整。水电工程建设中的进度信息多以半结构化、非结构化的文本形式呈现,增加了信息提取难度,实现水电工程进度文本信息自动化与智能化挖掘是当前亟待解决的问题。本文提出基于改进LDA的水电工程进度信息智能提取方法,智能提取进度管理文本中的关键信息。该方法基于传统LDA模型针对吉布斯采样机制,充分考虑词语间的关联关系,将原有随机单个采样过程改进为以共现度为基准的词对采样,强化了词语间的语义关联,提高了主题词语间的紧密性以及主题词语对主题描述的准确性。将所提出的方法应用于实际水电工程,对221份水电工程施工监理周报进行分析,共提取12个主题的工序关键词,并依照计算结果提取出主副工序;结果表明,改进LDA主题模型在水电工程进度文本工序特征词提取效果优于传统LDA主题模型,有助于提高工程施工进度关键工序词提取与信息挖掘效率,为水电工程施工智能化管理提供了新的手段。Schedule control is a key task of hydropower project management;A timely summary of schedule management information helps formulate and adjust the project schedules.In hydropower project construction,progress information is often presented in semi-structured or unstructured text forms,thereby inducing a difficulty in information extraction.An urgent issue is how to realize automation and intelligent mining of the text information of hydropower project progress.This paper presents a new intelligent extraction method of hydropower project schedule information based on an improved LDA method for intelligent extraction of the key information from schedule management texts.This method,based on the Gibbs sampling mechanism of the traditional LDA model,takes full consideration of the association relationship between words,and improves semantic association between words,closeness between words,and accuracy in the description of topic words.It is applied to practical hydropower projects to analyze 221 weekly reports on construction supervision,and extracted the key words of 12 themes,with the main and secondary processes extracted through calculations.Results show that our improved LDA topic model is better than the traditional LDA and it helps improve word extraction efficiency and information mining efficiency for hydropower construction.

关 键 词:水电工程 施工进度 关键词提取 改进LDA主题模型 共现度 文本智能分析 

分 类 号:TV512[水利工程—水利水电工程] TP391.1[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象