基于字词混用集成模型的电力变压器缺陷记录文本挖掘方法  

Character-word level ensemble integrated model for power transformer defect recording text mining method

在线阅读下载全文

作  者:李元 李睿 林金山 金凌峰 邵先军 张冠军[1] LI Yuan;LI Rui;LIN Jinshan;JIN Lingfeng;SHAO Xianjun;ZHANG Guanjun(School of Electrical Engineering,Xi′an Jiaotong University,Xi′an 710049,China;State Grid Zhejiang Electric Power Co.,Ltd.Research Institute,Hangzhou 310014,China)

机构地区:[1]西安交通大学电气工程学院,陕西西安710049 [2]国网浙江省电力有限公司电力科学研究院,浙江杭州310014

出  处:《电力工程技术》2024年第6期153-162,共10页Electric Power Engineering Technology

基  金:国家自然科学基金资助项目(52107165)。

摘  要:变压器运维管理中积累了海量以文本形式记录的非结构化缺陷数据,但缺乏有效挖掘手段导致其利用率极低。文中提出一种基于字词混用集成模型的变压器缺陷记录文本挖掘方法,首先对变压器缺陷文本进行文本分词、去除停用词、文本增强、文本特征表示等预处理,以文本数学向量形式为输入,集成多个词汇级和字符级分类模型,通过元学习器对各基学习器性能的协同互补作用,实现变压器缺陷类型的准确识别和分类。与单一文本分类算法相比,该方法能够更全面地获得文本的语义特征,分类精确率达91%,模型准确率和召回率的综合评价分数F 1=0.9。将自然语言处理技术应用于电力设备缺陷记录文本,可以实现精准高效分类和故障识别,唤醒数据资源,显著提升电力变压器智能化管理水平。The operation and maintenance management of transformers has accumulated a large amount of unstructured defect recording data in the form of text.However,the lack of effective mining method has led to an extremely low utilization rate.A text mining method for transformer defect recording text based on a character-word level ensemble integrated model is proposed in this paper.Firstly,the transformer defect recording texts are preprocessed with text segmentation,stop word removal,text augmentation,and text feature representation to convert the data into mathematical vectors for input.By integrating multiple word-and character-level classification models,the method can realize accurate identification and classification of transformer defect types through the synergistic and complementary effects of meta-learners on the individual base learners.Compared to single-text classification algorithms,this method can obtain the semantic features of the text more comprehensively,achieving a classification precision of 91%and F 1 score of 0.9,which is the comprehensive evaluation score for model precision and recall.By applying natural language processing technology to precise power equipment defect recoding text classification and efficient fault recognition,data resources are awakened,and the intelligent management level of power transformers is significantly improved.

关 键 词:电力变压器 自然语言处理 文本挖掘 故障诊断 集成学习 人工智能 

分 类 号:TM8[电气工程—高电压与绝缘技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象