检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:李元 李睿 林金山 金凌峰 邵先军 张冠军[1] LI Yuan;LI Rui;LIN Jinshan;JIN Lingfeng;SHAO Xianjun;ZHANG Guanjun(School of Electrical Engineering,Xi′an Jiaotong University,Xi′an 710049,China;State Grid Zhejiang Electric Power Co.,Ltd.Research Institute,Hangzhou 310014,China)
机构地区:[1]西安交通大学电气工程学院,陕西西安710049 [2]国网浙江省电力有限公司电力科学研究院,浙江杭州310014
出 处:《电力工程技术》2024年第6期153-162,共10页Electric Power Engineering Technology
基 金:国家自然科学基金资助项目(52107165)。
摘 要:变压器运维管理中积累了海量以文本形式记录的非结构化缺陷数据,但缺乏有效挖掘手段导致其利用率极低。文中提出一种基于字词混用集成模型的变压器缺陷记录文本挖掘方法,首先对变压器缺陷文本进行文本分词、去除停用词、文本增强、文本特征表示等预处理,以文本数学向量形式为输入,集成多个词汇级和字符级分类模型,通过元学习器对各基学习器性能的协同互补作用,实现变压器缺陷类型的准确识别和分类。与单一文本分类算法相比,该方法能够更全面地获得文本的语义特征,分类精确率达91%,模型准确率和召回率的综合评价分数F 1=0.9。将自然语言处理技术应用于电力设备缺陷记录文本,可以实现精准高效分类和故障识别,唤醒数据资源,显著提升电力变压器智能化管理水平。The operation and maintenance management of transformers has accumulated a large amount of unstructured defect recording data in the form of text.However,the lack of effective mining method has led to an extremely low utilization rate.A text mining method for transformer defect recording text based on a character-word level ensemble integrated model is proposed in this paper.Firstly,the transformer defect recording texts are preprocessed with text segmentation,stop word removal,text augmentation,and text feature representation to convert the data into mathematical vectors for input.By integrating multiple word-and character-level classification models,the method can realize accurate identification and classification of transformer defect types through the synergistic and complementary effects of meta-learners on the individual base learners.Compared to single-text classification algorithms,this method can obtain the semantic features of the text more comprehensively,achieving a classification precision of 91%and F 1 score of 0.9,which is the comprehensive evaluation score for model precision and recall.By applying natural language processing technology to precise power equipment defect recoding text classification and efficient fault recognition,data resources are awakened,and the intelligent management level of power transformers is significantly improved.
关 键 词:电力变压器 自然语言处理 文本挖掘 故障诊断 集成学习 人工智能
分 类 号:TM8[电气工程—高电压与绝缘技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.90