Incorporating empirical knowledge into data-driven variable selection for quantitative analysis of coal ash content by laser-induced breakdown spectroscopy  被引量:1

在线阅读下载全文

作  者:吕一涵 宋惟然 侯宗余 王哲 Yihan LYU;Weiran SONG;Zongyu HOU;Zhe WANG(State Key Laboratory of Power System Operation and Control,International Joint Laboratory on Low Carbon Clean Energy Innovation,Department of Energy and Power Engineering,Tsinghua University,Beijing 100084,People’s Republic of China;School of Energy and Electrical Engineering,Qinghai University,Xining 810016,People’s Republic of China;Tsinghua University,Shanxi Research Institute Clean Energy,Taiyuan 030032,People’s Republic of China)

机构地区:[1]State Key Laboratory of Power System Operation and Control,International Joint Laboratory on Low Carbon Clean Energy Innovation,Department of Energy and Power Engineering,Tsinghua University,Beijing 100084,People’s Republic of China [2]School of Energy and Electrical Engineering,Qinghai University,Xining 810016,People’s Republic of China [3]Tsinghua University,Shanxi Research Institute Clean Energy,Taiyuan 030032,People’s Republic of China

出  处:《Plasma Science and Technology》2024年第7期148-156,共9页等离子体科学和技术(英文版)

基  金:financial supports from National Natural Science Foundation of China(No.62205172);Huaneng Group Science and Technology Research Project(No.HNKJ22-H105);Tsinghua University Initiative Scientific Research Program and the International Joint Mission on Climate Change and Carbon Neutrality。

摘  要:Laser-induced breakdown spectroscopy(LIBS)has become a widely used atomic spectroscopic technique for rapid coal analysis.However,the vast amount of spectral information in LIBS contains signal uncertainty,which can affect its quantification performance.In this work,we propose a hybrid variable selection method to improve the performance of LIBS quantification.Important variables are first identified using Pearson's correlation coefficient,mutual information,least absolute shrinkage and selection operator(LASSO)and random forest,and then filtered and combined with empirical variables related to fingerprint elements of coal ash content.Subsequently,these variables are fed into a partial least squares regression(PLSR).Additionally,in some models,certain variables unrelated to ash content are removed manually to study the impact of variable deselection on model performance.The proposed hybrid strategy was tested on three LIBS datasets for quantitative analysis of coal ash content and compared with the corresponding data-driven baseline method.It is significantly better than the variable selection only method based on empirical knowledge and in most cases outperforms the baseline method.The results showed that on all three datasets the hybrid strategy for variable selection combining empirical knowledge and data-driven algorithms achieved the lowest root mean square error of prediction(RMSEP)values of 1.605,3.478 and 1.647,respectively,which were significantly lower than those obtained from multiple linear regression using only 12 empirical variables,which are 1.959,3.718 and 2.181,respectively.The LASSO-PLSR model with empirical support and 20 selected variables exhibited a significantly improved performance after variable deselection,with RMSEP values dropping from 1.635,3.962 and 1.647 to 1.483,3.086 and 1.567,respectively.Such results demonstrate that using empirical knowledge as a support for datadriven variable selection can be a viable approach to improve the accuracy and reliability of LIBS quantification.

关 键 词:laser-induced breakdown spectroscopy(LIBS) coal ash content quantitative analysis variable selection empirical knowledge partial least squares regression(PLSR) 

分 类 号:TN2[电子电信—物理电子学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象