虚拟样本生成方法及其在重整数据建模中的应用  被引量:3

VIRTUAL SAMPLE GENERATION METHOD AND ITS APPLICATION IN REFORMING DATA MODELING

在线阅读下载全文

作  者:贺许龙 张蕾[1] 周涵[1] 王鑫磊[1] 苗准 He Xulong;Zhang Lei;Zhou Han;Wang Xinlei;Miao Zhun(SINOPEC Research Institute of Petroleum Processing,Beijing 100083)

机构地区:[1]中国石化石油化工科学研究院,北京100083

出  处:《石油炼制与化工》2021年第6期92-95,共4页Petroleum Processing and Petrochemicals

基  金:国家重点研发计划资助项目(2017YFB0306501)。

摘  要:采用催化重整装置的工业原料组成数据训练产品预测决策树回归模型。由于工业数据样本范围比较集中,利用该模型在预测芳烃收率时,会存在过拟合现象,造成其适用性较差,因而借助多元高斯概率分布方法构建重整进料虚拟样本,并利用HYSYS机理模型计算虚拟进料样本对应的芳烃收率数据,改进工业数据常见的小样本问题。结果表明,将虚拟数据与真实数据混合用于决策树回归模型的训练后,模型对检验样本的平均绝对误差由1.4097降至0.6318,说明虚拟样本可以用于模型训练,提升了数据驱动模型的适用性。The yield of aromatics was predicted based on a Decision Tree Regression model,which was trained using actual feed composition data from a continuous reformer.The relatively concentrated sample range of industrial data can lead to over-fitting which limits the model’s application.The small sample issue can be seen as a common problem when dealing with industrial data.A virtual sample of reforming feed was constructed with Multivariate Gaussian probability distribution method,and the corresponding aromatics yield was simulated with HYSYS mechanism model to improve the problem mentioned above.After the Decision Tree Regression model training with feed composition mixed virtual data and real data,the mean absolute error of the test sample was reduced from 1.4097 to 0.6318,which proves that virtual samples can be used for model training to expand the application of data-driven models.

关 键 词:重整工艺数据 虚拟样本 高斯分布 HYSYS模拟 

分 类 号:TE624.42[石油与天然气工程—油气加工工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象