基于集成学习优化的肉制品安全风险等级预警分析  被引量:4

Safety Risk Level Early Warning of Meat Products Based on Optimized Ensemble Learning

在线阅读下载全文

作  者:穆书敏 陈锂 尹佳 郭鹏程 陈晨 董曼 赵锦 徐晴雪 文红 桂预风[1] MU Shumin;CHEN Li;YIN Jia;GUO Pengcheng;CHEN Chen;DONG Man;ZHAO Jin;XU Qingxue;WEN Hong;GUI Yufeng(School of Sciences,Wuhan University of Technology,Wuhan 430070,China;Hubei Provincial Institute for Food Supervision and Test,Key Laboratory of Detection Technology of Focus Chemical Hazards in Animal-derived Food for State Market Regulation,Hubei Provincial Engineering and Technology Research Center for Food Quality and Safety Test,Wuhan 430075,China;Shishou City Public Inspection Test Centre,Jingzhou 434200,China)

机构地区:[1]武汉理工大学理学院,湖北武汉430070 [2]湖北省食品质量安全监督检验研究院,湖北省食品质量安全检测工程技术研究中心,国家市场监管重点实验室(动物源性食品中重点化学危害物检测技术),湖北武汉430075 [3]石首市公共检验检测中心,湖北荆州434200

出  处:《现代食品科技》2023年第8期273-286,共14页Modern Food Science and Technology

基  金:国家重点研发计划项目(2018YFC1603602)。

摘  要:该研究依据2013~2017年肉制品抽检数据构造了5个安全风险等级,使用特征构造及独热编码进一步关联与肉制品安全相关的影响因素,构建极端梯度提升树算法(Extreme Gradient Boosting,XGBOOST)研究食品生产过程各类因素对于食品安全风险等级的影响程度,并使用多个指标评价模型。此外通过上采样解决样本不平衡问题、贝叶斯优化调节超参数,来提高模型性能及分类效果。相较于模型决策树(Decision Tree,DT)和随机森林(Random Forest,RF),XGBOOST模型在肉制品安全风险等级分类中的表现效果最佳。研究结果表明,食品生产过程环节错综复杂,使用one-hot encoding处理后的模型能够有效判断出各类因素对于食品安全风险等级的影响程度,集成模型中RF的学习效果比较稳定,XGBOOST经过参数调节后准确率等指标得到有效的提升且优于RF。不同采样下XGBOOST的平均精确率均能达到89.14%,平均F1值为88.59%,说明XGBOOST在肉制品安全风险等级预警中适用性,为日常抽检提供技术指导。Five safety risk levels were established based on the detection data of meat products sampled between 2013 and 2017.Feature construction and one-hot encoding were used to further correlate factors relevant to meat product safety.An extreme gradient boosting(XGBOOST)model was established to study the influence levels of various factors during food production on the safety risk level;subsequently then multiple indices were used to evaluate the model.In addition,sample imbalance problem was solved by upsampling,and the hyperparameters were adjusted by Bayesian optimization to improve the model performance and classification results.Simultaneously,the model constructed was compared with the decision tree(DT)and random forest(RF)methods to evaluate their classification performance.The XGBOOST outperformed others in classifying the safety risk levels of meat products.Food production processes are complex,and this study shows that model processing with one-hot encoding could effectively identify the influence levels of various factors on food safety.Moreover,the result suggested that XGBOOST performs better in terms of total accuracy and other indices after parameter adjustment,compared to the RF model,while RF had the most stable learning performance.The average of accuracy and F1 score can reach 89.14%and 88.59%,respectively,under different sampling.The results suggest that XGBOOST can be applied to determine safety risk levels of meat products and provide technical support for daily supervision.

关 键 词:食品安全风险 独热编码 决策树 集成学习 极端梯度提升树 随机森林 

分 类 号:TS251.5[轻工技术与工程—农产品加工及贮藏工程] TS201.6[轻工技术与工程—食品科学与工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象