基于机器学习的环氧合酶-2抑制剂分类模型的构建  

onstruction of a Classification Model for Cyclooxygenase-2 Inhibitors based on Machine Learning

在线阅读下载全文

作  者:萧耿苗 穆云萍 千爱君 李芳红 赵子建 XIAO Geng-miao;MU Yun-ping;QIAN Ai-jun;LI Fang-hong;ZHAO Zi-jian(School of Biomedical and Pharmaceutical Sciences,Guangdong University of Technology,Guangzhou,Guangdong,510006,China)

机构地区:[1]广东工业大学生物医药学院,广东广州510006

出  处:《现代生物医学进展》2024年第4期606-611,共6页Progress in Modern Biomedicine

基  金:国家重点研发计划项目(2018YFA0800603);广东省"珠江人才计划"项目(2016ZT06Y432);广东省重点领域研发计划项目(2019B020201015)。

摘  要:目的:构建环氧合酶-2(Cyclooxygenase-2,COX-2)抑制剂分类模型,用以筛选和优化COX-2抑制剂。方法:基于八种机器学习算法构建模型,比较不同模型的预测性能,筛选出最优模型后利用Y随机验证法对其进行测试,最后运用SHAP(Shapley Additive eXplanation)算法对最优模型进行可解释性分析。结果:八种不同模型的性能比较结果显示,基于随机森林算法建立的模型最优,其预测准确率、平衡准确率、马修斯相关系数、特征曲线下面积和F1分数(分别为0.893、0.825、0.673、0.909和0.933)最高;Y随机验证结果表明最优模型的预测结果并非偶然;此外,通过SHAP算法挖掘出20个最有可能影响COX-2抑制剂活性的结构片段。结论:本研究为新型COX-2抑制剂的开发提供理论依据,可供本领域其他研究人员对先导化合物进行优化或设计更好的COX-2抑制剂。Objective:This study aims to develop a classification model for cyclooxygenase-2(COX-2)inhibitors for the purpose of screening and optimizing COX-2 inhibitors.Methods:Eight machine learning algorithms were used to construct models,and their predictive performance was compared to identify the best model.The optimal model was tested by using Y-scrambling validation method,finally the interpretability analysis of the optimal model was performed by using Shapley Additive eXplanation(SHAP)algorithm.Results:Among the eight different models compared,the Random Forest algorithm exhibited the best performance.With the highest accuracy,balanced accuracy,Matthew's correlation coefficient,area under the ROC curve,and F1 scores(0.893,0.825,0.673,0.909 and 0.933,respectively),it comes out on top.Validation with Y-scrambling showed that the predictions of the optimal model were not coincidence.Moreover,the SHAP algorithm was used to mine 20 structural fragments that could affect COX-2 inhibitor activity.Conclusion:In this study,we developed a theoretical basis for developing COX-2 inhibitors,which is useful to other researchers in this field when optimizing lead compounds and designing new COX-2 inhibitors.

关 键 词:COX-2抑制剂 机器学习 可解释性 重要结构片段 

分 类 号:R-33[医药卫生] Q55[生物学—生物化学] Q811R914

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象