检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:石岩[1] 李宁 魏锋[1] 马双成[1] SHI Yan;LI Ning;WEI Feng;MA Shuang-cheng(National Institutes for Food and Drug Control,Beijing 102629,China;Beijing Institute for Drug Control,Beijing 102206,China)
机构地区:[1]中国食品药品检定研究院,北京102629 [2]北京市药品检验研究院,北京102206
出 处:《药物分析杂志》2024年第5期866-873,共8页Chinese Journal of Pharmaceutical Analysis
摘 要:目的:建立以黄酮类成分为特征的栽培黄芪、半野生黄芪和野生黄芪的三分类模型,并且对自动机器学习技术和数据增强技术在药物分析领域中的应用进行探索和评价。方法:首先,对黄芪的黄酮类成分含量数据进行相关性分析、主成分分析,建立决策树和逻辑回归模型,根据模型分析黄酮类成分的重要性程度;然后,使用TVAE表格数据生成算法,根据真实数据生成600批虚拟数据,使用自动学习框架AutoGluon,num_bag_folds设为5,分别对64批真实数据和600批虚拟数据进行学习,得到2组共30个模型,依据准确率进行评估。结果:对机器学习模型的分析可知,芒柄花素、毛蕊异黄酮葡萄糖苷和刺芒柄花苷这3种黄酮类成分对于黄芪质量,尤其是来源等级的控制具有重要意义;2组共30个模型预测准确率表明,基于NeuralNet的模型和基于树模型的机器学习算法对于黄酮成分数据表征的黄芪而言分类效果最好;数据增强技术生成的虚拟数据与真实数据在所训练得到的模型准确率趋势方面基本一致。结论:机器学习相关技术在以黄酮为特征的黄芪分类中具有较好的应用价值。Objective:To establish a three classification model for cultivated,semi-wild,and wild Astragali Radix characterized by flavonoids,and explore and evaluate the application of techniques of automated machine learning and data augmentation in the field of drug analysis.Methods:Firstly,correlation analysis and principal component analysis were conducted on the flavonoid content data of Astragali Radix,and models of decision tree and logistic regression were established to analyze the importance of flavonoid components based on the models.Then,using the AutoGluon framework with 5 as num_bag_folds,2 sets of 30 models respectively through 64 batches of real data and 600 batches of virtual data generated based on real data with the TVAE table data generation algorithm for training were obtained,and these models were evaluated by accuracy.Results:The analysis of machine learning models,indicated that formononetin,campanulin and onospin played the important roles in the quality control of Astragali Radix,especially for the source grade control.The accuracy of model prediction showed that the models based on Neural Net and tree-model always had the best classification effect for Astragali Radix.The virtual data generated by data augmentation technique is basically consistent with the actual data in terms of the accuracy trend of the model training process.Conclusion:Related techniques of machine learning have good application value in the classification of Astragali Radix characterized by flavonoids.
关 键 词:黄芪 黄酮 毛蕊异黄酮葡萄糖苷 刺芒柄花苷 毛蕊异黄酮 山柰酚 异鼠李素 芒柄花素 机器学习 人工智能 数据增强
分 类 号:R917[医药卫生—药物分析学]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.3