检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:任海斌 冯宝龙[2] 范蓓[3] 贺斌彬 李知陆 王清华 高飞[2] 王玉堂[1,3] Ren Haibin;Feng Baolong;Fan Bei;He Binbin;Li Zhilu;Wang Qinghua;Gao Fei;Wang Yutang(Key Laboratory of Dairy Science,Ministry of Education,Northeast Agricultural University,Harbin 150030,China;Center for Education Technology,Northeast Agricultural University,Harbin 150030,China;Institute of Food Science and Technology,Chinese Academy of Agricultural Sciences,Beijing 100193,China)
机构地区:[1]东北农业大学乳品科学教育部重点实验室,哈尔滨150030 [2]东北农业大学现代教育技术中心,哈尔滨150030 [3]中国农业科学院农产品加工研究所,北京100193
出 处:《农业工程学报》2021年第19期303-308,共6页Transactions of the Chinese Society of Agricultural Engineering
基 金:国家重点研发计划项目(2019YFF0217601-02);中国农业科学院农产品加工研究所知识创新计划(125161015000150013)。
摘 要:食品工业一直在积极地发现新的甜味分子,传统发掘方法费时费力,效率较低。该研究基于分子的甜味和分子结构相关的假设,利用文献、专利及数据库中的数据,建立甜味、非甜味分子数据集和甜度分子数据集,采用随机森林和支持向量机算法建立定性构效关系模型定性预测甜味分子;采用主成分回归、k最邻近回归、随机森林回归和偏最小二乘回归四种算法建立定量构效关系模型定量预测甜味分子的甜度。研究发现,随机森林算法模型的分类效果最好,接受者操作特性曲线下的面积为0.987,准确度为0.966;随机森林回归模型的甜度预测效果最好,决定系数为0.82,误差均方根为0.60。联用这两个模型在食品成分数据库中,发现542个具有甜味剂潜力的食品分子。Sweet taste is one of the most important tastes in food flavor and quality. Sweet molecules that can be used to produce new sweeteners have also been actively explored in food processing. However, the traditional methods cannot meet the rapid development of the economy and market demand, due mainly to time-consuming, laborious, and inefficient methods. Therefore, an effective and reliable strategy is essential to produce the sweet stuff. Currently, machine learning and structure-activity relationship can be utilized to realize accurate predictions of sweet molecules in the food industry. In this study, a new database of sweeteners and non-sweeteners together with the scores of sweetness was established using molecular sweetness and structure-activity correlation between molecular structures. MOE software was selected to compute molecular descriptors, to fully characterize the properties of molecules. These descriptors were then filtered through neighborhood variance screening, collinearity removal, and principal component contribution rate screening. Specifically, the feature descriptors were screened by removing the descriptors with high correlation. 80% of the dataset was then divided into training sets for model construction, and 20% were divided into test sets for model validation. Random forest and support vector machines were utilized to establish a qualitative structure-activity relationship for the prediction and identification of potential sweet molecules. Evaluation indexes were taken as the area under the receiver characteristic curve(AUC) and accuracy rate.The higher the AUC and accuracy rate represented the better classification. As such, the optimal model was obtained.Subsequently, the principal component, K-nearest neighbor, random forest, and partial least squares regression were used to establish the quantitative structure-activity relationship for better prediction of sweet molecules. The determination coefficient R;and Root Mean Square Error(RMSE) were used as evaluation indexes of the quantita
关 键 词:机器学习 甜味剂 预测 定性构效关系 定量构效关系
分 类 号:TS202.3[轻工技术与工程—食品科学]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.3