检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:王新颖[1] 张锦晖[1] 王丹丹[1] 陈海群[1] 周永文
机构地区:[1]常州大学环境与安全工程学院,江苏常州213164 [2]亚什兰(中国)投资有限公司,上海201108
出 处:《计算机与应用化学》2014年第6期732-736,共5页Computers and Applied Chemistry
基 金:常州市国际科技合作计划项目(CZ20120015);产学研联合创新资金-前瞻性联合研究项目(BY2013024-04)
摘 要:为提高脂肪醇化合物对梨形四膜虫急性毒性的预测精度,提出基于定量结构-活性关系(QSAR)原理的脂肪醇化合物对梨形四膜虫急性毒性预测方法。运用遗传算法筛选出5种分子描述符作为变量,采用多元线性回归方法和最小二乘-支持向量机方法建立基于该5种分子描述符的脂肪醇化合物对梨形四膜虫急性毒性的预测模型。对所建立的模型进行内部验证和外部验证,两种模型的复相关系数、留一法交互验证系数分别为0.984、0.979和0.985、0.982,对外部预测样本的复相关系数和外部测试集交互验证系数分别为0.978、0.977和0.979、0.979。结果表明,所建QSAR模型均具有较好的稳健性、预测能力和泛化性能。LS-SVM模型在精度上略优于ML-R模型,而MLR模型更为简单和方便。In order to improve the accuracy of predicting acute toxicities of fatty alcohol compounds to tetrahymena pyriformis,a method based on quantitative structure-activity relationship (QSAR) was proposed.Genetic algorithm (GA) was employed to select five descriptors that have significant contributions to the acute toxieities of fatty alcohol to tetrahymena pyriformis.These five descriptors then were used to build the models by multiple linear regression (MLR) and least square support vector machine (LS-SVM) methods.The statistical results indicate that the multiple correlation coefficient and cross validation using leave-one-out were 0.984,0.979 and 0.985,0.982,respectively.To validate the predictive power of the resulting models,external validation multiple correlation coefficient and cross validation were 0.978,0.977 and 0.979,0.979,respectively.The satisfactory results indicate both the models have high reliability,strong predictive power and fine generalization ability.The model established by LS-SVM is superior to that built by MLR,while the latter one is more simple and convenient.
关 键 词:定量结构-活性相关 脂肪醇 遗传算法 多元线性回归 最小二乘-支持向量机
分 类 号:TQ015.9[化学工程] TP391.9[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.249