Authors: Yu Donghua[1], Guo Maozu[1,2,3], Liu Xiaoyan, Cheng Shuang[4]
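Affiliations: [1] School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001 [2] School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 100044 [3] Beijing Key Laboratory of Intelligent Processing for Building Big Data (Beijing University of Civil Engineering and Architecture), Beijing 100044 [4] Institute of Materials, China Academy of Engineering Physics, Mianyang, Sichuan 621900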
Source: Journal of Computer Research and Development (《计算机研究与发展》), 2019, No. 9, pp. 1881-1888 (8 pages)
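Funding: National Natural Science Foundation of China (61871020, 61571164, 61671189); Key Project of the Science and Technology Plan of the Beijing Municipal Education Commission (KZ201810016019); Fundamental Research Funds for Municipal Universities, Beijing University of Civil Engineering and Architecture (X18197, X18198, X18203)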
Abstract: Predicting drug-target interactions is an important aid to drug discovery and design, but validating interactions by biological experiment is costly and time-consuming. Querying databases to validate predicted interactions is therefore an important way to assess prediction methods. Based on three databases (KEGG, DrugBank, and ChEMBL), this paper designs and develops DTcheck (drug-target check), a query and validation method that uses a web crawler to retrieve information and efficiently queries and validates drug-target pairs supplied as a KEGG DRUG ID and a KEGG GENES ID. DTcheck also includes an ID mapping function that maps UniProt IDs from DrugBank and ChEMBL to KEGG GENES IDs. Using DTcheck, the four standard datasets Enzyme, IC (ion channel), GPCR (G-protein-coupled receptor), and NR (nuclear receptor) are expanded with 907, 766, 458, and 40 new drug-target interactions, respectively. In addition, taking the BLM (bipartite local models) method as an example and combining it with DTcheck validation, the analysis of prediction results shows that Top N evaluation is more reasonable than the AUC (area under curve) value for assessing drug-target interaction prediction methods, and that the BLMd method, despite its lower AUC, outperforms the higher-AUC BLMmax method at predicting new drug-target interactions.
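The abstract's claim that a higher AUC does not guarantee better Top N results can be illustrated with a small sketch. This is not the paper's evaluation code: the two score lists, the labels, and the function names `auc` and `top_n_hits` are hypothetical, and the labels simply stand in for "interaction confirmed by a database query".

```python
def auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative count as half a correct pair.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def top_n_hits(scores, labels, n):
    """Number of confirmed interactions among the n highest-scoring pairs."""
    ranked = sorted(zip(scores, labels), key=lambda t: t[0], reverse=True)
    return sum(y for _, y in ranked[:n])


# Hypothetical toy data: 10 candidate drug-target pairs, 3 of them confirmed.
labels = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]

# Method A ranks every confirmed pair high, but one unconfirmed pair is on top.
scores_a = [0.9, 0.8, 0.7, 0.95, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1]

# Method B puts two confirmed pairs on top but ranks the third one last.
scores_b = [0.99, 0.98, 0.01, 0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3]

for name, scores in (("A", scores_a), ("B", scores_b)):
    print(name, round(auc(scores, labels), 3), top_n_hits(scores, labels, 2))
```

On this toy data, method A has the higher AUC (18/21 against 14/21) yet only one confirmed pair in its top 2, while method B has two. When only the top-ranked predictions are followed up, Top N can therefore order methods differently from AUC.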
Keywords: drug-target interaction prediction; query and validation; drug-target dataset; AUC evaluation; Top N evaluation
Classification: TP391 [Automation and Computer Technology: Computer Application Technology]