基于双加权投票的蛋白质功能预测  

Prediction of Protein Functions Based on Bi-weighted Vote

在线阅读下载全文

作  者:唐家琪 吴璟莉[1,2,3] 廖元秀 王金艳[1,2,3] TANG Jia-qi;WU Jing-li;LIAO Yuan-xiu;WANG Jin-yan(School of Computer Science & Information Engineering,Guangxi Normal University,Guilin,Guangxi 541004,China;Guangxi Key Laboratory of Multi-Source Information Mining & Safety,Guangxi Normal University,Guilin,Guangxi 541004,China;Guangxi Regional Multi-Source Information Integration & Intelligent ProcessingCooperation Innovation Center,Guilin,Guangxi 541004,China)

机构地区:[1]广西师范大学计算机科学与信息工程学院,广西桂林541004 [2]广西师范大学广西多源信息挖掘与安全重点实验室,广西桂林541004 [3]广西区域多源信息集成与智能处理协同创新中心,广西桂林541004

出  处:《计算机科学》2019年第4期222-227,共6页Computer Science

基  金:国家自然科学基金项目(61762015;61502111;61662007;61763003);广西自然科学基金项目(2015GXNSFAA139288);"八桂学者"工程专项;广西科技基地和人才专项(AD16380008)资助

摘  要:蛋白质是完成重要生物活动所必需的分子。准确掌握蛋白质功能,将对生命科学研究及应用起到极大的促进作用。高通量技术的发展产生了海量的蛋白质序列,利用计算技术预测大规模蛋白质功能已成为当今生物信息学的核心任务之一。目前,作为蛋白质功能预测的研究热点,基于蛋白质相互作用网络的预测方法在降低数据噪声影响、充分利用网络拓扑特性及整合多源数据等方面仍不够完善。文中结合带阻力随机游走得到的全局拓扑相似度,及功能术语的语义相似度,设计了一种双加权投票蛋白质功能预测算法BiWV;并在此基础上整合了生物通路信息,提出了带生物通路的双加权投票算法——BiWV-P。在酿酒酵母和人类数据集上,对所提算法与TMC,UBiRW和ProHG 3种算法的预测效果进行对比分析。实验结果显示,算法BiWV和BiWV-P能够有效预测蛋白质功能,并在许多数据集上获得较其他算法更高的微正确率与微F1。Proteins are the essential molecules to accomplish important biological activities.It will greatly promote the advance of life science research and application to accurately grasp their functions.A tremendous amount of protein sequences has been generated with the development of high-throughput techniques.Thus,prediction of large-scale protein functions with computation technology has become one of the key tasks in bioinformatics today.Currently,the prediction method based on protein-protein interaction network,which is a research hotspot of protein function prediction,still has shortcomings at such aspects as reducing the impact of data noise,making full use of network topology characteristics,integrating multi-source data,and so on.In this paper,the Bi-Weighted Vote(BIWV) algorithm was proposed to predict protein functions,which combines the global topological similarity produced by Random Walk with Resistance (RWS) and the semantic similarity between terms.In addition,the Bi-Weighted Vote algorithm with pathway (BiWV-P) was presented by integrating the information of biological pathway.By using the data sets of saccharomyces cerevi-siae and homo sapiens,experiments were performed to compare TMC,UBiRW,ProHG,BiWV and BiWV-P.The experimental results indicate that BiWV algorithm and BiWV-P algorithm can predict protein functions effectively,and achieve higher micro-accuracy and micro-F1 than other algorithms in many data sets.

关 键 词:蛋白质相互作用网络 功能预测 随机游走 语义相似度 生物通路 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象