基于多层基因网络的关键基因识别算法  

Key gene identification algorithm based on multi⁃layer network

在线阅读下载全文

作  者:魏丕静 刘晶晶 赵永敏 苏延森 郑春厚[3] WEI Pijing;LIU Jingjing;ZHAO Yongmin;SU Yansen;ZHENG Chunhou(Institutes of Physical Science and Information Technology,Anhui University,Hefei 230601,China;School of Computer Science and Technology,Anhui University,Hefei 230601,China;School of Artificial Intelligence,Anhui University,Hefei 230601,China)

机构地区:[1]安徽大学物质科学与信息技术研究院,合肥230601 [2]安徽大学计算机科学与技术学院,合肥230601 [3]安徽大学人工智能学院,合肥230601

出  处:《生物信息学》2023年第4期277-285,共9页Chinese Journal of Bioinformatics

基  金:国家重点研发计划项目(No.2021YFE0102100);安徽省自然科学基金青年项目(No.2108085QF267,No.2008085QF294)。

摘  要:疾病关键基因可用于疾病诊断、预测和新药或新疗法有效性的评价,故识别与疾病紧密相关的关键基因十分重要。然而现在有些疾病样本数据较少,传统基于大样本的关键基因挖掘方法不适用于该类数据。本文针对含少量样本数据的疾病,首先利用单样本网络构建方法构建每个疾病样本的个体化基因网络,并通过建立基因间的层间联系构建多层基因网络。然后利用基于张量的多层网络中心性方法评估每层网络中基因间的相互作用以及层间影响,对基因进行重要性打分,识别疾病关键基因。最后将该方法应用到哮喘数据集上,并与经典算法进行比较,结果表明,利用该方法所识别的已获批准的药物靶标基因的排名较优;对所得到的新的潜在关键基因TP53、PUS10、MAP3K1等进行功能和通路富集分析,结果表明其与哮喘有紧密关联。Critical genes of diseases can be used to diagnose diseases,predict and evaluate the effectiveness of new drugs or new therapies,so it is very important to identify critical genes closely related to diseases.However,the samples of some diseases are limited.It is difficult to apply the traditional methods based on large sample data to mine critical genes for these diseases.In this paper,for diseases with small amount of samples,we first construct sample⁃specific network for each sample with the single⁃sample network constructing methods,and construct a multi⁃layer gene network by estabishing inter⁃layer connections between genes.A tensor⁃based multi⁃layer network centrality approach is then used to assess the interactions between genes in each layer of the network and the inter⁃layer effects to score the genes for importance and identify disease key genes.Finally,the method is used to two asthma datasets and compared with the classical algorithm.The results show that compared with other methods,the approved drug target genes rank higher in the gene rankings obtained by this method.And function and pathway enrichment analysis of the new potential critical genes TP53,PUS10,MAP3K1,etc.indicat that they were closely related to asthma.

关 键 词:多层基因网络 随机游走 节点中心性 关键基因 

分 类 号:Q343.1[生物学—遗传学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象