基于粗粒化的流感病毒蛋白进化树构建  被引量:1

Construction of Phylogenetic Tree of Flu Virus Proteins Based on Coarse Graining

在线阅读下载全文

作  者:李阳[1] 唐旭清[1] 

机构地区:[1]江南大学理学院,无锡214122

出  处:《模式识别与人工智能》2016年第10期936-942,共7页Pattern Recognition and Artificial Intelligence

基  金:国家自然科学基金项目(No.11371174);国际科技合作研究项目(No.2011DFR70500)资助~~

摘  要:在127 065条血凝素、神经氨酸酶流感病毒蛋白基础上,提出基于粗粒化的病毒蛋白进化树的构建方法.首先基于病毒蛋白序列特征,给出序列间相似性度量,提取流感病毒系统层次递阶结构,并定义层次聚类指标,确定最佳聚类数.然后基于距离中心最近的原则提取流感病毒系统代表.最后采用距离度量构造流感病毒进化树.实验表明,相同流感病毒具有宿主相同、时间跨度较小、爆发地点相近,更倾向于处于相同分支的特点,这与已有的文献吻合,因此该方法有利于挖掘病毒变异轨迹.Based on the coarse graining theory, a method for constructing phylogenetic tree of flu virus proteins is proposed by combining total 127 065 hemagglutinin and neuraminidase protein sequences. Firstly, to determine the appropriate granularity, a feature vector is obtained to present a virus protein sequence and then an approach is given to construct hierarchical structure of virus system by analyzing similarity among multi-protein sequences. The suitable number of clusters is determined according to hierarchical evaluation index based on the system structure. Furthermore, on the basis of the nearest-to-center principle, the significant viruses can be selected to represent characteristics of the whole class. Finally, the phylogenetic tree is established through the distance metric. The test result indicates that the influenza viruses with same host, similar time span, close outbreak location and same names are more likely to belong to the same branch. The results are identical with that of the existing literature on flu virus. The results provide a foundation for investigating the mutation, evolution and prediction of flu viruses.

关 键 词:流感病毒 进化树 粗粒化 结构聚类 大数据处理 

分 类 号:R373[医药卫生—病原生物学] Q811.4[医药卫生—基础医学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象