Comparative Analysis of H1N1 Influenza Virus Hemagglutinin Sequences by Different Feature Descriptions  (Cited by: 4)


Authors: Li Weiwei[1], Li Yang[1], Tang Xuqing[1]

Affiliation: [1] School of Science, Jiangnan University, Wuxi 214122, Jiangsu, China

Source: Life Science Research (《生命科学研究》), 2016, No. 2, pp. 119-124 (6 pages)

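Funding: National Natural Science Foundation of China (11371174); Fundamental Research Funds for the Central Universities (JUSRP51317B); Graduate Research and Innovation Program of Jiangsu Higher Education Institutions (1145210232141170)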

Abstract: Building on the traditional 40-dimensional feature vector used to characterize protein sequences, the 40-dimensional vector was decomposed into three sub-feature vectors of 20, 4 and 16 dimensions according to the types and physicochemical properties of amino acids. Using hemagglutinin (HA) protein sequences from 33 H1N1 influenza viruses together with statistical correlation analysis, correlations were analyzed between every pair of protein sequences and between the different sub-feature vectors of each sequence. The protein sequences were highly correlated with one another. For each sequence, the 20-dimensional sub-vector was not significantly correlated with either of the other two sub-vectors, whereas the 4- and 16-dimensional sub-vectors were significantly correlated. The 33 HA protein sequences were then classified according to the different feature vectors, and the classifications based on the 40-dimensional and 16-dimensional feature vectors were highly consistent. Therefore, provided that the characterization of virus sequence features is not affected, the existing 40-dimensional feature vector of protein sequences can be replaced by the 16-dimensional feature vector to reduce computational complexity.
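The abstract does not define the individual components of the 40-dimensional vector, so the following is only a minimal sketch of one plausible reading: 20 amino acid frequencies, 4 frequencies of physicochemical classes, and 16 frequencies of adjacent class pairs. The four-class grouping, all function names, and the toy sequences are assumptions made for illustration; they are not taken from the paper and the sequences are not real HA records. The sketch also shows only the correlation between two sequences; how the paper correlates sub-vectors of different lengths is not stated in the abstract and is not attempted here.

```python
from itertools import product
from math import sqrt

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# ASSUMPTION: a common four-class grouping by side-chain properties
# (nonpolar, polar uncharged, acidic, basic); the paper's grouping may differ.
CLASSES = {
    "N": "AVLIPFWMG",
    "P": "STCYNQ",
    "A": "DE",
    "B": "KRH",
}
RESIDUE_CLASS = {aa: c for c, members in CLASSES.items() for aa in members}
CLASS_PAIRS = ["".join(p) for p in product(CLASSES, repeat=2)]  # 16 ordered pairs


def composition_20(seq):
    """Relative frequency of each of the 20 standard amino acids."""
    residues = [aa for aa in seq if aa in RESIDUE_CLASS]
    return [residues.count(aa) / len(residues) for aa in AMINO_ACIDS]


def class_composition_4(seq):
    """Relative frequency of each of the four residue classes."""
    classes = [RESIDUE_CLASS[aa] for aa in seq if aa in RESIDUE_CLASS]
    return [classes.count(c) / len(classes) for c in CLASSES]


def class_pair_16(seq):
    """Relative frequency of each ordered pair of classes at adjacent positions."""
    classes = [RESIDUE_CLASS[aa] for aa in seq if aa in RESIDUE_CLASS]
    pairs = [a + b for a, b in zip(classes, classes[1:])]
    return [pairs.count(p) / len(pairs) for p in CLASS_PAIRS]


def feature_40(seq):
    """Concatenate the 20-, 4- and 16-dimensional sub-vectors."""
    return composition_20(seq) + class_composition_4(seq) + class_pair_16(seq)


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


# Made-up toy fragments, for illustration only.
seq_a = "MKVLLAGDERSTNQHKWYFCPIMGAVLDE"
seq_b = "MKVLIAGDKRSTNQHRWYFCPLMGAVIDE"

r_40 = pearson(feature_40(seq_a), feature_40(seq_b))
r_16 = pearson(class_pair_16(seq_a), class_pair_16(seq_b))
print(f"40-dim correlation: {r_40:.3f}, 16-dim correlation: {r_16:.3f}")
```

With a full set of sequences, the pairwise correlations from the 40-dimensional and 16-dimensional descriptions could each be fed to a hierarchical clustering routine and the resulting groupings compared, which is the kind of consistency check the abstract reports.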

Keywords: H1N1 influenza virus; amino acid classification; feature vector; correlation analysis; hierarchical clustering

Classification codes: Q71 [Biology: Molecular Biology]; O29 [Science: Applied Mathematics]

 
