Validity of peptide composition and GC-content for classifying bacteria


Authors: Li Jingke (李静珂)[1], Jin Tao (金涛)[1], Zhao Hong (赵鸿)[2]

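Affiliations: [1] School of Physics and Information Technology, Shaanxi Normal University, Xi'an 710119; [2] Department of Physics, Xiamen University, Xiamen 361005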

Source: Scientia Sinica Physica, Mechanica & Astronomica, 2015, No. 5, pp. 10-17 (8 pages)

Funding: National Natural Science Foundation of China (Grant No. 11147020); Fundamental Research Funds for the Central Universities (Grant No. GK201102028)

Abstract: Many methods have been proposed over the past decades for constructing genome trees. Among them, the alignment-free K-string composition approach shows notable advantages. The species specificity of GC (guanine + cytosine) content, which is effectively the lowest-order version of K-string composition, has long been known, especially in bacteria, but its resolution is too poor for reconstructing phylogeny. Motivated by these facts, this paper studies, for bacteria, the relationship between the composition vector of the peptides in their proteins and the GC content of the corresponding genomic DNA. A strong correlation is found for short peptides. As peptide length increases, the correlation changes abruptly and quickly vanishes. These results indicate that the composition vector of longer peptides does carry information distinct from, and more precise than, GC content, and can therefore effectively measure the genetic relatedness of bacteria and support their classification. Short peptides are not adequate for this purpose.
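The two quantities compared in the abstract can be illustrated with a minimal sketch. It assumes plain-string DNA and protein sequences and computes only raw values: the GC fraction of a DNA sequence and the relative frequencies of overlapping length-K peptides. The function names `gc_content` and `kstring_frequencies` are hypothetical, and the sketch does not reproduce the paper's composition-vector definition or its correlation measure, neither of which the abstract specifies.

```python
from collections import Counter
from typing import Dict


def gc_content(dna: str) -> float:
    """Fraction of G and C among the A/C/G/T bases of a DNA sequence."""
    dna = dna.upper()
    gc = dna.count("G") + dna.count("C")
    at = dna.count("A") + dna.count("T")
    total = gc + at  # ambiguous symbols such as N are ignored
    return gc / total if total else 0.0


def kstring_frequencies(protein: str, k: int) -> Dict[str, float]:
    """Relative frequencies of overlapping length-k peptides (K-strings).

    Illustrative assumption: raw frequencies only, with no background
    subtraction or other normalization.
    """
    protein = protein.upper()
    counts = Counter(protein[i:i + k] for i in range(len(protein) - k + 1))
    total = sum(counts.values())
    return {s: c / total for s, c in counts.items()} if total else {}
```

For K = 1 the result has at most 20 entries (one per amino acid), while the number of possible K-strings grows as 20^K. That growth is one intuitive reason longer peptides can carry more species-specific detail than a single scalar such as GC content.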

Keywords: phylogenomics; alignment-free; peptide composition vector; GC content

Classification No.: R394 [Medicine and Health: Medical Genetics]

 
