检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]陕西师范大学物理学与信息技术学院,西安710119 [2]厦门大学物理系,厦门361005
出 处:《中国科学:物理学、力学、天文学》2015年第5期10-17,共8页Scientia Sinica Physica,Mechanica & Astronomica
基 金:国家自然科学基金(批准号:11147020);中央高校基本科研业务费专项资金(编号:GK201102028)资助项目
摘 要:本文研究了细菌的蛋白质多肽组分统计特征与基因组GC(Guanine+Cytosine)含量的相关性,发现当多肽长度较小时多肽组分特异性与GC含量存在着很强的关联;随着多肽长度增加,上述关联发生突变,关联迅速丧失.这一结果表明,基于组分特异性确定细菌亲缘关系的方法的确给出了不同于GC含量的信息,从而能实现有效分类.In the past decades, a lot of methods have been proposed to construct Genome Tree. Among them, K-String Composition Approach which is Alignment-Free shows nonnegligible superiority. On the other hand, the species specificity of GC (Guanine+Cytosine)-content which actually is the lowest-order version of K-String Composition has been discovered for a long time, especially in bacteria. Unfortunately, its resolution is too poor to be applied to reconstruct phylogeny. Motivated by those facts, in this paper, relationship between composition vector of peptides and GC-content of corresponding DNA sequence is studied for bacteria. A strong correlation is uncovered for short peptides, and with the increase of peptide length the correlation exhibits an abrupt change, that is, tends to vanish quickly. These results indicate that the composition vector of longer peptide do contains more precise information of species specificity than that of GC-content, and therefore can effectively measure the genetic relationship of bacteria. Short peptides are obviously not competent.
关 键 词:种系基因组学 非序列比对 多肽组分矢量 GC含量
分 类 号:R394[医药卫生—医学遗传学]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.4