Laws of Genome Nucleotide Composition  


Author: Zhang Zhang

Affiliations: [1] National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China; [2] Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; [3] University of Chinese Academy of Sciences, Beijing 100049, China

Source: Genomics, Proteomics & Bioinformatics (基因组蛋白质组与生物信息学报, English edition), 2024, Issue 4, pp. 7-10 (4 pages)

Funding: Supported by the National Natural Science Foundation of China (Grant No. 32030021); the National Key R&D Program of China (Grant No. 2023YFC2604400); and the International Partnership Program of Chinese Academy of Sciences (Grant No. 153F11KYSB20160008).

Abstract: Genome nucleotide composition, one of the most important sequence characteristics at the genome-wide level, is usually expressed in terms of the proportions of the four bases in a DNA molecule as well as their combinations. Decades of study have shown that genomes of different species are highly variable in their nucleotide composition [1,2]; for example, guanine-plus-cytosine (GC) content ranges widely, from ~20% to ~80% [3]. A body of empirical evidence has further accumulated that heterogeneity of genome-wide nucleotide composition across species is closely associated with a variety of intrinsic and extrinsic factors, such as genome size [4], phylogeny [5], growth temperature [6], environment [7], origin of replication [8], bacterial land colonization [9], codon/amino acid usage [10], and natural selection [11]. In contrast, very few theoretical efforts have been devoted to exploring whether any law underlies such variable genome nucleotide composition across species. Theoretically, such law(s) would be desirable as a fundamental framework for better understanding genome composition dynamics, molecular evolution, genome organization, and synthetic biology. Building upon previous findings, here we propose three laws of genome nucleotide composition in a mathematical manner and demonstrate their effectiveness in formulating diverse genome nucleotide compositions in a large collection of complete genome sequences across the three domains of life.
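As a concrete illustration of the quantities the abstract refers to (the proportions of the four bases and their GC combination), the following is a minimal sketch, not the paper's method. The function names `base_composition` and `gc_content` are hypothetical, and the choice to ignore ambiguous symbols such as N is an assumption made here for simplicity. It does not implement the three proposed laws, which are not stated in this record.

```python
from collections import Counter


def base_composition(seq: str) -> dict[str, float]:
    """Return the proportion of A, C, G and T among unambiguous bases.

    Assumption: symbols other than A/C/G/T (e.g. N) are ignored,
    so the four proportions always sum to 1.
    """
    counts = Counter(seq.upper())
    total = sum(counts[b] for b in "ACGT")
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return {b: counts[b] / total for b in "ACGT"}


def gc_content(seq: str) -> float:
    """Return the guanine-plus-cytosine fraction of a sequence."""
    comp = base_composition(seq)
    return comp["G"] + comp["C"]
```

For a whole genome, the same counts would typically be accumulated across all chromosomes or contigs before dividing, rather than averaging per-sequence proportions.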

Keywords: BASES devoted formulate

Classification: Q811.4 [Biology: Bioengineering]

 
