Evolutionary annotation of conserved long non-coding RNAs in major mammalian species (cited by: 3)

Authors: BU DeChao, LUO HaiTao, JIAO Fei, FANG ShuangSang, TAN ChengFu, LIU ZhiYong, ZHAO Yi

Affiliations: [1] Key Lab of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences; [2] University of Chinese Academy of Sciences; [3] Department of Biochemistry and Molecular Biology, Binzhou Medical College

Source: Science China (Life Sciences) [中国科学(生命科学英文版)], 2015, Issue 8, pp. 787-798 (12 pages)

Funding: Supported by the Training Program of the Major Research Plan of the National Natural Science Foundation of China (91229120)

Abstract: Mammalian genomes contain tens of thousands of long non-coding RNAs (lncRNAs) that have been implicated in diverse biological processes. However, the lncRNA transcriptomes of most mammalian species have not been established, limiting the evolutionary annotation of these novel transcripts. Based on RNA sequencing data from six tissues of nine species, we built comprehensive lncRNA catalogs (4,142-42,558 lncRNAs) covering the major mammalian species. Compared with protein-coding RNAs, the expression of lncRNAs exhibits striking lineage specificity. Notably, although 30%-99% of human lncRNAs are conserved across different species at the DNA locus level, only 20%-27% of these conserved lncRNA loci show detectable transcription, in stark contrast to the proportion for conserved protein-coding genes (48%-80%). This finding provides a valuable resource for experimental scientists studying the mechanisms of lncRNAs. Moreover, we constructed lncRNA expression phylogenetic trees across nine mammals and demonstrated that lncRNA expression profiles can reliably determine phylogenetic placement in a manner similar to their coding counterparts. Our data also reveal that the evolutionary rate of lncRNA expression varies among tissues and is significantly higher than that of protein-coding genes. To streamline browsing lncRNAs and determining their evolutionary status, we integrated all the data produced in this study into a database named PhyloNONCODE (http://www.bioinfo.org/phyloNoncode). Our work begins to place mammalian lncRNAs in an evolutionary context and represents a rich resource for comparative and functional analyses of this critical layer of the genome.

Keywords: lncRNA; conservation; evolution

Classification code: Q953 [Biology: Zoology]

 
