杜仲雌雄株转录组测序数据组装及基因功能注释  被引量:22

Transcriptome data assembly and gene function annotation of female and male plants in Eucommia ulmoides

在线阅读下载全文

作  者:赵德刚[1,2] 李岩[1,2] 赵懿琛[3] 赵丹[1] 吕立堂[1] 刘世会[3] 宋莉[1] 董璇[1,2] 冯怡[1] 

机构地区:[1]贵州大学生命科学学院/农业生物工程研究院山地植物资源保护与种质创新省部共建教育部重点实验室,贵州贵阳550025 [2]贵州大学绿色农药与农业生物工程国家重点实验室培育基地,贵州贵阳550025 [3]贵州大学药学院,贵州贵阳550025

出  处:《山地农业生物学报》2015年第1期1-12,共12页Journal of Mountain Agriculture and Biology

基  金:国家863计划"特色植物功能基因组学研究与应用"子课题任务"杜仲功能基因组研究与应用"(2013AA102605-05);国家自然科学基金"杜仲抗真菌蛋白基因克隆与功能研究"(No.31360272)资助

摘  要:以10年以上树龄的杜仲雌株当年新发枝条上的幼果、嫩芽、叶片和树皮和雄株新发枝条上嫩芽、叶片和树皮为材料,采用Illumina Hi SeqTM2000高通量测序技术进行转录组测序,获得雌株51,574,000条、雄株52,430,502条Clean Reads数据,分别包含总长度为4,641,660,000nt和4,718,745,180nt核苷酸序列数据信息;经拼接组装,获得雌株基因信息长达69,461,730nt的423,339个Contig片段,获得雄株基因信息长达94,814,201nt的542,383个Contig片段;经进一步拼接,分别获得平均长度为288nt的雌株159,434个Unigene片段和平均长度为231nt的雄株257,288个Unigene片段,共有48,761个表达序列标签(EST)。以BLAST(E-value1.0E-5)将Unigene对NR、NT、KEGG和COG数据库进行比对,获得CDS序列35,541条,再通过ESTscan分析获得CDS片段13,220条,共获得48,761条CDS片段。与NR数据库比对发现杜仲雌、雄株转录组Unigene与葡萄相似序列最多(33.8%),其次是蓖麻(11.4%)和杨树(11.2%),与拟南芥的相似序列仅2.3%;根据Unigene与COG数据库比对结果,可将有COG功能的7,571条Unigene分为24类,而根据GO数据库注释,杜仲转录组有GO功能注释的23,314条Unigene可分为生物过程、细胞组分和分子功能3大类55分支。与KEGG数据库比对,杜仲雌、雄株转录组17,468条Ungenes分属128类代谢通路,其中有2,399条属于次生物质代谢途径,314条参与萜类化合物生物合成途径。The Illumina Hi SeqTM2000 high-throughput sequencing technology was used to establish two transcriptome libraries by used the RNA pools which were isolated from female plants( young leaves,young bark and young fruits) and male plants( young buds,young leaves,young bark) in Eucommia ulmoides respectively. An average length of 90 nt and the total of 51. 6 million clean reads for female plant and 52. 4 million clean reads for male plant were generated, which respectively produced159,434 Unigene with a mean length 288 nt for female plant and 257,288 Unigene with a mean length of 231 nt for male plant. These Unigene were annotated using BLAST( E-value 1. 0E-5) against the NR,NT,Swiss Prot,Kyoto encyclopedia of genes and genomes( KEGG) and Clusters of orthologous groups( COG). All-Unigene blast against NR,the top 3 most similarity were 33. 77% within Vitis vinifera,11. 38% within Ricinus communis and 11. 18% within Populus trichocarpa. It was2. 79% low similarity within Arabidopsis lyrata subsp. Lyrata. To compare the Unigene and the COG datbase,the 7,571 Unigene in transcriotome of Eucommia ulmoides female and male plants were divided into 24 classes acconding to the function. The 23,314 Unigene GO function were annotated biological processes,cellular components and molecular function categories of 55 branches. The KEGG database as a reference,17,468 Ungenes in the transcriptome could be divided into 128 classes metabolic pathway. Of these,2,399 Unigene were found to be related to biosynthesis of secondary metabolites,and 314 were involved in terpenoid biosynthesis.

关 键 词:杜仲 转录组 基因功能 

分 类 号:Q785.786[生物学—分子生物学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象