基于高通量测序技术的杭白芷(Angelica dahurica)根转录组数据分析  被引量:4

High-throughput Transcriptome Sequencing of Roots of Angelica dahurica and Data Analyses

在线阅读下载全文

作  者:吴萍[1] 郭俊霞[1] 王晓宇[1] 李青苗[1] 张松林 方清茂[1] Wu Ping;Guo Junxia;Wang Xiaoyu;Li Qingmiao;Zhang Songlin;Fang Qingmao(Sichuan Academy of Traditional Chinese Medicine Sciences,Chengdu,610041)

机构地区:[1]四川省中医药科学院,成都610041

出  处:《分子植物育种》2020年第10期3207-3216,共10页Molecular Plant Breeding

基  金:四川省科技厅2018年基本科研业务专项(A2018N13)资助。

摘  要:为获得杭白芷转录组信息特征,本研究利用Illumina HiSeqⅩTen测序平台对杭白芷根进行高通量转录组测序,获得高质量序列(Clean reads)47742445条,Trinity denovo组装后得到47044条Unigenes,平均长度1164.20 nt。BLAST分析显示分别有32208(68.46%)、23049(48.99%)、10479(22.27%)、17883(38.01%)、28201(59.95%)、20731(44.07%)、55(0.12%)条Unigenes在数据库NR、Swiss-Prot、KEGG、KOG、eggNOG、GO、Pfam中获得注释,可归为GO分类的生物过程、细胞组分和分子功能3大类57分支,涉及205个KEGG代谢通路,其中包括27个次生代谢通路。蛋白编码框序列32303个,高等植物转录因子58个家族,借助MISA软件发现10020个SSR,其中二碱基重复最丰富,有4336个,出现频率为43.27%;五碱基重复SSR最少仅占0.37%。本研究获得了大量基因序列信息以及SSR信息,为今后开展相关分子机制研究提供了数据资源和理论基础。To order to obtain the transcriptome information characteristics of Angelica dahurica,the root transcriptome dataset of Angelica dahurica was obtained using the high-throughput sequencing platform Illumina HiSeqⅩTen.A great number of 47742445 high quality Clean reads were obtained by the transcriptome sequencing analyses.Using Trinity denovo assembling,a total of 47044 Unigenes were finally obtained,with an average length of 1164.20 nt.BLAST analysis indicated that 32208(accounting 68.46%of the total Unigenes),23049(48.99%),10479(22.27%),17883(38.01%),28201(59.95%),20731(44.07%),55(0.12%)Unigenes were successfully annotated in the NR,Swiss-Prot,KEGG,KOG,gNOG,GO,and Pfam databases,respectively.And GO classification contained the basic three major groups,including biological process,cellular component,and molecular function with 57 subgroups.A total of 205 KEGG metabolic pathways were designated,27 of which were defied as the secondary metabolism.Of all Unigenes,32303 were predictedr to have CDS,and 58 families of plant transcription factors were also identified.Using MISA prediction,10020 simple sequence repeats(SSRs)were obtained,amongwhich the di-nucleotide SSRswere abundant with 4336(43.27%),whereas the penta-nucleotide SSRs accounted for 0.37%.In this study,rich sequence information of gene,SSR as well as transposon information of Angelica dahurica is helpful to carry out the research of the molecular mechanism of phorbol ester biosynthesis in Angelica dahurica in the future.

关 键 词:杭白芷 转录组 功能基因 代谢通路 简单重复序列 

分 类 号:S567.239[农业科学—中草药栽培]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象