Transcriptome Analysis for Medicinal Plant Anisodus tanguticus  Cited by: 4


Authors: ZHANG Yu; XIA Ming-Ze; ZHANG Fa-Qi[1]

Affiliations: [1] Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xi’ning 810001 [2] College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100039

Source: Bulletin of Botanical Research (《植物研究》), 2020, No. 3, pp. 458-467 (10 pages)

Funding: Applied Basic Research Program of Qinghai Province (2019-ZJ-7042); National Natural Science Foundation of China (31110103911).

Abstract: To deepen understanding of the resource plant Anisodus tanguticus, we sequenced its transcriptome using high-throughput sequencing and obtained 71,463 unigenes after processing. The unigenes were compared against several databases for classification and functional annotation, and 47,624 were successfully annotated. Against the KOG protein database, 13,110 unigenes were annotated and assigned to 26 subclasses; against the NR database, 39,621 unigenes were annotated. GO functional annotations were derived from comparisons of the transcripts with Swiss-Prot and TrEMBL; the 29,309 unigenes annotated in this way fell into three main categories (molecular function, biological process, and cellular component) and 62 subcategories. With the KEGG database as reference, 3,679 unigenes were annotated, and the pathways they participate in fell into four categories: metabolism, genetic information processing, cellular processes, and environmental information processing. Metabolism-related pathways were the most numerous, accounting for about half of all pathways. Statistics on the metabolic pathways of the medicinally active constituents of A. tanguticus, and on the number and types of related unigenes, showed that alkaloid-related pathways were the most numerous, while terpenoids and phenylpropanoids had the largest numbers of corresponding unigenes. In addition, 31,382 SNP loci and six SSR repeat types were detected. Mononucleotide repeats were the most common SSR type, occurring at 56.52 SSRs per million bases and accounting for 45.30%. These results enrich the transcriptome data available for A. tanguticus, lay a foundation for molecular biology research on the species, and should support its rational conservation, development, and utilization.
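The counts reported in the abstract can be put on a common scale by expressing each as a share of the 71,463 unigenes. The following is a minimal sketch of that arithmetic, not part of the authors' pipeline; the dictionary labels and the function name `annotation_rates` are illustrative. The last calculation assumes that the 45.30% figure is the mononucleotide share of all detected SSRs, which the abstract does not state explicitly.

```python
# Counts taken from the abstract; labels are illustrative, not the authors' own.
TOTAL_UNIGENES = 71_463

annotated = {
    "any database": 47_624,
    "NR": 39_621,
    "GO": 29_309,
    "KOG": 13_110,
    "KEGG": 3_679,
}


def annotation_rates(counts, total):
    """Return each count as a percentage of `total`, rounded to two decimals."""
    return {name: round(100 * n / total, 2) for name, n in counts.items()}


rates = annotation_rates(annotated, TOTAL_UNIGENES)
for name, pct in rates.items():
    print(f"{name:>12}: {pct:6.2f}% of unigenes")

# Assumption (not stated in the abstract): 45.30% is the mononucleotide
# share of all SSRs, so dividing the density by the share gives the
# implied overall SSR density per million bases.
MONO_SSR_PER_MB = 56.52
MONO_SHARE = 0.4530
implied_total_ssr_per_mb = MONO_SSR_PER_MB / MONO_SHARE
print(f"implied total SSR density: {implied_total_ssr_per_mb:.2f} per Mb")
```

Under these figures, roughly two thirds of the unigenes received at least one annotation, with NR contributing the most and KEGG the fewest.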

Keywords: Anisodus tanguticus; transcriptome; high-throughput sequencing; gene annotation

Classification number: Q949.777.7 [Biology - Botany]

 
