机构地区:[1]福建农林大学蜂学与生物医药学院,福州350002 [2]天然生物毒素国家地方联合工程实验室,福州350002 [3]福建省蜂疗研究所,福州350002
出 处:《昆虫学报》2024年第2期183-192,共10页Acta Entomologica Sinica
基 金:国家自然科学基金项目(32172792,31702190);国家现代农业产业技术体系专项资金项目(CARS-44-KXJ7);福建省自然科学基金面上项目(2022J01131334);福建农林大学硕士生导师团队项目(郭睿);福建农林大学科技创新专项基金(KFb22060XA);福建农林大学动物科学学院(蜂学学院)科研扶持项目(郭睿,付中民);福建省大学生创新创业训练计划项目(202310389027,S202310389076)。
摘 要:【目的】通过纳米孔(nanopore)测序技术组装和注释中华蜜蜂Apis cerana cerana工蜂幼虫肠道高质量全长转录组。【方法】采用Nanopore PromethION系统对蜜蜂球囊菌Ascosphaera apis接种的中华蜜蜂工蜂3日龄幼虫后的4, 5和6日龄幼虫肠道(分别为AcT4, AcT5和AcT6)进行转录组测序,鉴定全长转录本序列;将前期未接种蜜蜂球囊菌中华蜜蜂工蜂4, 5和6日龄幼虫肠道转录组纳米孔测序数据中鉴定到的全长转录本与上述鉴定到的全长转录本混合后滤除冗余全长转录本;将鉴定到的非冗余全长转录本比对Nr, KOG, eggNOG和GO数据库进行注释。采用CPC, CNCI, CPAT和Pfam 4种方法预测长链非编码RNA (long non-coding RNA, lncRNA)。【结果】AcT4, AcT5和AcT6分别测得14 474 634, 10 461 827和11 890 978条原始读段(raw reads),分别包含11 898 582, 8 630 186和9 091 035条全长转录本,去冗余后分别鉴定到27 815, 21 781和20 004条非冗余全长转录本,N50长度分别为1 900, 1 961和2 294 bp,平均长度分别为1 534, 1 584和1 792 bp,最长读段长度分别为10 855, 10 837和10 887 bp。鉴定到40 562条去非冗余全长转录本,分别有35 415, 24 646, 34 054和23 053条转录本可分别注释到Nr, KOG, eggNOG和GO数据库。在Nr数据库中注释全长转录本数目和占比最高的物种是东方蜜蜂A.cerana(20 310条,57.35%),其次为西方蜜蜂A.mellifera(4 686条转录本,占13.23%)、大蜜蜂A.dorsata(2 536条转录本,占7.16%)和小蜜蜂A.florea(2 079条转录本,占5.87%)。非冗余全长转录本可注释到eggNOG数据库中的未知功能及翻译后修饰、蛋白质更新和分子伴侣等25个功能分类、KOG数据库中的仅一般功能预测和信号转导机制等25个功能分类、GO数据库中生物学进程、细胞组分和分子功能三大类中的50个功能条目以及KEGG数据库中核糖体和RNA转运等196条通路。共鉴定到2 301条高可信度lncRNA,涉及正义链lncRNA、反义链lncRNA、内含子lncRNA和基【Aim】To assemble and annotate the high-quality full-length transcriptome of the larval gut of Apis cerana cerana workers through nanopore sequencing technology.【Methods】The 3-day-old larvae of A.c.cerana workers were inoculated with Ascosphaera apis and the transcriptomes of the gut of the 4-,5-and 6-day-old larvae(AcT4,AcT5 and AcT6,respectively)were sequenced by Nanopore PromethION system to identify the full-length transcript sequences.The previously identified full-length transcripts in the nanopore sequencing gut transcriptome data of the 4-,5-and 6-day-old larvae of A.c.cerana workers uninoculated with A.apis were mixed with the above obtained full-length transcripts in this study to remove the redundant full-length transcripts.The identified non-redundant full-length transcripts were aligned to the Nr,KOG,eggNOG and GO databases for annotations.Four methods including CPC,CNCI,CPAT and Pfam were used to predict long non-coding RNAs(lncRNAs).【Results】From AcT4,AcT5 and AcT6,14474634,10461827 and 11890978 raw reads were obtained,including 11898582,8630186 and 9091035 full-length transcripts,27815,21781 and 20004 non-redundant full-length transcripts were identified after de-redundancy,the N50 lengths of the non-redundant full-length transcripts were 1900,1961 and 2294 bp with the average lengths of 1534,1584 and 1792 bp,and the longest read lengths of 10855,10837 and 10887 bp,respectively.A total of 40562 non-redundant full-length transcripts were identified,and 35415,24646,34054 and 23053 transcripts could be annotated to the Nr,KOG,eggNOG,and GO databases,respectively.The species with the highest number and proportion of annotated full-length transcripts was A.cerana(20310 transcripts,accounting for 57.35%),followed by A.mellifera(4686 transcripts,accounting for 13.23%),A.dorsata(2536 transcripts,accounting for 7.16%)and A.florea(2079,accounting for 5.87%)in the Nr database.The non-redundant full-length transcripts were annotated to 25 functional categories such as unknown function and post-tran
关 键 词:东方蜜蜂 中华蜜蜂 蜜蜂球囊菌 肠道 全长转录组 长链非编码RNA 第三代测序技术 纳米孔测序
分 类 号:S891[农业科学—特种经济动物饲养]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...