Sequential analysis of genome of Thelazia callipaeda (结膜吸吮线虫基因组序列特征研究)  Cited by: 4


Authors: ZHANG Lu-fei [1,2], WANG Ling-jun [1,2], ZHENG Ming-hui [1,2], CAO Jian-ping [3], LIU Hui [1,2]

Affiliations: [1] Department of Parasitology, Zunyi Medical College, Zunyi 563000, China; [2] Special Key Laboratory of Gene Detection & Therapy of Guizhou Provincial Department of Education, China; [3] National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention

Source: Chinese Journal of Schistosomiasis Control (《中国血吸虫病防治杂志》), 2018, No. 3, pp. 312-316 (5 pages)

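Funding: National Natural Science Foundation of China (81560336; 81760373); Science and Technology Fund of the Guizhou Provincial Department of Science and Technology (黔科合基础[2016]1168); Innovation Team of the Guizhou Provincial Department of Education (黔合教人才团队字[2014]39号); Doctoral Start-up Fund of Zunyi Medical College (F-795)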

Abstract: Objective To characterize the genome sequence of Thelazia callipaeda (T. cp). Methods Genes were predicted from the T. cp genome assembly by ab initio prediction with GeneMark and GeneID and by homology-based prediction with GeMoMa. The predictions were integrated with EVM to obtain the full gene set of the genome. The resulting gene sequences were annotated against public databases and three dedicated databases (CAZyme, TCDB and PHI). Results Gene structure analysis of the scaffolds and contigs of the T. cp genome (79.34 Mb) yielded 6 333 genes. BLASTx searches against the public databases annotated 97.85% of the genes; the NR database annotated the most genes (98.69%) and KEGG pathways the fewest (50.50%). Analysis against the KOG database identified 4 517 functional genes. The CAZyme, TCDB and PHI databases annotated 136, 139 and 1 498 genes, respectively, with PHI annotating the most. In addition, a dedicated cytochrome database predicted 238 cytochrome P450 genes (the English abstract of this record gives 258). Conclusion This study provides a preliminary description of the structural characteristics and annotation of the T. cp genome, with 6 333 genes identified.

Keywords: Thelazia callipaeda; Genome; Sequence analysis; Functional gene; Cytochrome P450

Classification: R383.19 [Medicine and Health: Medical Parasitology]

 
