基于马铃薯转录组数据的病毒组装软件比较  

Comparing of Three Softwares for Virus Genome Assembly Based on Potato Transcriptome Data

在线阅读下载全文

作  者:涂振 钟子旸 孟繁烨 邹莹 李佳炜 余涛 郑经涛 夏军辉 张舒[3] 聂碧华[1] 

机构地区:[1]华中农业大学园艺林学学院,湖北 武汉 [2]恩施州农业科学院,湖北 恩施 [3]湖北省农业科学院植保土肥研究所,湖北 武汉

出  处:《计算生物学》2022年第3期40-48,共9页Hans Journal of Computational Biology

摘  要:随着高通量测序技术的成熟和成本的降低,转录组数据呈现爆发式增长。转录组数据中除了包含寄主马铃薯自身的转录本以外,还可能包含寄主受到RNA病毒侵染而带来的病毒序列信息,因此可以低成本地从转录组数据中进行病毒基因组挖掘。本研究通过比较SOAPdenovo、IDBA-UD、Trinity 三种主流软件对RNA-seq数据的组装效果,发现Trinity软件组装得到的结果中序列信息最丰富,且长序列最多,但组装过程耗时较长;相对而言,SOAPdenovo和IDBA-UD耗时较短,但组装结果中序列信息较少且长序列较少,所以推荐使用Trinity软件进行基于转录组数据的病毒基因组组装。With the continuous maturity of high-throughput sequencing technology and the reduction of cost, transcriptome data show explosive growth. The potato transcriptome data not only contains the transcripts of the potato itself, but also contains the viral sequence information caused by the infection of viruses in the sample, so the virus genome mining can be carried out from the transcriptome data. In this study, the assembly results of three mainstream software (SOAPdenovo, IDBA-UD and Trinity3) were compared based on the same RNA-seq data, it was found that Trinity software resulted the most abundant sequence information and the longest sequences, but the assembly process took a long time;meanwhile, SOAPdenovo and IDBA-UD cost a relatively short time, but generated less sequence information and shorter sequences in the assembly results. Thus, it is recom-mended to use Trinity software to assemble virus genome based on transcriptome data.

关 键 词:SOAP 主流软件 基因组挖掘 数据呈现 序列信息 高通量测序技术 长序列 RNA病毒 

分 类 号:TP393.08[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象