De novo transcriptome assembly of RNA-Seq reads with different strategies  被引量:4

De novo transcriptome assembly of RNA-Seq reads with different strategies

在线阅读下载全文

作  者:CHEN Geng YIN KangPing WANG Charles SHI TieLiu 

机构地区:[1]Center for Bioinformatics and Computational Biology, Institute of Biomedical Sciences, School of Life Science, East China Normal University, Shanghai 200241, China [2]Functional Genomics Core, Beckman Research Institute, City of Hope Comprehensive Cancer Center, Duarte, CA 91010, USA

出  处:《Science China(Life Sciences)》2011年第12期1129-1133,共5页中国科学(生命科学英文版)

基  金:supported by the National Basic Research Program of China (Grant Nos. 2010CB945401, 2007CB108800);National Natural Science Foundation of China (Grant Nos. 30870575, 31071162,31000590);the Science and Technology Commission of Shanghai Municipality (Grant No. 11DZ2260300)

摘  要:De novo transcriptome assembly is an important approach in RNA-Seq data analysis and it can help us to reconstruct the transcriptome and investigate gene expression profiles without reference genome sequences.We carried out transcriptome assemblies with two RNA-Seq datasets generated from human brain and cell line,respectively.We then determined an efficient way to yield an optimal overall assembly using three different strategies.We first assembled brain and cell line transcriptome using a single k-mer length.Next we tested a range of values of k-mer length and coverage cutoff in assembling.Lastly,we combined the assembled contigs from a range of k values to generate a final assembly.By comparing these assembly results,we found that using only one k-mer value for assembly is not enough to generate good assembly results,but combining the contigs from different k-mer values could yield longer contigs and greatly improve the overall assembly.De novo transcriptome assembly is an important approach in RNA-Seq data analysis and it can help us to reconstruct the tran- scriptome and investigate gene expression profiles without reference genome sequences. We carried out transcriptome assem- blies with two RNA-Seq datasets generated from human brain and cell line, respectively. We then determined an efficient way to yield an optimal overall assembly using three different strategies. We first assembled brain and cell line transcriptome using a single k-mer length. Next we tested a range of values of k-mer length and coverage cutoff in assembling. Lastly, we com- bined the assembled contigs from a range of k values to generate a final assembly. By comparing these assembly results, we found that using only one k-mer value for assembly is not enough to generate good assembly results, but combining the contigs from different k-mer values could yield longer contigs and greatly improve the overall assembly.

关 键 词:RNA-SEQ de novo transcriptome assembly next generation sequencing 

分 类 号:TP18[自动化与计算机技术—控制理论与控制工程] TE973.4[自动化与计算机技术—控制科学与工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象