Artificial duplicate reads in sequencing data of 454 Genome Sequencer FLX System  被引量:1

Artificial duplicate reads in sequencing data of 454 Genome Sequencer FLX System

在线阅读下载全文

作  者:Hui Dong Yangyi Chen Yan Shen Shengyue Wang Guoping Zhao Weirong Jin 

机构地区:[1]Chinese National Human Genome Center at Shanghai, Shanghai 201203, China [2]National Engineering Center for Biochip at Shanghai, Shanghai 201203, China

出  处:《Acta Biochimica et Biophysica Sinica》2011年第6期496-500,共5页生物化学与生物物理学报(英文版)

摘  要:The 454 Genome Sequencer (GS) FLX System is one of the next-generation sequencing systems featured by long reads, high accuracy, and ultra-high throughput. Based on the mechanism of emulsion PCR, a unique DNA tem- plate would only generate a unique sequence read after being amplified and sequenced on GS FLX. However, biased amplification of DNA templates might occur in the process of emulsion PCR, which results in production of artificial duplicate reads. Under the condition that each DNA template is unique to another, 3.49%-18.14% of total reads in GS FLX-sequencing data were found to be artificial duplicate reads. These duplicate reads may lead to misunderstanding of sequencing data and special atten- tion should be paid to the potential biases they introduced to the data.The 454 Genome Sequencer (GS) FLX System is one of the next-generation sequencing systems featured by long reads, high accuracy, and ultra-high throughput. Based on the mechanism of emulsion PCR, a unique DNA tem- plate would only generate a unique sequence read after being amplified and sequenced on GS FLX. However, biased amplification of DNA templates might occur in the process of emulsion PCR, which results in production of artificial duplicate reads. Under the condition that each DNA template is unique to another, 3.49%-18.14% of total reads in GS FLX-sequencing data were found to be artificial duplicate reads. These duplicate reads may lead to misunderstanding of sequencing data and special atten- tion should be paid to the potential biases they introduced to the data.

关 键 词:duplicate reads 454 GS FLX System emulsion PCR 

分 类 号:Q987[生物学—遗传学] TP311.13[生物学—人类学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象