机构地区:[1]State Key Laboratory of Protein and Plant Gene Research,College of Life Sciences,Peking University [2]National Center for Plant Gene Research(Beijing)
出 处:《Journal of Integrative Plant Biology》2013年第7期576-585,共10页植物学报(英文版)
基 金:supported by grants from the China National Basic Research Program(2010CB126002);the National Natural Science Foundation of China(90717009);the 111 Project funded by the Chinese Ministry of Education
摘 要:We assembled a total of 297,239 Gossypium hirsutum (Gh, a tetraploid cotton, AADD) expressed sequence tag (EST) sequences that were available in the National Center for Biotechnology Information database, with reference to the recently published G. raimondii (Gr, a diploid cotton, DD) genome, and obtained 49,125 UniGenes. The average lengths of the U niGenes were increased from 804 and 791 bp in two previous EST assemblies to 1,019 bp in the current analysis. The number of putative cotton UniGenes with lengths of 3 kb or more increased from 25 or 34 to 1,223. As a result, thousands of originally independent G. hirsutum ESTs were aligned to produce large contigs encoding transcripts with very long open reading frames, indicating that the G. raimondii genome sequence provided remarkable advantages to assemble the tetraploid cotton transcriptome. Significant different distribution patterns within several GO terms, including transcription factor activity, were observed between D- and A-derived assemblies. Tran- scriptome analysis showed that, in a tetraploid cotton cell, 29,547 UniGenes were possibly derived from the D subgenome while another 19,578 may come from the A subgenome. Finally, some of the in silico data were confirmed by reverse transcription polymerase chain reaction experiments to show the changes in transcript levels for several gene families known to play key role in cotton fiber development. We believe that our work provides a useful platform for functional and evolutionary genomic studies in cotton.We assembled a total of 297,239 Gossypium hirsutum (Gh, a tetraploid cotton, AADD) expressed sequence tag (EST) sequences that were available in the National Center for Biotechnology Information database, with reference to the recently published G. raimondii (Gr, a diploid cotton, DD) genome, and obtained 49,125 UniGenes. The average lengths of the U niGenes were increased from 804 and 791 bp in two previous EST assemblies to 1,019 bp in the current analysis. The number of putative cotton UniGenes with lengths of 3 kb or more increased from 25 or 34 to 1,223. As a result, thousands of originally independent G. hirsutum ESTs were aligned to produce large contigs encoding transcripts with very long open reading frames, indicating that the G. raimondii genome sequence provided remarkable advantages to assemble the tetraploid cotton transcriptome. Significant different distribution patterns within several GO terms, including transcription factor activity, were observed between D- and A-derived assemblies. Tran- scriptome analysis showed that, in a tetraploid cotton cell, 29,547 UniGenes were possibly derived from the D subgenome while another 19,578 may come from the A subgenome. Finally, some of the in silico data were confirmed by reverse transcription polymerase chain reaction experiments to show the changes in transcript levels for several gene families known to play key role in cotton fiber development. We believe that our work provides a useful platform for functional and evolutionary genomic studies in cotton.
关 键 词:Cotton fiber deep sequencing expressed sequence tag assembly functional genomics Gossypium.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...