Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite gene clusters

dc.contributor.authorDallery, Jean-Félix
dc.contributor.authorLapalu, Nicolas
dc.contributor.authorZampounis, Antonios
dc.contributor.authorPigné, Sandrine
dc.contributor.authorLuyten, Isabelle
dc.contributor.authorAmselem, Joëlle
dc.contributor.authorWittenberg, Alexander H. J.
dc.contributor.authorZhou, Shiguo
dc.contributor.authorQueiroz, Marisa V. de
dc.contributor.authorRobin, Guillaume P.
dc.contributor.authorAuger, Annie
dc.contributor.authorHainaut, Matthieu
dc.contributor.authorHenrissat, Bernard
dc.contributor.authorKim, Ki-Tae
dc.contributor.authorLee, Yong-Hwan
dc.contributor.authorLespinet, Olivier
dc.contributor.authorSchwartz, David C.
dc.contributor.authorThon, Michael R.
dc.contributor.authorRichard J. O’Connell
dc.date.accessioned2017-11-28T13:31:38Z
dc.date.available2017-11-28T13:31:38Z
dc.date.issued2017-08-29
dc.description.abstractThe ascomycete fungus Colletotrichum higginsianum causes anthracnose disease of brassica crops and the model plant Arabidopsis thaliana. Previous versions of the genome sequence were highly fragmented, causing errors in the prediction of protein-coding genes and preventing the analysis of repetitive sequences and genome architecture. Here, we re-sequenced the genome using single-molecule real-time (SMRT) sequencing technology and, in combination with optical map data, this provided a gapless assembly of all twelve chromosomes except for the ribosomal DNA repeat cluster on chromosome 7. The more accurate gene annotation made possible by this new assembly revealed a large repertoire of secondary metabolism (SM) key genes (89) and putative biosynthetic pathways (77 SM gene clusters). The two mini-chromosomes differed from the ten core chromosomes in being repeat- and AT-rich and gene-poor but were significantly enriched with genes encoding putative secreted effector proteins. Transposable elements (TEs) were found to occupy 7% of the genome by length. Certain TE families showed a statistically significant association with effector genes and SM cluster genes and were transcriptionally active at particular stages of fungal development. All 24 subtelomeres were found to contain one of three highly- conserved repeat elements which, by providing sites for homologous recombination, were probably instrumental in four segmental duplications. The gapless genome of C. higginsianum provides access to repeat-rich regions that were previously poorly assembled, notably the mini-chromosomes and subtelomeres, and allowed prediction of the complete SM gene repertoire. It also provides insights into the potential role of TEs in gene and genome evolution and host adaptation in this asexual pathogen.en
dc.formatpdfpt-BR
dc.identifier.issn14712164
dc.identifier.urihttps://www.ncbi.nlm.nih.gov/pubmed/28851275
dc.identifier.urihttp://www.locus.ufv.br/handle/123456789/13887
dc.language.isoengpt-BR
dc.publisherBMC Genomicspt-BR
dc.relation.ispartofseries18(1):667, Aug 2017pt-BR
dc.rightsOpen Accesspt-BR
dc.subjectFungal genomept-BR
dc.subjectSMRT sequencingpt-BR
dc.subjectOptical mappt-BR
dc.subjectTransposable elementspt-BR
dc.subjectSecondary metabolism genespt-BR
dc.subjectSubtelomerespt-BR
dc.subjectSegmental duplicationpt-BR
dc.subjectAccessory chromosomespt-BR
dc.subjectColletotrichum higginsianumpt-BR
dc.titleGapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite gene clustersen
dc.typeArtigopt-BR

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
document.pdf
Size:
4.18 MB
Format:
Adobe Portable Document Format
Description:
Texto completo

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections