Computational protocol: Genome Sequence and Annotation of Trichoderma parareesei, the Ancestor of the Cellulase Producer Trichoderma reesei

[…] The filamentous ascomycete Trichoderma reesei is used in the biotechnological industry for the production of cellulolytic and hemicellulolytic enzymes and recombinant proteins. In nature, T. reesei occurs almost exclusively in its sexual form on dead wood. Earlier isolates of putative T. reesei anamorphs from soil have been shown to belong to a sympatric sister species that is now named Trichoderma parareesei. Compared to T. reesei, the latter taxon has an entirely clonal lifestyle, is considerably more versatile in substrate utilization, and has enhanced mycoparasitic vigor.

We sequenced the genome of the type strain of T. parareesei, CBS 125925, using an Illumina-based whole-genome shotgun approach that delivered 366,865,176 paired reads with an approximate insert size of 350 bp. The acquired sequence reads were assembled into 1,123 contigs using Velvet v1.0.12 with a k-mer length of 75 nucleotides (nt). The resulting genome sequence has an estimated size of 32.0 Mb (N50, 68,608 bp; NMax, 286,763 bp; median coverage, 250.5) with a G+C content of 53.8%.

The genome assembly was repeat masked with RepeatMasker, and the protein-coding genes were predicted by combining ab initio and homology-based approaches. The training set combined self-training GeneMark-ES v2.3f predictions and homology-based protein alignments produced with Exonerate. This set was then used to train AUGUSTUS v.27 and SNAP, and the predictions were finally combined by EVidenceModeler (EVM) to yield 9,318 consensus gene models, 8,651 (93%) of which had orthologs in T. reesei. […]
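The assembly summary statistics quoted above (N50, NMax, G+C content) can be recomputed from any contig set. The following is a minimal sketch, not the authors' procedure: the function name `assembly_stats` and the toy contigs are hypothetical, and it assumes that NMax denotes the length of the longest contig and that G+C content is taken over unambiguous A/C/G/T bases only.

```python
from typing import Dict, List


def assembly_stats(contigs: List[str]) -> Dict[str, float]:
    """Summarize a list of contig sequences (hypothetical helper)."""
    lengths = sorted((len(c) for c in contigs), reverse=True)
    total = sum(lengths)

    # N50: length of the contig at which the running sum of lengths,
    # taken from longest to shortest, first reaches half the assembly size.
    running = 0
    n50 = 0
    for length in lengths:
        running += length
        if 2 * running >= total:
            n50 = length
            break

    # G+C content over unambiguous bases only (N and other codes are skipped).
    gc = sum(c.upper().count("G") + c.upper().count("C") for c in contigs)
    acgt = sum(sum(c.upper().count(b) for b in "ACGT") for c in contigs)

    return {
        "n_contigs": len(lengths),
        "total_bp": total,
        "n50": n50,
        # Assumption: NMax is interpreted as the longest contig.
        "nmax": lengths[0] if lengths else 0,
        "gc_percent": 100.0 * gc / acgt if acgt else 0.0,
    }


# Toy input, not real T. parareesei data.
toy = ["GGCCGGCCAT", "ATGCAT", "GGCC", "AT"]
stats = assembly_stats(toy)
```

In practice the contig strings would be read from the assembler's FASTA output rather than typed inline. The reported ortholog share follows from the same kind of arithmetic: 8,651 / 9,318 is about 92.8%, which rounds to the stated 93%.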

Pipeline specifications

Software tools: Velvet, RepeatMasker, GeneMark, EVM
Application: Genome annotation
Organisms: Trichoderma reesei