Identifies the miss-assembly errors in contigs based on the paired-end-read distribution and ameliorates the performance of scaffolding results. PECC has four major functions: i) mapping paired-end reads, ii) locating candidate error regions, iii) validating candidate error regions and iv) identifying the clipping boundary. This algorithm removes sequence regions with lower paired-end reads, supports and tests them by using the distribution of paired-end supports.
School of Information Science and Engineering, Central South University, Changsha, China; Department of Computer Science, Georgia State University, Atlanta, GA, USA; Department of Mechanical Engineering and Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, SK, Canada
PECC funding source(s)
Supported in part by the National Science Fund for Excellent Young Scholars under Grant No. 61622213 and the National Natural Science Foundation of China under grant No.61232001, No. 61420106009, No.61379108 and No.61370172.