Duplicate reads are generally the result of PCR duplicates during library preparation, and in the pre-processing steps those reads are marked and removed. Since insertion of the Tn5 enzyme is random, PCR duplicates represent artifacts and can bias downstream analyses. A low duplication rate (< 5-10%) is optimal, and while slightly higher numbers are still acceptable (< 20-30%), higher percentages can suggest problems at the sample preparation step.
What is the expected duplication rate for ATAC-seq data? Print
Modified on: Thu, 3 Dec, 2020 at 10:38 PM
Did you find it helpful? Yes No
Send feedbackSorry we couldn't be helpful. Help us improve this article with your feedback.