What is the expected duplication rate for ATAC-seq data?

Modified on Thu, 3 Dec, 2020 at 10:38 PM

Duplicate reads are generally the result of PCR duplicates during library preparation, and in the pre-processing steps those reads are marked and removed. Since insertion of the Tn5 enzyme is random, PCR duplicates represent artifacts and can bias downstream analyses. A low duplication rate (< 5-10%) is optimal, and while slightly higher numbers are still acceptable (< 20-30%), higher percentages can suggest problems at the sample preparation step.

Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Let us know how can we improve this article!

Select at least one of the reasons
CAPTCHA verification is required.

Feedback sent

We appreciate your effort and will try to fix the article