Hi,
csaw allows one to remove PCR duplicates when doing read counting across windows. My question is how is this functionality actually executed.
I generally mark duplicates using Picard Tools or Samblaster and I was of the view that csaw would remove these reads when doing read counting. However, I was wondering if csaw calls some other program (or its own method) internally for marking and removing duplicates.
It would be great to get some insight into this.
Thank you,
Best,
Asif
Sure. I think I might have missed this, but thank you for pointing it out. So, csaw relies on bams in which duplicates have already been marked.