csaw: removing PCR duplicates using dedup flag
1
0
Entering edit mode
@asifzubair-6770
Last seen 8.2 years ago

Hi, 

csaw allows one to remove PCR duplicates when doing read counting across windows. My question is how is this functionality actually executed.

I generally mark duplicates using Picard Tools or Samblaster and I was of the view that csaw would remove these reads when doing read counting. However, I was wondering if csaw calls some other program (or its own method) internally for marking and removing duplicates. 

It would be great to get some insight into this. 

Thank you, 
Best, 

Asif 

csaw dedup • 1.3k views
ADD COMMENT
2
Entering edit mode
@james-w-macdonald-5106
Last seen 3 days ago
United States

I think section 2.2.2 of the csaw User's guide covers this pretty clearly. In addition, ?readParam has this as the first (!) paragraph under 'Removing low-quality or irrelevant reads':

     Marked duplicate reads will be removed with  dedup=TRUE . This may
     be necessary when many rounds of PCR have been performed during
     library preparation. However, it is not recommended for routine
     counting as it will interfere with the downstream statistical
     methods. Note that the duplicate field must be set beforehand in
     the BAM file for this argument to have any effect.

 

ADD COMMENT
0
Entering edit mode

Sure. I think I might have missed this, but thank you for pointing it out. So, csaw relies on bams in which duplicates have already been marked.

ADD REPLY

Login before adding your answer.

Traffic: 351 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6